1
|
Bitran A, Park K, Serebryany E, Shakhnovich EI. Co-translational formation of disulfides guides folding of the SARS-CoV-2 receptor binding domain. Biophys J 2023; 122:3238-3253. [PMID: 37422697 PMCID: PMC10465708 DOI: 10.1016/j.bpj.2023.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 05/27/2023] [Accepted: 07/03/2023] [Indexed: 07/10/2023] Open
Abstract
Many secreted proteins, including viral proteins, contain multiple disulfide bonds. How disulfide formation is coupled to protein folding in the cell remains poorly understood at the molecular level. Here, we combine experiment and simulation to address this question as it pertains to the SARS-CoV-2 receptor binding domain (RBD). We show that the RBD can only refold reversibly if its native disulfides are present before folding. But in their absence, the RBD spontaneously misfolds into a nonnative, molten-globule-like state that is structurally incompatible with complete disulfide formation and that is highly prone to aggregation. Thus, the RBD native structure represents a metastable state on the protein's energy landscape with reduced disulfides, indicating that nonequilibrium mechanisms are needed to ensure native disulfides form before folding. Our atomistic simulations suggest that this may be achieved via co-translational folding during RBD secretion into the endoplasmic reticulum. Namely, at intermediate translation lengths, native disulfide pairs are predicted to come together with high probability, and thus, under suitable kinetic conditions, this process may lock the protein into its native state and circumvent highly aggregation-prone nonnative intermediates. This detailed molecular picture of the RBD folding landscape may shed light on SARS-CoV-2 pathology and molecular constraints governing SARS-CoV-2 evolution.
Collapse
Affiliation(s)
- Amir Bitran
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; PhD Program in Biophysics, Harvard University, Cambridge, Massachusetts.
| | - Kibum Park
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene Serebryany
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
2
|
Serebryany E, Zhao VY, Park K, Bitran A, Trauger SA, Budnik B, Shakhnovich EI. Systematic conformation-to-phenotype mapping via limited deep sequencing of proteins. Mol Cell 2023; 83:1936-1952.e7. [PMID: 37267908 PMCID: PMC10281453 DOI: 10.1016/j.molcel.2023.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 01/29/2023] [Accepted: 05/03/2023] [Indexed: 06/04/2023]
Abstract
Non-native conformations drive protein-misfolding diseases, complicate bioengineering efforts, and fuel molecular evolution. No current experimental technique is well suited for elucidating them and their phenotypic effects. Especially intractable are the transient conformations populated by intrinsically disordered proteins. We describe an approach to systematically discover, stabilize, and purify native and non-native conformations, generated in vitro or in vivo, and directly link conformations to molecular, organismal, or evolutionary phenotypes. This approach involves high-throughput disulfide scanning (HTDS) of the entire protein. To reveal which disulfides trap which chromatographically resolvable conformers, we devised a deep-sequencing method for double-Cys variant libraries of proteins that precisely and simultaneously locates both Cys residues within each polypeptide. HTDS of the abundant E. coli periplasmic chaperone HdeA revealed distinct classes of disordered hydrophobic conformers with variable cytotoxicity depending on where the backbone was cross-linked. HTDS can bridge conformational and phenotypic landscapes for many proteins that function in disulfide-permissive environments.
Collapse
Affiliation(s)
- Eugene Serebryany
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA.
| | - Victor Y Zhao
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA
| | - Kibum Park
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA
| | - Amir Bitran
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA
| | - Sunia A Trauger
- Center for Mass Spectrometry, Harvard University, Cambridge, MA 02138, USA
| | - Bogdan Budnik
- Center for Mass Spectrometry, Harvard University, Cambridge, MA 02138, USA
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
3
|
Bitran A, Park K, Serebryany E, Shakhnovich EI. Cotranslational formation of disulfides guides folding of the SARS COV-2 receptor binding domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.11.10.516025. [PMID: 36380756 PMCID: PMC9665344 DOI: 10.1101/2022.11.10.516025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Many secreted proteins contain multiple disulfide bonds. How disulfide formation is coupled to protein folding in the cell remains poorly understood at the molecular level. Here, we combine experiment and simulation to address this question as it pertains to the SARS-CoV-2 receptor binding domain (RBD). We show that, whereas RBD can refold reversibly when its disulfides are intact, their disruption causes misfolding into a nonnative molten-globule state that is highly prone to aggregation and disulfide scrambling. Thus, non-equilibrium mechanisms are needed to ensure disulfides form prior to folding in vivo. Our simulations suggest that co-translational folding may accomplish this, as native disulfide pairs are predicted to form with high probability at intermediate lengths, ultimately committing the RBD to its metastable native state and circumventing nonnative intermediates. This detailed molecular picture of the RBD folding landscape may shed light on SARS-CoV-2 pathology and molecular constraints governing SARS-CoV-2 evolution.
Collapse
|
4
|
Abstract
![]()
AlphaFold has burst into our lives. A powerful algorithm
that underscores
the strength of biological sequence data and artificial intelligence
(AI). AlphaFold has appended projects and research directions. The
database it has been creating promises an untold number of applications
with vast potential impacts that are still difficult to surmise. AI
approaches can revolutionize personalized treatments and usher in
better-informed clinical trials. They promise to make giant leaps
toward reshaping and revamping drug discovery strategies, selecting
and prioritizing combinations of drug targets. Here, we briefly overview
AI in structural biology, including in molecular dynamics simulations
and prediction of microbiota–human protein–protein interactions.
We highlight the advancements accomplished by the deep-learning-powered
AlphaFold in protein structure prediction and their powerful impact
on the life sciences. At the same time, AlphaFold does not resolve
the decades-long protein folding challenge, nor does it identify the
folding pathways. The models that AlphaFold provides do not capture
conformational mechanisms like frustration and allostery, which are
rooted in ensembles, and controlled by their dynamic distributions.
Allostery and signaling are properties of populations. AlphaFold also
does not generate ensembles of intrinsically disordered proteins and
regions, instead describing them by their low structural probabilities.
Since AlphaFold generates single ranked structures, rather than conformational
ensembles, it cannot elucidate the mechanisms of allosteric activating
driver hotspot mutations nor of allosteric drug resistance. However,
by capturing key features, deep learning techniques can use the single
predicted conformation as the basis for generating a diverse ensemble.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States.,Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland 21702, United States
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland 21702, United States
| |
Collapse
|
5
|
Statistical potentials from the Gaussian scaling behaviour of chain fragments buried within protein globules. PLoS One 2022; 17:e0254969. [PMID: 35085247 PMCID: PMC8794220 DOI: 10.1371/journal.pone.0254969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 10/28/2021] [Indexed: 11/19/2022] Open
Abstract
Knowledge-based approaches use the statistics collected from protein data-bank structures to estimate effective interaction potentials between amino acid pairs. Empirical relations are typically employed that are based on the crucial choice of a reference state associated to the null interaction case. Despite their significant effectiveness, the physical interpretation of knowledge-based potentials has been repeatedly questioned, with no consensus on the choice of the reference state. Here we use the fact that the Flory theorem, originally derived for chains in a dense polymer melt, holds also for chain fragments within the core of globular proteins, if the average over buried fragments collected from different non-redundant native structures is considered. After verifying that the ensuing Gaussian statistics, a hallmark of effectively non-interacting polymer chains, holds for a wide range of fragment lengths, although with significant deviations at short spatial scales, we use it to define a ‘bona fide’ reference state. Notably, despite the latter does depend on fragment length, deviations from it do not. This allows to estimate an effective interaction potential which is not biased by the presence of correlations due to the connectivity of the protein chain. We show how different sequence-independent effective statistical potentials can be derived using this approach by coarse-graining the protein representation at varying levels. The possibility of defining sequence-dependent potentials is explored.
Collapse
|
6
|
Validation of DBFOLD: An efficient algorithm for computing folding pathways of complex proteins. PLoS Comput Biol 2020; 16:e1008323. [PMID: 33196646 PMCID: PMC7704049 DOI: 10.1371/journal.pcbi.1008323] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 11/30/2020] [Accepted: 10/17/2020] [Indexed: 11/19/2022] Open
Abstract
Atomistic simulations can provide valuable, experimentally-verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein’s folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically-reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online, can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease. Many proteins must adopt a specific structure in order to function. Computational simulations have been used to shed light on the mechanisms of protein folding, but unfortunately, realistic simulations can typically only be run for small proteins, due to severe limits in computational speed. Here, we present a method to solve this problem, whereby instead of directly simulating folding from an unfolded state, we run simulations that allow for computation of equilibrium folding free energies, alongside high temperature simulations to compute unfolding rates. From these quantities, folding rates can be computed using detailed balance. Importantly, our method can account for the effects of nonnative contacts which transiently form during folding and must be broken prior to adoption of the native state. Such contacts, which are often excluded from simple models of folding, may crucially affect real protein folding pathways and are often observed in folding intermediates implicated in disease.
Collapse
|
7
|
Cotranslational folding allows misfolding-prone proteins to circumvent deep kinetic traps. Proc Natl Acad Sci U S A 2020; 117:1485-1495. [PMID: 31911473 DOI: 10.1073/pnas.1913207117] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Many large proteins suffer from slow or inefficient folding in vitro. It has long been known that this problem can be alleviated in vivo if proteins start folding cotranslationally. However, the molecular mechanisms underlying this improvement have not been well established. To address this question, we use an all-atom simulation-based algorithm to compute the folding properties of various large protein domains as a function of nascent chain length. We find that for certain proteins, there exists a narrow window of lengths that confers both thermodynamic stability and fast folding kinetics. Beyond these lengths, folding is drastically slowed by nonnative interactions involving C-terminal residues. Thus, cotranslational folding is predicted to be beneficial because it allows proteins to take advantage of this optimal window of lengths and thus avoid kinetic traps. Interestingly, many of these proteins' sequences contain conserved rare codons that may slow down synthesis at this optimal window, suggesting that synthesis rates may be evolutionarily tuned to optimize folding. Using kinetic modeling, we show that under certain conditions, such a slowdown indeed improves cotranslational folding efficiency by giving these nascent chains more time to fold. In contrast, other proteins are predicted not to benefit from cotranslational folding due to a lack of significant nonnative interactions, and indeed these proteins' sequences lack conserved C-terminal rare codons. Together, these results shed light on the factors that promote proper protein folding in the cell and how biomolecular self-assembly may be optimized evolutionarily.
Collapse
|
8
|
Ferina J, Daggett V. Visualizing Protein Folding and Unfolding. J Mol Biol 2019; 431:1540-1564. [PMID: 30840846 DOI: 10.1016/j.jmb.2019.02.026] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Revised: 02/14/2019] [Accepted: 02/18/2019] [Indexed: 12/27/2022]
Abstract
Protein folding/unfolding is a complicated process that defies high-resolution characterization by experimental methods. As an alternative, atomistic molecular dynamics simulations are now routinely employed to elucidate and magnify the accompanying conformational changes and the role of solvent in the folding process. However, the level of detail necessary to map the process at high spatial-temporal resolution provides an overwhelming amount of data. As more and better tools are developed for analysis of these large data sets and validation of the simulations, one is still left with the problem of visualizing the results in ways that provide insight into the folding/unfolding process. While viewing and interrogating static crystal structures has become commonplace, more and different approaches are required for dynamic, interconverting, unfolding, and refolding proteins. Here we review a variety of approaches, ranging from straightforward to complex and unintuitive for multiscale analysis and visualization of protein folding and unfolding.
Collapse
Affiliation(s)
- Jennifer Ferina
- Department of Bioengineering, Box 355013, University of Washington, Seattle, WA 98195-5013, USA
| | - Valerie Daggett
- Department of Bioengineering, Box 355013, University of Washington, Seattle, WA 98195-5013, USA.
| |
Collapse
|
9
|
Levartovsky Y, Shemesh A, Asor R, Raviv U. Effect of Weakly Interacting Cosolutes on Lysozyme Conformations. ACS OMEGA 2018; 3:16246-16252. [PMID: 31458260 PMCID: PMC6643829 DOI: 10.1021/acsomega.8b01289] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 11/15/2018] [Indexed: 06/10/2023]
Abstract
Exposure of a protein to cosolutes, like denaturants, changes its folding equilibrium. To determine the ensemble of protein conformations at equilibrium, in the presence of weakly interacting cosolutes, we present a two-stage analysis of solution X-ray scattering data. In the first stage, Guinier analysis and Kratky plot revealed information about the compactness and flexibility of the protein. In the second stage, elastic network contact model and coarse-grained normal mode analysis were used to generate an ensemble of conformations. The scattering curves of the conformations were computed and fitted to the measured scattering curves to get insights into the dominating folding states at equilibrium. Urea and guanidine hydrochloride (GuHCl) behaved as preferentially included weakly interacting cosolutes and induced denaturation of hen egg-white lysozyme, which served as our test case. The computed models adequately fit the data and gave ensembles of conformations that were consistent with our measurements. The analysis suggests that in the presence of urea, lysozyme retained its compactness and assumed molten globule characteristics, whereas in the presence of GuHCl lysozyme adopted random coiled conformations. Interestingly, no equilibrium intermediate states were observed in both urea and GuHCl.
Collapse
|
10
|
Wang Y, Tian P, Boomsma W, Lindorff-Larsen K. Monte Carlo Sampling of Protein Folding by Combining an All-Atom Physics-Based Model with a Native State Bias. J Phys Chem B 2018; 122:11174-11185. [DOI: 10.1021/acs.jpcb.8b06335] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Yong Wang
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Pengfei Tian
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Wouter Boomsma
- Department of Computer Science, University of Copenhagen, 2100 Copenhagen Ø, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
11
|
Aina A, Wallin S. Multisequence algorithm for coarse-grained biomolecular simulations: Exploring the sequence-structure relationship of proteins. J Chem Phys 2017; 147:095102. [DOI: 10.1063/1.4986933] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- A. Aina
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John’s, Newfoundland and Labrador A1B 3X7, Canada
| | - S. Wallin
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John’s, Newfoundland and Labrador A1B 3X7, Canada
| |
Collapse
|
12
|
Serebryany E, Woodard JC, Adkar BV, Shabab M, King JA, Shakhnovich EI. An Internal Disulfide Locks a Misfolded Aggregation-prone Intermediate in Cataract-linked Mutants of Human γD-Crystallin. J Biol Chem 2016; 291:19172-83. [PMID: 27417136 DOI: 10.1074/jbc.m116.735977] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Indexed: 11/06/2022] Open
Abstract
Considerable mechanistic insight has been gained into amyloid aggregation; however, a large number of non-amyloid protein aggregates are considered "amorphous," and in most cases, little is known about their mechanisms. Amorphous aggregation of γ-crystallins in the eye lens causes cataract, a widespread disease of aging. We combined simulations and experiments to study the mechanism of aggregation of two γD-crystallin mutants, W42R and W42Q: the former a congenital cataract mutation, and the latter a mimic of age-related oxidative damage. We found that formation of an internal disulfide was necessary and sufficient for aggregation under physiological conditions. Two-chain all-atom simulations predicted that one non-native disulfide in particular, between Cys(32) and Cys(41), was likely to stabilize an unfolding intermediate prone to intermolecular interactions. Mass spectrometry and mutagenesis experiments confirmed the presence of this bond in the aggregates and its necessity for oxidative aggregation under physiological conditions in vitro Mining the simulation data linked formation of this disulfide to extrusion of the N-terminal β-hairpin and rearrangement of the native β-sheet topology. Specific binding between the extruded hairpin and a distal β-sheet, in an intermolecular chain reaction similar to domain swapping, is the most probable mechanism of aggregate propagation.
Collapse
Affiliation(s)
- Eugene Serebryany
- From the Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139 and
| | - Jaie C Woodard
- the Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138
| | - Bharat V Adkar
- the Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138
| | - Mohammed Shabab
- From the Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139 and
| | - Jonathan A King
- From the Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139 and
| | - Eugene I Shakhnovich
- the Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138
| |
Collapse
|
13
|
Raval A, Piana S, Eastwood MP, Shaw DE. Assessment of the utility of contact-based restraints in accelerating the prediction of protein structure using molecular dynamics simulations. Protein Sci 2015; 25:19-29. [PMID: 26266489 PMCID: PMC4815320 DOI: 10.1002/pro.2770] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Revised: 08/07/2015] [Accepted: 08/11/2015] [Indexed: 12/15/2022]
Abstract
Molecular dynamics (MD) simulation is a well-established tool for the computational study of protein structure and dynamics, but its application to the important problem of protein structure prediction remains challenging, in part because extremely long timescales can be required to reach the native structure. Here, we examine the extent to which the use of low-resolution information in the form of residue-residue contacts, which can often be inferred from bioinformatics or experimental studies, can accelerate the determination of protein structure in simulation. We incorporated sets of 62, 31, or 15 contact-based restraints in MD simulations of ubiquitin, a benchmark system known to fold to the native state on the millisecond timescale in unrestrained simulations. One-third of the restrained simulations folded to the native state within a few tens of microseconds-a speedup of over an order of magnitude compared with unrestrained simulations and a demonstration of the potential for limited amounts of structural information to accelerate structure determination. Almost all of the remaining ubiquitin simulations reached near-native conformations within a few tens of microseconds, but remained trapped there, apparently due to the restraints. We discuss potential methodological improvements that would facilitate escape from these near-native traps and allow more simulations to quickly reach the native state. Finally, using a target from the Critical Assessment of protein Structure Prediction (CASP) experiment, we show that distance restraints can improve simulation accuracy: In our simulations, restraints stabilized the native state of the protein, enabling a reasonable structural model to be inferred.
Collapse
Affiliation(s)
- Alpan Raval
- D. E. Shaw Research, New York, New York, 10036
| | | | | | - David E Shaw
- D. E. Shaw Research, New York, New York, 10036.,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, 10032
| |
Collapse
|
14
|
Thermal stabilization of dihydrofolate reductase using monte carlo unfolding simulations and its functional consequences. PLoS Comput Biol 2015; 11:e1004207. [PMID: 25905910 PMCID: PMC4407897 DOI: 10.1371/journal.pcbi.1004207] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 02/20/2015] [Indexed: 12/28/2022] Open
Abstract
Design of proteins with desired thermal properties is important for scientific and biotechnological applications. Here we developed a theoretical approach to predict the effect of mutations on protein stability from non-equilibrium unfolding simulations. We establish a relative measure based on apparent simulated melting temperatures that is independent of simulation length and, under certain assumptions, proportional to equilibrium stability, and we justify this theoretical development with extensive simulations and experimental data. Using our new method based on all-atom Monte-Carlo unfolding simulations, we carried out a saturating mutagenesis of Dihydrofolate Reductase (DHFR), a key target of antibiotics and chemotherapeutic drugs. The method predicted more than 500 stabilizing mutations, several of which were selected for detailed computational and experimental analysis. We find a highly significant correlation of r = 0.65–0.68 between predicted and experimentally determined melting temperatures and unfolding denaturant concentrations for WT DHFR and 42 mutants. The correlation between energy of the native state and experimental denaturation temperature was much weaker, indicating the important role of entropy in protein stability. The most stabilizing point mutation was D27F, which is located in the active site of the protein, rendering it inactive. However for the rest of mutations outside of the active site we observed a weak yet statistically significant positive correlation between thermal stability and catalytic activity indicating the lack of a stability-activity tradeoff for DHFR. By combining stabilizing mutations predicted by our method, we created a highly stable catalytically active E. coli DHFR mutant with measured denaturation temperature 7.2°C higher than WT. Prediction results for DHFR and several other proteins indicate that computational approaches based on unfolding simulations are useful as a general technique to discover stabilizing mutations. All-atom molecular simulations have provided valuable insight into the workings of molecular machines and the folding and unfolding of proteins. However, commonly employed molecular dynamics simulations suffer from a limitation in accessible time scale, making it difficult to model large-scale unfolding events in a realistic amount of simulation time without employing unrealistically high temperatures. Here, we describe a rapid all-atom Monte Carlo simulation approach to simulate unfolding of the essential bacterial enzyme Dihydrofolate Reductase (DHFR) and all possible single point-mutants. We use these simulations to predict which mutants will be more thermodynamically stable (i.e., reside more often in the native folded state vs. the unfolded state) than the wild-type protein, and we confirm our predictions experimentally, creating several highly stable and catalytically active mutants. Thermally stable active engineered proteins can be used as a starting point in directed evolution experiments to evolve new functions on the background of this additional “reservoir of stability.” The stabilized enzyme may be able to accumulate a greater number of destabilizing yet functionally important mutations before unfolding, protease digestion, and aggregation abolish its activity.
Collapse
|
15
|
Shao Q. Important roles of hydrophobic interactions in folding and charge interactions in misfolding of α-helix bundle protein. RSC Adv 2015. [DOI: 10.1039/c4ra14265a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
An enhanced-sampling molecular dynamics simulation is presented to quantitatively demonstrate the important roles of hydrophobic and charge interactions in the folding and misfolding of α-helix bundle protein, respectively.
Collapse
Affiliation(s)
- Qiang Shao
- Drug Discovery and Design Center
- Shanghai Institute of Materia Medica
- Chinese Academy of Sciences
- Shanghai
- China
| |
Collapse
|
16
|
Berrondo M, Kaufmann S, Berrondo M. Automated Aufbau
of antibody structures from given sequences using Macromoltek's SmrtMolAntibody. Proteins 2014; 82:1636-45. [DOI: 10.1002/prot.24595] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Revised: 04/15/2014] [Accepted: 04/16/2014] [Indexed: 12/28/2022]
Affiliation(s)
| | | | - Manuel Berrondo
- Department of Physics and Astronomy; Brigham Young University; Provo Utah 84602
| |
Collapse
|
17
|
Duan LL, Zhu T, Zhang QG, Tang B, Zhang JZH. Electronic polarization stabilizes tertiary structure prediction of HP-36. J Mol Model 2014; 20:2195. [PMID: 24715046 PMCID: PMC3996369 DOI: 10.1007/s00894-014-2195-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 03/02/2014] [Indexed: 01/10/2023]
Abstract
Molecular dynamic (MD) simulations with both implicit and explicit solvent models have been carried out to study the folding dynamics of HP-36 protein. Starting from the extended conformation, the secondary structure of all three helices in HP-36 was formed in about 50 ns and remained stable in the remaining simulation. However, the formation of the tertiary structure was difficult. Although some intermediates were close to the native structure, the overall conformation was not stable. Further analysis revealed that the large structure fluctuation of loop and hydrophobic core regions was devoted mostly to the instability of the structure during MD simulation. The backbone root-mean-square deviation (RMSD) of the loop and hydrophobic core regions showed strong correlation with the backbone RMSD of the whole protein. The free energy landscape indicated that the distribution of main chain torsions in loop and turn regions was far away from the native state. Starting from an intermediate structure extracted from the initial AMBER simulation, HP-36 was found to generally fold to the native state under the dynamically adjusted polarized protein-specific charge (DPPC) simulation, while the peptide did not fold into the native structure when AMBER force filed was used. The two best folded structures were extracted and taken into further simulations in water employing AMBER03 charge and DPPC for 25 ns. Result showed that introducing polarization effect into interacting potential could stabilize the near-native protein structure.
Collapse
Affiliation(s)
- Li L Duan
- College of Physics and Electronics, Shandong Normal University, Jinan, 250014, China
| | | | | | | | | |
Collapse
|
18
|
Abstract
By focusing on essential features, while averaging over less important details, coarse-grained (CG) models provide significant computational and conceptual advantages with respect to more detailed models. Consequently, despite dramatic advances in computational methodologies and resources, CG models enjoy surging popularity and are becoming increasingly equal partners to atomically detailed models. This perspective surveys the rapidly developing landscape of CG models for biomolecular systems. In particular, this review seeks to provide a balanced, coherent, and unified presentation of several distinct approaches for developing CG models, including top-down, network-based, native-centric, knowledge-based, and bottom-up modeling strategies. The review summarizes their basic philosophies, theoretical foundations, typical applications, and recent developments. Additionally, the review identifies fundamental inter-relationships among the diverse approaches and discusses outstanding challenges in the field. When carefully applied and assessed, current CG models provide highly efficient means for investigating the biological consequences of basic physicochemical principles. Moreover, rigorous bottom-up approaches hold great promise for further improving the accuracy and scope of CG models for biomolecular systems.
Collapse
Affiliation(s)
- W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
19
|
van der Linden MG, Ferreira DC, de Oliveira LC, Onuchic JN, Pereira de Araújo AF. Ab initio protein folding simulations using atomic burials as informational intermediates between sequence and structure. Proteins 2013; 82:1186-99. [DOI: 10.1002/prot.24483] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Revised: 11/08/2013] [Accepted: 11/19/2013] [Indexed: 11/06/2022]
Affiliation(s)
- Marx Gomes van der Linden
- Departamento de Biologia Celular, Laboratório de Biologia Teórica e Computacional; Universidade de Brasília; Brasília-DF 70910-900 Brazil
| | - Diogo César Ferreira
- Departamento de Biologia Celular, Laboratório de Biologia Teórica e Computacional; Universidade de Brasília; Brasília-DF 70910-900 Brazil
| | - Leandro Cristante de Oliveira
- Departamento de Biologia Celular, Laboratório de Biologia Teórica e Computacional; Universidade de Brasília; Brasília-DF 70910-900 Brazil
- Departamento de Física; Instituto de Biociências, Letras e Ciências Exatas; UNESP - Univ Estadual Paulista; São José do Rio Preto-SP 15054-000 Brazil
| | - José N. Onuchic
- Center for Theoretical Biological Physics; Rice University; Houston Texas 77005
| | - Antônio F. Pereira de Araújo
- Departamento de Biologia Celular, Laboratório de Biologia Teórica e Computacional; Universidade de Brasília; Brasília-DF 70910-900 Brazil
| |
Collapse
|
20
|
Vajda S, Hall DR, Kozakov D. Sampling and scoring: a marriage made in heaven. Proteins 2013; 81:1874-84. [PMID: 23775627 DOI: 10.1002/prot.24343] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 05/14/2013] [Accepted: 05/31/2013] [Indexed: 12/11/2022]
Abstract
Most structure prediction algorithms consist of initial sampling of the conformational space, followed by rescoring and possibly refinement of a number of selected structures. Here we focus on protein docking, and show that while decoupling sampling and scoring facilitates method development, integration of the two steps can lead to substantial improvements in docking results. Since decoupling is usually achieved by generating a decoy set containing both non-native and near-native docked structures, which can be then used for scoring function construction, we first review the roles and potential pitfalls of decoys in protein-protein docking, and show that some type of decoys are better than others for method development. We then describe three case studies showing that complete decoupling of scoring from sampling is not the best choice for solving realistic docking problems. Although some of the examples are based on our own experience, the results of the CAPRI docking and scoring experiments also show that performing both sampling and scoring generally yields better results than scoring the structures generated by all predictors. Next we investigate how the selection of training and decoy sets affects the performance of the scoring functions obtained. Finally, we discuss pathways to better alignment of the two steps, and show some algorithms that achieve a certain level of integration. Although we focus on protein-protein docking, our observations most likely also apply to other conformational search problems, including protein structure prediction and the docking of small molecules to proteins.
Collapse
Affiliation(s)
- Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215
| | | | | |
Collapse
|
21
|
Kar P, Gopal SM, Cheng YM, Predeus A, Feig M. PRIMO: A Transferable Coarse-grained Force Field for Proteins. J Chem Theory Comput 2013; 9:3769-3788. [PMID: 23997693 PMCID: PMC3755638 DOI: 10.1021/ct400230y] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We describe here the PRIMO (PRotein Intermediate Model) force field, a physics-based fully transferable additive coarse-grained potential energy function that is compatible with an all-atom force field for multi-scale simulations. The energy function consists of standard molecular dynamics energy terms plus a hydrogen-bonding potential term and is mainly parameterized based on the CHARMM22/CMAP force field in a bottom-up fashion. The solvent is treated implicitly via the generalized Born model. The bonded interactions are either harmonic or distance-based spline interpolated potentials. These potentials are defined on the basis of all-atom molecular dynamics (MD) simulations of dipeptides with the CHARMM22/CMAP force field. The non-bonded parameters are tuned by matching conformational free energies of diverse set of conformations with that of CHARMM all-atom results. PRIMO is designed to provide a correct description of conformational distribution of the backbone (ϕ/ψ) and side chains (χ1) for all amino acids with a CMAP correction term. The CMAP potential in PRIMO is optimized based on the new CHARMM C36 CMAP. The resulting optimized force field has been applied in MD simulations of several proteins of 36-155 amino acids and shown that the root-mean-squared-deviation of the average structure from the corresponding crystallographic structure varies between 1.80 and 4.03 Å. PRIMO is shown to fold several small peptides to their native-like structures from extended conformations. These results suggest the applicability of the PRIMO force field in the study of protein structures in aqueous solution, structure predictions as well as ab initio folding of small peptides.
Collapse
Affiliation(s)
- Parimal Kar
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Srinivasa Murthy Gopal
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Yi-Ming Cheng
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Alexander Predeus
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
- Department of Chemistry, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
22
|
Mohanty S, Meinke JH, Zimmermann O. Folding of Top7 in unbiased all-atom Monte Carlo simulations. Proteins 2013; 81:1446-56. [DOI: 10.1002/prot.24295] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Revised: 03/05/2013] [Accepted: 03/17/2013] [Indexed: 02/04/2023]
Affiliation(s)
- Sandipan Mohanty
- Jülich Supercomputing Centre; Institute for Advanced Simulation; Forschungszentrum Jülich; D-52425; Jülich; Germany
| | - Jan H. Meinke
- Jülich Supercomputing Centre; Institute for Advanced Simulation; Forschungszentrum Jülich; D-52425; Jülich; Germany
| | - Olav Zimmermann
- Jülich Supercomputing Centre; Institute for Advanced Simulation; Forschungszentrum Jülich; D-52425; Jülich; Germany
| |
Collapse
|
23
|
Miyazawa S. Prediction of contact residue pairs based on co-substitution between sites in protein structures. PLoS One 2013; 8:e54252. [PMID: 23342110 PMCID: PMC3546969 DOI: 10.1371/journal.pone.0054252] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Accepted: 12/10/2012] [Indexed: 11/18/2022] Open
Abstract
Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints in varying degrees on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous sequences and also in the evolutionary trace of amino acid substitutions. A challenge is to extract direct dependences between residue sites by removing phylogenetic correlations and indirect dependences through other residues within a protein or even through other molecules. Rapid growth of protein families with unknown folds requires an accurate de novo prediction method for protein structure. Recent attempts of disentangling direct from indirect dependences of amino acid types between residue positions in multiple sequence alignments have revealed that inferred residue-residue proximities can be sufficient information to predict a protein fold without the use of known three-dimensional structures. Here, we propose an alternative method of inferring coevolving site pairs from concurrent and compensatory substitutions between sites in each branch of a phylogenetic tree. Substitution probability and physico-chemical changes (volume, charge, hydrogen-bonding capability, and others) accompanied by substitutions at each site in each branch of a phylogenetic tree are estimated with the likelihood of each substitution, and their direct correlations between sites are used to detect concurrent and compensatory substitutions. In order to extract direct dependences between sites, partial correlation coefficients of the characteristic changes along branches between sites, in which linear multiple dependences on feature vectors at other sites are removed, are calculated and used to rank coevolving site pairs. Accuracy of contact prediction based on the present coevolution score is comparable to that achieved by a maximum entropy model of protein sequences for 15 protein families taken from the Pfam release 26.0. Besides, this excellent accuracy indicates that compensatory substitutions are significant in protein evolution.
Collapse
Affiliation(s)
- Sanzo Miyazawa
- Graduate School of Engineering, Gunma University, Kiryu, Gunma, Japan.
| |
Collapse
|
24
|
De novo prediction of protein folding pathways and structure using the principle of sequential stabilization. Proc Natl Acad Sci U S A 2012; 109:17442-7. [PMID: 23045636 DOI: 10.1073/pnas.1209000109] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Motivated by the relationship between the folding mechanism and the native structure, we develop a unified approach for predicting folding pathways and tertiary structure using only the primary sequence as input. Simulations begin from a realistic unfolded state devoid of secondary structure and use a chain representation lacking explicit side chains, rendering the simulations many orders of magnitude faster than molecular dynamics simulations. The multiple round nature of the algorithm mimics the authentic folding process and tests the effectiveness of sequential stabilization (SS) as a search strategy wherein 2° structural elements add onto existing structures in a process of progressive learning and stabilization of structure found in prior rounds of folding. Because no a priori knowledge is used, we can identify kinetically significant non-native interactions and intermediates, sometimes generated by only two mutations, while the evolution of contact matrices is often consistent with experiments. Moreover, structure prediction improves substantially by incorporating information from prior rounds. The success of our simple, homology-free approach affirms the validity of our description of the primary determinants of folding pathways and structure, and the effectiveness of SS as a search strategy.
Collapse
|
25
|
Chebaro Y, Pasquali S, Derreumaux P. The Coarse-Grained OPEP Force Field for Non-Amyloid and Amyloid Proteins. J Phys Chem B 2012; 116:8741-52. [DOI: 10.1021/jp301665f] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Yassmine Chebaro
- Laboratoire de Biochimie Théorique,
CNRS UPR 9080, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique,
13 rue Pierre et Marie Curie, 75005 Paris
| | - Samuela Pasquali
- Laboratoire de Biochimie Théorique,
CNRS UPR 9080, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique,
13 rue Pierre et Marie Curie, 75005 Paris
| | - Philippe Derreumaux
- Laboratoire de Biochimie Théorique,
CNRS UPR 9080, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique,
13 rue Pierre et Marie Curie, 75005 Paris
- Institut Universitaire de France, 103 Bvd Saint-Michel, Paris 75005, France
| |
Collapse
|
26
|
Folding helical proteins in explicit solvent using dihedral-biased tempering. Proc Natl Acad Sci U S A 2012; 109:8139-44. [PMID: 22573819 DOI: 10.1073/pnas.1112143109] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Using a single-trajectory-based tempering method with a high-temperature dihedral bias, we repeatedly folded four helical proteins [α(3)D (PDB ID: 2A3D, 73 residues), α(3)W (1LQ7, 67 residues), Fap1-NR(α) (2KUB, 81 residues) and S-836 (2JUA, 102 residues)] and some of the mutants in explicit solvent within several microseconds. The lowest root-mean-square deviations of backbone atoms from the experimentally determined structures were 1.9, 1.4, 1.0, and 2.1 Å, respectively. Cluster analyses of folding trajectories showed the native conformation usually occupied the most populated cluster. The simulation protocol can be applied to large-scale simulations of other helical proteins on commonly accessible computing platforms.
Collapse
|
27
|
Marks DS, Colwell LJ, Sheridan R, Hopf TA, Pagnani A, Zecchina R, Sander C. Protein 3D structure computed from evolutionary sequence variation. PLoS One 2011; 6:e28766. [PMID: 22163331 PMCID: PMC3233603 DOI: 10.1371/journal.pone.0028766] [Citation(s) in RCA: 743] [Impact Index Per Article: 57.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2011] [Accepted: 11/14/2011] [Indexed: 11/19/2022] Open
Abstract
The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues., including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org). This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes.
Collapse
Affiliation(s)
- Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America.
| | | | | | | | | | | | | |
Collapse
|
28
|
Kondov I, Verma A, Wenzel W. Performance assessment of different constraining potentials in computational structure prediction for disulfide-bridged proteins. Comput Biol Chem 2011; 35:230-9. [PMID: 21864792 DOI: 10.1016/j.compbiolchem.2011.04.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2011] [Revised: 04/18/2011] [Accepted: 04/20/2011] [Indexed: 11/17/2022]
Abstract
The presence of disulfide bonds in proteins has very important implications on the three-dimensional structure and folding of proteins. An adequate treatment of disulfide bonds in de-novo protein simulations is therefore very important. Here we present a computational study of a set of small disulfide-bridged proteins using an all-atom stochastic search approach and including various constraining potentials to describe the disulfide bonds. The proposed potentials can easily be implemented in any code based on all-atom force fields and employed in simulations to achieve an improved prediction of protein structure. Exploring different potential parameters and comparing the structures to those from unconstrained simulations and to experimental structures by means of a scoring function we demonstrate that the inclusion of constraining potentials improves the quality of final structures significantly. For some proteins (1KVG and 1PG1) the native conformation is visited only in simulations in presence of constraints. Overall, we found that the Morse potential has optimal performance, in particular for the β-sheet proteins.
Collapse
Affiliation(s)
- Ivan Kondov
- Steinbuch Centre for Computing, Karlsruhe Institute of Technology, Karlsruhe, Germany.
| | | | | |
Collapse
|
29
|
Tian L, Wu A, Cao Y, Dong X, Hu Y, Jiang T. NCACO-score: an effective main-chain dependent scoring function for structure modeling. BMC Bioinformatics 2011; 12:208. [PMID: 21612673 PMCID: PMC3123610 DOI: 10.1186/1471-2105-12-208] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2011] [Accepted: 05/26/2011] [Indexed: 11/10/2022] Open
Abstract
Background Development of effective scoring functions is a critical component to the success of protein structure modeling. Previously, many efforts have been dedicated to the development of scoring functions. Despite these efforts, development of an effective scoring function that can achieve both good accuracy and fast speed still presents a grand challenge. Results Based on a coarse-grained representation of a protein structure by using only four main-chain atoms: N, Cα, C and O, we develop a knowledge-based scoring function, called NCACO-score, that integrates different structural information to rapidly model protein structure from sequence. In testing on the Decoys'R'Us sets, we found that NCACO-score can effectively recognize native conformers from their decoys. Furthermore, we demonstrate that NCACO-score can effectively guide fragment assembly for protein structure prediction, which has achieved a good performance in building the structure models for hard targets from CASP8 in terms of both accuracy and speed. Conclusions Although NCACO-score is developed based on a coarse-grained model, it is able to discriminate native conformers from decoy conformers with high accuracy. NCACO is a very effective scoring function for structure modeling.
Collapse
Affiliation(s)
- Liqing Tian
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | | | | | | | | | | |
Collapse
|
30
|
Simmons KJ, Chopra I, Fishwick CWG. Structure-based discovery of antibacterial drugs. Nat Rev Microbiol 2011; 8:501-10. [PMID: 20551974 DOI: 10.1038/nrmicro2349] [Citation(s) in RCA: 108] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The modern era of antibacterial chemotherapy began in the 1930s, and the next four decades saw the discovery of almost all the major classes of antibacterial agents that are currently in use. However, bacterial resistance to many of these drugs is becoming an increasing problem. As such, the discovery of drugs with novel modes of action will be vital to meet the threats created by the emergence of resistance. Success in discovering inhibitors using high-throughput screening of chemical libraries is rare. In this Review we explore the exciting opportunities for antibacterial-drug discovery arising from structure-based drug design.
Collapse
Affiliation(s)
- Katie J Simmons
- Antimicrobial Research Centre, University of Leeds, Leeds, UK
| | | | | |
Collapse
|
31
|
Xu J, Huang L, Shakhnovich EI. The ensemble folding kinetics of the FBP28 WW domain revealed by an all-atom Monte Carlo simulation in a knowledge-based potential. Proteins 2011; 79:1704-14. [PMID: 21365688 DOI: 10.1002/prot.22993] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2010] [Revised: 12/10/2010] [Accepted: 01/04/2011] [Indexed: 11/10/2022]
Abstract
In this work, we apply a detailed all-atom model with a transferable knowledge-based potential to study the folding kinetics of Formin-Binding protein, FBP28, which is a canonical three-stranded β-sheet WW domain. Replica exchange Monte Carlo simulations starting from random coils find native-like (Cα RMSD of 2.68 Å) lowest energy structure. We also study the folding kinetics of FBP28 WW domain by performing a large number of ab initio Monte Carlo folding simulations. Using these trajectories, we examine the order of formation of two β-hairpins, the folding mechanism of each individual β-hairpin, and transition state ensemble (TSE) of FBP28 WW domain and compare our results with experimental data and previous computational studies. To obtain detailed structural information on the folding dynamics viewed as an ensemble process, we perform a clustering analysis procedure based on graph theory. Further, a rigorous P(fold) analysis is used to obtain representative samples of the TSEs showing good quantitative agreement between experimental and simulated Φ values. Our analysis shows that the turn structure between first and second β strands is a partially stable structural motif that gets formed before entering the TSE in FBP28 WW domain and there exist two major pathways for the folding of FBP28 WW domain, which differ in the order and mechanism of hairpin formation.
Collapse
Affiliation(s)
- Jiabin Xu
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, Massachusetts 02138, USA
| | | | | |
Collapse
|
32
|
Hu Y, Dong X, Wu A, Cao Y, Tian L, Jiang T. Incorporation of local structural preference potential improves fold recognition. PLoS One 2011; 6:e17215. [PMID: 21365008 PMCID: PMC3041821 DOI: 10.1371/journal.pone.0017215] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Accepted: 01/25/2011] [Indexed: 11/19/2022] Open
Abstract
Fold recognition, or threading, is a popular protein structure modeling approach that uses known structure templates to build structures for those of unknown. The key to the success of fold recognition methods lies in the proper integration of sequence, physiochemical and structural information. Here we introduce another type of information, local structural preference potentials of 3-residue and 9-residue fragments, for fold recognition. By combining the two local structural preference potentials with the widely used sequence profile, secondary structure information and hydrophobic score, we have developed a new threading method called FR-t5 (fold recognition by use of 5 terms). In benchmark testings, we have found the consideration of local structural preference potentials in FR-t5 not only greatly enhances the alignment accuracy and recognition sensitivity, but also significantly improves the quality of prediction models.
Collapse
Affiliation(s)
- Yun Hu
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Xiaoxi Dong
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Aiping Wu
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yang Cao
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Liqing Tian
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- Graduate University of Chinese Academy of Sciences, Beijing, China
| | - Taijiao Jiang
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
- * E-mail:
| |
Collapse
|
33
|
Mitternacht S, Berezovsky IN. On the Importance of Amino Acid Sequence and Spatial Proximity of Interacting Residues for Protein Folding. J Biomol Struct Dyn 2011; 28:607-9; discussion 669-674. [DOI: 10.1080/073911011010524961] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
34
|
Abstract
Motivation: One of the major bottlenecks with ab initio protein folding is an effective conformation sampling algorithm that can generate native-like conformations quickly. The popular fragment assembly method generates conformations by restricting the local conformations of a protein to short structural fragments in the PDB. This method may limit conformations to a subspace to which the native fold does not belong because (i) a protein with really new fold may contain some structural fragments not in the PDB and (ii) the discrete nature of fragments may prevent them from building a native-like fold. Previously we have developed a conditional random fields (CRF) method for fragment-free protein folding that can sample conformations in a continuous space and demonstrated that this CRF method compares favorably to the popular fragment assembly method. However, the CRF method is still limited by its capability of generating conformations compatible with a sequence. Results: We present a new fragment-free approach to protein folding using a recently invented probabilistic graphical model conditional neural fields (CNF). This new CNF method is much more powerful than CRF in modeling the sophisticated protein sequence-structure relationship and thus, enables us to generate native-like conformations more easily. We show that when coupled with a simple energy function and replica exchange Monte Carlo simulation, our CNF method can generate decoys much better than CRF on a variety of test proteins including the CASP8 free-modeling targets. In particular, our CNF method can predict a correct fold for T0496_D1, one of the two CASP8 targets with truly new fold. Our predicted model for T0496 is significantly better than all the CASP8 models. Contact:jinboxu@gmail.com
Collapse
Affiliation(s)
- Feng Zhao
- Toyota Technological Institute, Chicago, IL 60637, USA
| | | | | |
Collapse
|
35
|
Klenin K, Strodel B, Wales DJ, Wenzel W. Modelling proteins: conformational sampling and reconstruction of folding kinetics. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2010; 1814:977-1000. [PMID: 20851219 DOI: 10.1016/j.bbapap.2010.09.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2010] [Revised: 09/03/2010] [Accepted: 09/05/2010] [Indexed: 01/08/2023]
Abstract
In the last decades biomolecular simulation has made tremendous inroads to help elucidate biomolecular processes in-silico. Despite enormous advances in molecular dynamics techniques and the available computational power, many problems involve long time scales and large-scale molecular rearrangements that are still difficult to sample adequately. In this review we therefore summarise recent efforts to fundamentally improve this situation by decoupling the sampling of the energy landscape from the description of the kinetics of the process. Recent years have seen the emergence of many advanced sampling techniques, which permit efficient characterisation of the relevant family of molecular conformations by dispensing with the details of the short-term kinetics of the process. Because these methods generate thermodynamic information at best, they must be complemented by techniques to reconstruct the kinetics of the process using the ensemble of relevant conformations. Here we review recent advances for both types of methods and discuss their perspectives to permit efficient and accurate modelling of large-scale conformational changes in biomolecules. This article is part of a Special Issue entitled: Protein Dynamics: Experimental and Computational Approaches.
Collapse
Affiliation(s)
- Konstantin Klenin
- Steinbuch Centre for Computing, Karlsruhe Institute of Technology, P.O. Box 3640, D-76021 Karlsruhe, Germany
| | | | | | | |
Collapse
|
36
|
DeBartolo J, Hocky G, Wilde M, Xu J, Freed KF, Sosnick TR. Protein structure prediction enhanced with evolutionary diversity: SPEED. Protein Sci 2010; 19:520-34. [PMID: 20066664 DOI: 10.1002/pro.330] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
For naturally occurring proteins, similar sequence implies similar structure. Consequently, multiple sequence alignments (MSAs) often are used in template-based modeling of protein structure and have been incorporated into fragment-based assembly methods. Our previous homology-free structure prediction study introduced an algorithm that mimics the folding pathway by coupling the formation of secondary and tertiary structure. Moves in the Monte Carlo procedure involve only a change in a single pair of phi,psi backbone dihedral angles that are obtained from a Protein Data Bank-based distribution appropriate for each amino acid, conditional on the type and conformation of the flanking residues. We improve this method by using MSAs to enrich the sampling distribution, but in a manner that does not require structural knowledge of any protein sequence (i.e., not homologous fragment insertion). In combination with other tools, including clustering and refinement, the accuracies of the predicted secondary and tertiary structures are substantially improved and a global and position-resolved measure of confidence is introduced for the accuracy of the predictions. Performance of the method in the Critical Assessment of Structure Prediction (CASP8) is discussed.
Collapse
Affiliation(s)
- Joe DeBartolo
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | | | |
Collapse
|
37
|
Kuzmanic A, Zagrovic B. Determination of ensemble-average pairwise root mean-square deviation from experimental B-factors. Biophys J 2010; 98:861-71. [PMID: 20197040 DOI: 10.1016/j.bpj.2009.11.011] [Citation(s) in RCA: 264] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Revised: 10/30/2009] [Accepted: 11/03/2009] [Indexed: 11/19/2022] Open
Abstract
Root mean-square deviation (RMSD) after roto-translational least-squares fitting is a measure of global structural similarity of macromolecules used commonly. On the other hand, experimental x-ray B-factors are used frequently to study local structural heterogeneity and dynamics in macromolecules by providing direct information about root mean-square fluctuations (RMSF) that can also be calculated from molecular dynamics simulations. We provide a mathematical derivation showing that, given a set of conservative assumptions, a root mean-square ensemble-average of an all-against-all distribution of pairwise RMSD for a single molecular species, <RMSD(2)>(1/2), is directly related to average B-factors (<B>) and <RMSF(2)>(1/2). We show this relationship and explore its limits of validity on a heterogeneous ensemble of structures taken from molecular dynamics simulations of villin headpiece generated using distributed-computing techniques and the Folding@Home cluster. Our results provide a basis for quantifying global structural diversity of macromolecules in crystals directly from x-ray experiments, and we show this on a large set of structures taken from the Protein Data Bank. In particular, we show that the ensemble-average pairwise backbone RMSD for a microscopic ensemble underlying a typical protein x-ray structure is approximately 1.1 A, under the assumption that the principal contribution to experimental B-factors is conformational variability.
Collapse
|
38
|
Maupetit J, Derreumaux P, Tufféry P. A fast method for large-scale de novo peptide and miniprotein structure prediction. J Comput Chem 2010; 31:726-38. [PMID: 19569182 DOI: 10.1002/jcc.21365] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although peptides have many biological and biomedical implications, an accurate method predicting their equilibrium structural ensembles from amino acid sequences and suitable for large-scale experiments is still missing. We introduce a new approach-PEP-FOLD-to the de novo prediction of peptides and miniproteins. It first predicts, in the terms of a Hidden Markov Model-derived structural alphabet, a limited number of local conformations at each position of the structure. It then performs their assembly using a greedy procedure driven by a coarse-grained energy score. On a benchmark of 52 peptides with 9-23 amino acids, PEP-FOLD generates lowest-energy conformations within 2.8 and 2.3 A Calpha root-mean-square deviation from the full nuclear magnetic resonance structures (NMR) and the NMR rigid cores, respectively, outperforming previous approaches. For 13 miniproteins with 27-49 amino acids, PEP-FOLD reaches an accuracy of 3.6 and 4.6 A Calpha root-mean-square deviation for the most-native and lowest-energy conformations, using the nonflexible regions identified by NMR. PEP-FOLD simulations are fast-a few minutes only-opening therefore, the door to in silico large-scale rational design of new bioactive peptides and miniproteins.
Collapse
Affiliation(s)
- Julien Maupetit
- MTi, INSERM UMR-S973 and RPBS, Université Paris Diderot - Paris 7, 5 rue Marie-Andrée Lagroua Weill-Halle, 75205 Paris, Cedex 13, France
| | | | | |
Collapse
|
39
|
Maisuradze GG, Liwo A, Scheraga HA. Relation between free energy landscapes of proteins and dynamics. J Chem Theory Comput 2010; 6:583-595. [PMID: 23620713 DOI: 10.1021/ct9005745] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
By examining the molecular dynamics (MD) of protein folding trajectories, generated with the coarse-grained UNRES force field, for the B-domain of staphylococcal protein A and the triple β-strand WW domain from the Formin binding protein 28 (FBP), by principal component analysis (PCA), it is demonstrated how different free energy landscapes (FELs) and folding pathways of trajectories can be, even though they appear to be very similar by visual inspection of the time-dependence of the root-mean-square deviation (rmsd). Approaches to determine the minimal dimensionality of FELs for a correct description of protein folding dynamics are discussed. The correlation between the amplitude of the fluctuations of proteins and the dimensionality of the FELs is shown. The advantage of internal coordinate PCA over Cartesian PCA for small proteins is also illustrated.
Collapse
Affiliation(s)
- Gia G Maisuradze
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University Ithaca, New York 14853-1301
| | | | | |
Collapse
|
40
|
Deb D, Vishveshwara S, Vishveshwara S. Understanding protein structure from a percolation perspective. Biophys J 2009; 97:1787-94. [PMID: 19751685 DOI: 10.1016/j.bpj.2009.07.016] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2009] [Revised: 07/02/2009] [Accepted: 07/15/2009] [Indexed: 01/03/2023] Open
Abstract
Underlying the unique structures and diverse functions of proteins are a vast range of amino-acid sequences and a highly limited number of folds taken up by the polypeptide backbone. By investigating the role of noncovalent connections at the backbone level and at the detailed side-chain level, we show that these unique structures emerge from interplay between random and selected features. Primarily, the protein structure network formed by these connections shows simple (bond) and higher order (clique) percolation behavior distinctly reminiscent of random network models. However, the clique percolation specific to the side-chain interaction network bears signatures unique to proteins characterized by a larger degree of connectivity than in random networks. These studies reflect some salient features of the manner in which amino acid sequences select the unique structure of proteins from the pool of a limited number of available folds.
Collapse
Affiliation(s)
- Dhruba Deb
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | | | | |
Collapse
|
41
|
A sequence-compatible amount of native burial information is sufficient for determining the structure of small globular proteins. Proc Natl Acad Sci U S A 2009; 106:19001-4. [PMID: 19858496 DOI: 10.1073/pnas.0910851106] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Protein tertiary structures are known to be encoded in amino acid sequences, but the problem of structure prediction from sequence continues to be a challenge. With this question in mind, recent simulations have shown that atomic burials, as expressed by atom distances to the molecular geometrical center, are sufficiently informative for determining native conformations of small globular proteins. Here we use a simple computational experiment to estimate the amount of this required burial information and find it to be surprisingly small, actually comparable with the stringent limit imposed by sequence statistics. Atomic burials appear to satisfy, therefore, minimal requirements for a putative dominating property in the folding code because they provide an amount of information sufficiently large for structural determination but, at the same time, sufficiently small to be encodable in sequences. In a simple analogy with human communication, atomic burials could correspond to the actual "language" encoded in the amino acid "script" from which the complexity of native conformations is recovered during the folding process.
Collapse
|
42
|
He Y, Xiao Y, Liwo A, Scheraga HA. Exploring the parameter space of the coarse-grained UNRES force field by random search: selecting a transferable medium-resolution force field. J Comput Chem 2009; 30:2127-35. [PMID: 19242966 DOI: 10.1002/jcc.21215] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
We explored the energy-parameter space of our coarse-grained UNRES force field for large-scale ab initio simulations of protein folding, to obtain good initial approximations for hierarchical optimization of the force field with new virtual-bond-angle bending and side-chain-rotamer potentials which we recently introduced to replace the statistical potentials. 100 sets of energy-term weights were generated randomly, and good sets were selected by carrying out replica-exchange molecular dynamics simulations of two peptides with a minimal alpha-helical and a minimal beta-hairpin fold, respectively: the tryptophan cage (PDB code: 1L2Y) and tryptophan zipper (PDB code: 1LE1). Eight sets of parameters produced native-like structures of these two peptides. These eight sets were tested on two larger proteins: the engrailed homeodomain (PDB code: 1ENH) and FBP WW domain (PDB code: 1E0L); two sets were found to produce native-like conformations of these proteins. These two sets were tested further on a larger set of nine proteins with alpha or alpha + beta structure and found to locate native-like structures of most of them. These results demonstrate that, in addition to finding reasonable initial starting points for optimization, an extensive search of parameter space is a powerful method to produce a transferable force field.
Collapse
Affiliation(s)
- Yi He
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan 430074, China
| | | | | | | |
Collapse
|
43
|
Kondov I, Verma A, Wenzel W. Folding Path and Funnel Scenarios for Two Small Disulfide-Bridged Proteins. Biochemistry 2009; 48:8195-205. [PMID: 19610617 DOI: 10.1021/bi900702m] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Ivan Kondov
- Steinbuch Centre for Computing, Forschungszentrum Karlsruhe, P.O. Box 3640, 76021 Karlsruhe, Germany
| | - Abhinav Verma
- Steinbuch Centre for Computing, Forschungszentrum Karlsruhe, P.O. Box 3640, 76021 Karlsruhe, Germany
| | - Wolfgang Wenzel
- Institute of Nanotechnology, Forschungszentrum Karlsruhe, P.O. Box 3640, 76021 Karlsruhe, Germany, and DFG Center for Functional Nanostructures, Department of Physics, Universität Karlsruhe, Wolfgang Gaede Strasse 1, 76131 Karlsruhe, Germany
| |
Collapse
|
44
|
Levy-Moonshine A, Amir EAD, Keasar C. Enhancement of beta-sheet assembly by cooperative hydrogen bonds potential. ACTA ACUST UNITED AC 2009; 25:2639-45. [PMID: 19628506 DOI: 10.1093/bioinformatics/btp449] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION The roughness of energy landscapes is a major obstacle to protein structure prediction, since it forces conformational searches to spend much time struggling to escape numerous traps. Specifically, beta-sheet formation is prone to stray, since many possible combinations of hydrogen bonds are dead ends in terms of beta-sheet assembly. It has been shown that cooperative terms for backbone hydrogen bonds ease this problem by augmenting hydrogen bond patterns that are consistent with beta sheets. Here, we present a novel cooperative hydrogen-bond term that is both effective in promoting beta sheets and computationally efficient. In addition, the new term is differentiable and operates on all-atom protein models. RESULTS Energy optimization of poly-alanine chains under the new term led to significantly more beta-sheet content than optimization under a non-cooperative term. Furthermore, the optimized structure included very few non-native patterns. AVAILABILITY The new term is implemented within the MESHI package and is freely available at http://cs.bgu.ac.il/ approximately meshi.
Collapse
Affiliation(s)
- Ami Levy-Moonshine
- Department of Computer Science and Department of Life Sciences, Ben-Gurion University of the Negev, 84105 Beer-Sheva, Israel
| | | | | |
Collapse
|
45
|
Kang TS, Kini RM. Structural determinants of protein folding. Cell Mol Life Sci 2009; 66:2341-61. [PMID: 19367367 PMCID: PMC11115868 DOI: 10.1007/s00018-009-0023-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2009] [Revised: 03/07/2009] [Accepted: 03/20/2009] [Indexed: 12/11/2022]
Abstract
The last several decades have seen an explosion of knowledge in the field of structural biology. With critical advances in spectroscopic techniques in examining structures of biomacromolecules, in maturation of molecular biology techniques, as well as vast improvements in computation prowess, protein structures are now being elucidated at an unprecedented rate. In spite of all the recent advances, the protein folding puzzle remains as one of the fundamental biochemical challenges. A facet to this empiric problem is the structural determinants of protein folding. What are the driving forces that pivot a polypeptide chain to a specific conformation amongst the vast conformation space? In this review, we shall discuss some of the structural determinants to protein folding that have been identified in the recent decades.
Collapse
Affiliation(s)
- Tse Siang Kang
- The Scripps Research Institute, 10550 North Torrey Pines Road GAC 1200, La Jolla, CA 92037 USA
- Department of Pharmacy, National University of Singapore, 18 Science Drive 4, Block S4, Singapore, 117543 Singapore
| | - R. Manjunatha Kini
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Block S3 #03-17, Singapore, 117543 Singapore
| |
Collapse
|
46
|
Kutchukian PS, Yang JS, Verdine GL, Shakhnovich EI. All-atom model for stabilization of alpha-helical structure in peptides by hydrocarbon staples. J Am Chem Soc 2009; 131:4622-7. [PMID: 19334772 DOI: 10.1021/ja805037p] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Recent work has shown that the incorporation of an all-hydrocarbon "staple" into peptides can greatly increase their alpha-helix propensity, leading to an improvement in pharmaceutical properties such as proteolytic stability, receptor affinity, and cell permeability. Stapled peptides thus show promise as a new class of drugs capable of accessing intractable targets such as those that engage in intracellular protein-protein interactions. The extent of alpha-helix stabilization provided by stapling has proven to be substantially context dependent, requiring cumbersome screening to identify the optimal site for staple incorporation. In certain cases, a staple encompassing one turn of the helix (attached at residues i and i+4) furnishes greater helix stabilization than one encompassing two turns (i,i+7 staple), which runs counter to expectation based on polymer theory. These findings highlight the need for a more thorough understanding of the forces that underlie helix stabilization by hydrocarbon staples. Here we report all-atom Monte Carlo folding simulations comparing unmodified peptides derived from RNase A and BID BH3 with various i,i+4 and i,i+7 stapled versions thereof. The results of these simulations were found to be in quantitative agreement with experimentally determined helix propensities. We also discovered that staples can stabilize quasi-stable decoy conformations, and that the removal of these states plays a major role in determining the helix stability of stapled peptides. Finally, we critically investigate why our method works, exposing the underlying physical forces that stabilize stapled peptides.
Collapse
Affiliation(s)
- Peter S Kutchukian
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, Massachusetts 02138, USA
| | | | | | | |
Collapse
|
47
|
Zhao LL, Wu A, Bi LJ, Liu P, Zhang XE, Jiang T, Jin G, Qi Z. Length-dependent regulation of the Kv1.2 channel activation by its C-terminus. Mol Membr Biol 2009; 26:186-93. [PMID: 19247844 DOI: 10.1080/09687680802714741] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
The cytoplasmic C-terminus plays regulatory roles in the gating of many ion channels. However, lack of structural information on the C-terminus prevents the elucidation of how the C-terminal domain interacts with the gating machinery to exert its effects on the channel gating. In this report, we investigated the regulatory role of the C-terminus with functional study and structural modeling of a succession of C-terminal truncations of the Kv1.2 and Kv1.2(427)-KcsA(112-160) chimeric channels. Functional study demonstrated a length-dependent shift of the activation curves for the C-terminal truncations of the Kv1.2 channel. Structural modeling indicated that the C-terminus of one subunit could dynamically interact with the S4-S5 linker of a neighboring subunit and the probability of interaction was dependent on the length of the C-terminal truncated Kv1.2 channels. In contrast, no length-dependent shift of the activation curve and probability of interaction between C-terminus and the neighboring S4-S5 linker were observed for the truncations of the Kv1.2-KcsA chimeric channel, suggesting that the native C-terminus of the Kv1.2 channel is essential for the interaction. Furthermore, surface plasmon resonance measurements indicated that there is direct interaction between the C-terminal domain and the S4-S5 linker of the Kv1.2 channel. These results imply that the dynamic interaction of the C-terminus with the S4-S5 linker from a neighboring subunit of the Kv1.2 channel provides a mechanism for its C-terminus to regulate the channel activation.
Collapse
Affiliation(s)
- Li-Li Zhao
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, PR China
| | | | | | | | | | | | | | | |
Collapse
|
48
|
Mimicking the folding pathway to improve homology-free protein structure prediction. Proc Natl Acad Sci U S A 2009; 106:3734-9. [PMID: 19237560 DOI: 10.1073/pnas.0811363106] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Since the demonstration that the sequence of a protein encodes its structure, the prediction of structure from sequence remains an outstanding problem that impacts numerous scientific disciplines, including many genome projects. By iteratively fixing secondary structure assignments of residues during Monte Carlo simulations of folding, our coarse-grained model without information concerning homology or explicit side chains can outperform current homology-based secondary structure prediction methods for many proteins. The computationally rapid algorithm using only single (phi,psi) dihedral angle moves also generates tertiary structures of accuracy comparable with existing all-atom methods for many small proteins, particularly those with low homology. Hence, given appropriate search strategies and scoring functions, reduced representations can be used for accurately predicting secondary structure and providing 3D structures, thereby increasing the size of proteins approachable by homology-free methods and the accuracy of template methods that depend on a high-quality input secondary structure.
Collapse
|
49
|
Bowman GR, Pande VS. Simulated tempering yields insight into the low-resolution Rosetta scoring functions. Proteins 2009; 74:777-88. [PMID: 18767152 DOI: 10.1002/prot.22210] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Rosetta is a structure prediction package that has been employed successfully in numerous protein design and other applications.1 Previous reports have attributed the current limitations of the Rosetta de novo structure prediction algorithm to inadequate sampling, particularly during the low-resolution phase.2-5 Here, we implement the Simulated Tempering (ST) sampling algorithm67 in Rosetta to address this issue. ST is intended to yield canonical sampling by inducing a random walk in temperatures space such that broad sampling is achieved at high temperatures and detailed exploration of local free energy minima is achieved at low temperatures. ST should therefore visit basins in accordance with their free energies rather than their energies and achieve more global sampling than the localized scheme currently implemented in Rosetta. However, we find that ST does not improve structure prediction with Rosetta. To understand why, we carried out a detailed analysis of the low-resolution scoring functions and find that they do not provide a strong bias towards the native state. In addition, we find that both ST and standard Rosetta runs started from the native state are biased away from the native state. Although the low-resolution scoring functions could be improved, we propose that working entirely at full-atom resolution is now possible and may be a better option due to superior native-state discrimination at full-atom resolution. Such an approach will require more attention to the kinetics of convergence, however, as functions capable of native state discrimination are not necessarily capable of rapidly guiding non-native conformations to the native state.
Collapse
Affiliation(s)
- Gregory R Bowman
- Biophysics Program, Stanford University, Stanford, California 94305, USA
| | | |
Collapse
|
50
|
Protein Structure Prediction Using an Associated Memory Hamiltonian and All-Atom Molecular Dynamics Simulations. B KOREAN CHEM SOC 2008. [DOI: 10.5012/bkcs.2008.29.11.2172] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|