1
|
Capelli R, Paissoni C, Sormanni P, Tiana G. Iterative derivation of effective potentials to sample the conformational space of proteins at atomistic scale. J Chem Phys 2014; 140:195101. [PMID: 24852563 DOI: 10.1063/1.4876219] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The current capacity of computers makes it possible to perform simulations of small systems with portable, explicit-solvent potentials achieving high degree of accuracy. However, simplified models must be employed to exploit the behavior of large systems or to perform systematic scans of smaller systems. While powerful algorithms are available to facilitate the sampling of the conformational space, successful applications of such models are hindered by the availability of simple enough potentials able to satisfactorily reproduce known properties of the system. We develop an interatomic potential to account for a number of properties of proteins in a computationally economic way. The potential is defined within an all-atom, implicit solvent model by contact functions between the different atom types. The associated numerical values can be optimized by an iterative Monte Carlo scheme on any available experimental data, provided that they are expressible as thermal averages of some conformational properties. We test this model on three different proteins, for which we also perform a scan of all possible point mutations with explicit conformational sampling. The resulting models, optimized solely on a subset of native distances, not only reproduce the native conformations within a few Angstroms from the experimental ones, but show the cooperative transition between native and denatured state and correctly predict the measured free-energy changes associated with point mutations. Moreover, differently from other structure-based models, our method leaves a residual degree of frustration, which is known to be present in protein molecules.
Collapse
Affiliation(s)
- Riccardo Capelli
- Department of Physics, Università degli Studi di Milano, via Celoria 16, 20133 Milano, Italy
| | - Cristina Paissoni
- Department of Chemistry, Università degli Studi di Milano, via Venezian 21, 20133 Milano, Italy
| | - Pietro Sormanni
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Guido Tiana
- Department of Physics, Università degli Studi di Milano and INFN, via Celoria 16, 20133 Milano, Italy
| |
Collapse
|
2
|
Abstract
By focusing on essential features, while averaging over less important details, coarse-grained (CG) models provide significant computational and conceptual advantages with respect to more detailed models. Consequently, despite dramatic advances in computational methodologies and resources, CG models enjoy surging popularity and are becoming increasingly equal partners to atomically detailed models. This perspective surveys the rapidly developing landscape of CG models for biomolecular systems. In particular, this review seeks to provide a balanced, coherent, and unified presentation of several distinct approaches for developing CG models, including top-down, network-based, native-centric, knowledge-based, and bottom-up modeling strategies. The review summarizes their basic philosophies, theoretical foundations, typical applications, and recent developments. Additionally, the review identifies fundamental inter-relationships among the diverse approaches and discusses outstanding challenges in the field. When carefully applied and assessed, current CG models provide highly efficient means for investigating the biological consequences of basic physicochemical principles. Moreover, rigorous bottom-up approaches hold great promise for further improving the accuracy and scope of CG models for biomolecular systems.
Collapse
Affiliation(s)
- W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
3
|
Huang SY, Zou X. ITScorePro: an efficient scoring program for evaluating the energy scores of protein structures for structure prediction. Methods Mol Biol 2014; 1137:71-81. [PMID: 24573475 PMCID: PMC11121506 DOI: 10.1007/978-1-4939-0366-5_6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
One important component in protein structure prediction is to evaluate the free energy of a given conformation. Given the enormous number of possible conformations for a sequence, it is extremely challenging to quickly and accurately score the energies of these conformations and predict a reasonable structure within a practical computational time. Here, we describe an efficient program for energy evaluation, referred to as ITScorePro (Copyright © 2012). The energy scoring function in the ITScorePro program is based on the distance-dependent, pairwise atomic potentials for protein structure prediction that we recently derived by using statistical mechanics principles (Huang and Zou, Proteins 79:2648-2661, 2011). ITScorePro is a stand-alone program and can also be easily implemented in other software suites for protein structure prediction.
Collapse
Affiliation(s)
- Sheng-You Huang
- Department of Physics and Astronomy, Dalton Cardiovascular Research Center, Informatics Institute, University of Missouri, Columbia, MO, USA
| | | |
Collapse
|
4
|
Várnai C, Burkoff NS, Wild DL. Efficient Parameter Estimation of Generalizable Coarse-Grained Protein Force Fields Using Contrastive Divergence: A Maximum Likelihood Approach. J Chem Theory Comput 2013; 9:5718-5733. [PMID: 24683370 PMCID: PMC3966533 DOI: 10.1021/ct400628h] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Indexed: 01/05/2023]
Abstract
Maximum Likelihood (ML) optimization schemes are widely used for parameter inference. They maximize the likelihood of some experimentally observed data, with respect to the model parameters iteratively, following the gradient of the logarithm of the likelihood. Here, we employ a ML inference scheme to infer a generalizable, physics-based coarse-grained protein model (which includes Go̅-like biasing terms to stabilize secondary structure elements in room-temperature simulations), using native conformations of a training set of proteins as the observed data. Contrastive divergence, a novel statistical machine learning technique, is used to efficiently approximate the direction of the gradient ascent, which enables the use of a large training set of proteins. Unlike previous work, the generalizability of the protein model allows the folding of peptides and a protein (protein G) which are not part of the training set. We compare the same force field with different van der Waals (vdW) potential forms: a hard cutoff model, and a Lennard-Jones (LJ) potential with vdW parameters inferred or adopted from the CHARMM or AMBER force fields. Simulations of peptides and protein G show that the LJ model with inferred parameters outperforms the hard cutoff potential, which is consistent with previous observations. Simulations using the LJ potential with inferred vdW parameters also outperforms the protein models with adopted vdW parameter values, demonstrating that model parameters generally cannot be used with force fields with different energy functions. The software is available at https://sites.google.com/site/crankite/.
Collapse
Affiliation(s)
- Csilla Várnai
- Systems Biology Centre, University of Warwick, Coventry, United Kingdom
| | | | - David L. Wild
- Systems Biology Centre, University of Warwick, Coventry, United Kingdom
| |
Collapse
|
5
|
Huang SY, Zou X. Statistical mechanics-based method to extract atomic distance-dependent potentials from protein structures. Proteins 2011; 79:2648-61. [PMID: 21732421 PMCID: PMC11108592 DOI: 10.1002/prot.23086] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2011] [Revised: 04/21/2011] [Accepted: 05/09/2011] [Indexed: 12/25/2022]
Abstract
In this study, we have developed a statistical mechanics-based iterative method to extract statistical atomic interaction potentials from known, nonredundant protein structures. Our method circumvents the long-standing reference state problem in deriving traditional knowledge-based scoring functions, by using rapid iterations through a physical, global convergence function. The rapid convergence of this physics-based method, unlike other parameter optimization methods, warrants the feasibility of deriving distance-dependent, all-atom statistical potentials to keep the scoring accuracy. The derived potentials, referred to as ITScore/Pro, have been validated using three diverse benchmarks: the high-resolution decoy set, the AMBER benchmark decoy set, and the CASP8 decoy set. Significant improvement in performance has been achieved. Finally, comparisons between the potentials of our model and potentials of a knowledge-based scoring function with a randomized reference state have revealed the reason for the better performance of our scoring function, which could provide useful insight into the development of other physical scoring functions. The potentials developed in this study are generally applicable for structural selection in protein structure prediction.
Collapse
Affiliation(s)
- Sheng-You Huang
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute, University of Missouri, Columbia, MO 65211
| | - Xiaoqin Zou
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute, University of Missouri, Columbia, MO 65211
| |
Collapse
|
6
|
Bhattacherjee A, Biswas P. Designing Misfolded Proteins by Energy Landscaping. J Phys Chem B 2010; 115:113-9. [DOI: 10.1021/jp108416c] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | - Parbati Biswas
- Department of Chemistry, University of Delhi, Delhi-110007
| |
Collapse
|
7
|
Abstract
Knowledge-based approaches frequently employ empirical relations to determine effective potentials for coarse-grained protein models directly from protein databank structures. Although these approaches have enjoyed considerable success and widespread popularity in computational protein science, their fundamental basis has been widely questioned. It is well established that conventional knowledge-based approaches do not correctly treat many-body correlations between amino acids. Moreover, the physical significance of potentials determined by using structural statistics from different proteins has remained obscure. In the present work, we address both of these concerns by introducing and demonstrating a theory for calculating transferable potentials directly from a databank of protein structures. This approach assumes that the databank structures correspond to representative configurations sampled from equilibrium solution ensembles for different proteins. Given this assumption, this physics-based theory exactly treats many-body structural correlations and directly determines the transferable potentials that provide a variationally optimized approximation to the free energy landscape for each protein. We illustrate this approach by first constructing a databank of protein structures using a model potential and then quantitatively recovering this potential from the structure databank. The proposed framework will clarify the assumptions and physical significance of knowledge-based potentials, allow for their systematic improvement, and provide new insight into many-body correlations and cooperativity in folded proteins.
Collapse
|
8
|
Bhattacherjee A, Biswas P. Neutrality and evolvability of designed protein sequences. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 82:011906. [PMID: 20866647 DOI: 10.1103/physreve.82.011906] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2009] [Revised: 03/25/2010] [Indexed: 05/29/2023]
Abstract
The effect of foldability on protein's evolvability is analyzed by a two-prong approach consisting of a self-consistent mean-field theory and Monte Carlo simulations. Theory and simulation models representing protein sequences with binary patterning of amino acid residues compatible with a particular foldability criteria are used. This generalized foldability criterion is derived using the high temperature cumulant expansion approximating the free energy of folding. The effect of cumulative point mutations on these designed proteins is studied under neutral condition. The robustness, protein's ability to tolerate random point mutations is determined with a selective pressure of stability (ΔΔG) for the theory designed sequences, which are found to be more robust than that of Monte Carlo and mean-field-biased Monte Carlo generated sequences. The results show that this foldability criterion selects viable protein sequences more effectively compared to the Monte Carlo method, which has a marked effect on how the selective pressure shapes the evolutionary sequence space. These observations may impact de novo sequence design and its applications in protein engineering.
Collapse
|
9
|
Lezon TR, Bahar I. Using entropy maximization to understand the determinants of structural dynamics beyond native contact topology. PLoS Comput Biol 2010; 6:e1000816. [PMID: 20585542 PMCID: PMC2887458 DOI: 10.1371/journal.pcbi.1000816] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 05/13/2010] [Indexed: 11/19/2022] Open
Abstract
Comparison of elastic network model predictions with experimental data has provided important insights on the dominant role of the network of inter-residue contacts in defining the global dynamics of proteins. Most of these studies have focused on interpreting the mean-square fluctuations of residues, or deriving the most collective, or softest, modes of motions that are known to be insensitive to structural and energetic details. However, with increasing structural data, we are in a position to perform a more critical assessment of the structure-dynamics relations in proteins, and gain a deeper understanding of the major determinants of not only the mean-square fluctuations and lowest frequency modes, but the covariance or the cross-correlations between residue fluctuations and the shapes of higher modes. A systematic study of a large set of NMR-determined proteins is analyzed using a novel method based on entropy maximization to demonstrate that the next level of refinement in the elastic network model description of proteins ought to take into consideration properties such as contact order (or sequential separation between contacting residues) and the secondary structure types of the interacting residues, whereas the types of amino acids do not play a critical role. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as opposed to exclusively stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics. This study provides us with a deeper understanding of the structural basis of experimentally observed behavior, and opens the way to the development of more accurate models for exploring protein dynamics. As more protein structures are solved, we are able to perform a more critical assessment of the relationship between protein structure and dynamics, and to gain a deeper understanding of the major determinants of structural dynamics. Here we perform a systematic study on a set of proteins structurally determined by NMR spectroscopy. The dynamics are analyzed using elastic network models and a novel method based on entropy maximization to demonstrate that properties such as contact order and secondary structure do play a role in defining the experimentally observed covariance data. Most importantly, an optimal description of observed cross-correlations requires the inclusion of destabilizing, as well as stabilizing, interactions, stipulating the functional significance of local frustration in imparting native-like dynamics.
Collapse
Affiliation(s)
| | - Ivet Bahar
- Department of Computational Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
10
|
Statistical theory of neutral protein evolution by random site mutations. J CHEM SCI 2009. [DOI: 10.1007/s12039-009-0105-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
11
|
Yang YD, Park C, Kihara D. Threading without optimizing weighting factors for scoring function. Proteins 2008; 73:581-96. [DOI: 10.1002/prot.22082] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
12
|
Kenzaki H, Kikuchi M. Free-energy landscape of kinesin by a realistic lattice model. Proteins 2008; 71:389-95. [DOI: 10.1002/prot.21707] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
13
|
Skolnick J, Kolinski A. Monte Carlo Approaches to the Protein Folding Problem. ADVANCES IN CHEMICAL PHYSICS 2007. [DOI: 10.1002/9780470141649.ch7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|
14
|
Scheraga HA, Hao MH. Entropy Sampling Monte Carlo for Polypeptides and Proteins. ADVANCES IN CHEMICAL PHYSICS 2007. [DOI: 10.1002/9780470141649.ch8] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
15
|
Calculation of the Free Energy and the Entropy of Macromolecular Systems by Computer Simulation. REVIEWS IN COMPUTATIONAL CHEMISTRY 2007. [DOI: 10.1002/9780470125892.ch1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register]
|
16
|
Winther O, Krogh A. Teaching computers to fold proteins. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2004; 70:030903. [PMID: 15524499 DOI: 10.1103/physreve.70.030903] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2003] [Revised: 04/26/2004] [Indexed: 05/24/2023]
Abstract
A new general algorithm for optimization of potential functions for protein folding is introduced. It is based upon gradient optimization of the thermodynamic stability of native folds of a training set of proteins with known structure. The iterative update rule contains two thermodynamic averages which are estimated by (generalized ensemble) Monte Carlo. We test the learning algorithm on a Lennard-Jones (LJ) force field with a torsional angle degrees-of-freedom and a single-atom side-chain. In a test with 24 peptides of known structure, none folded correctly with the initial potential functions, but two-thirds came within 3 A to their native fold after optimizing the potential functions.
Collapse
Affiliation(s)
- Ole Winther
- Center for Biological Sequence Analysis, The Technical University of Denmark, Building 208, DK-2800 Lyngby, Denmark.
| | | |
Collapse
|
17
|
Abstract
A model of the protein backbone is considered in which each residue is characterized by the location of its C(alpha) atom and one of a discrete set of conformal (phi, psi) states. We investigate the key differences between a description that offers a locally precise fit to known backbone structures and one that provides a globally accurate fit to protein structures. Using a statistical scoring scheme and threading, a protein's local best-fit conformation is highly recognizable, but its global structure cannot be directly determined from an amino acid sequence. The incorporation of information about the conformal states of neighboring residues along the chain allows one to accurately translate the local structure into a global structure. We present a two-step algorithm, which recognizes up to 95% of the tested protein native-state structures to within a 2.5 A root mean square deviation.
Collapse
Affiliation(s)
- Timothy Lezon
- Department of Physics, Pennsylvania State University, University Park, Pennsylvania, USA.
| | | | | |
Collapse
|
18
|
|
19
|
Pattabiraman N. Role of residue packing in protein folding. Trends Analyt Chem 2003. [DOI: 10.1016/s0165-9936(03)00906-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
20
|
Blackburne BP, Hirst JD. Three-dimensional functional model proteins: Structure function and evolution. J Chem Phys 2003. [DOI: 10.1063/1.1590310] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
21
|
Abstract
Multiple sequence alignments are a routine tool in protein fold recognition, but multiple structure alignments are computationally less cooperative. This work describes a method for protein sequence threading and sequence-to-structure alignments that uses multiple aligned structures, the aim being to improve models from protein threading calculations. Sequences are aligned into a field due to corresponding sites in homologous proteins. On the basis of a test set of more than 570 protein pairs, the procedure does improve alignment quality, although no more than averaging over sequences. For the force field tested, the benefit of structure averaging is smaller than that of adding sequence similarity terms or a contribution from secondary structure predictions. Although there is a significant improvement in the quality of sequence-to-structure alignments, this does not directly translate to an immediate improvement in fold recognition capability.
Collapse
Affiliation(s)
- Anthony J Russell
- Research School of Chemistry, Australian National University, Canberra, Australia
| | | |
Collapse
|
22
|
Kussell E, Shimada J, Shakhnovich EI. A structure-based method for derivation of all-atom potentials for protein folding. Proc Natl Acad Sci U S A 2002; 99:5343-8. [PMID: 11943859 PMCID: PMC122771 DOI: 10.1073/pnas.072665799] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A method for deriving all-atom protein folding potentials is presented and tested on a three-helix bundle protein, as well as on hairpin and helical sequences. The potentials obtained are composed of a contact term between pairs of atoms, and a local density term for each atom, mimicking solvent exposure preferences. Using this potential in an all-atom protein folding simulation, we repeatedly folded the three-helix bundle, with the lowest energy conformations having a C(alpha) distance rms from the native structure of less than 2 A. Similar results were obtained for the hairpin and helices by using different potentials. We derived potentials for several different proteins and found a high correlation between the derived parameters, suggesting that a potential of this form eventually could be found that folds multiple, unrelated proteins at the atomic level of detail.
Collapse
Affiliation(s)
- Edo Kussell
- Department of Biophysics, Harvard University, 240 Longwood Avenue, Boston, MA 02115, USA
| | | | | |
Collapse
|
23
|
Hardin C, Eastwood MP, Prentiss M, Luthey-Schulten Z, Wolynes PG. Folding funnels: the key to robust protein structure prediction. J Comput Chem 2002; 23:138-46. [PMID: 11913379 DOI: 10.1002/jcc.1162] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Natural proteins fold because their free energy landscapes are funneled to their native states. The degree to which a model energy function for protein structure prediction can avoid the multiple minima problem and reliably yield at least low-resolution predictions is also dependent on the topography of the energy landscape. We show that the degree of funneling can be quantitatively expressed in terms of a few averaged properties of the landscape. This allows us to optimize simplified energy functions for protein structure prediction even in the absence of homology information. Here we outline the optimization procedure in the context of associative memory energy functions originally introduced for tertiary structure recognition and demonstrate that even partially funneled landscapes lead to qualitatively correct, low-resolution predictions.
Collapse
Affiliation(s)
- Corey Hardin
- Center for Biophysics and Computational Biology, University of Illinois, Urbana 61801, USA
| | | | | | | | | |
Collapse
|
24
|
Liu Y, Beveridge DL. Exploratory studies of ab initio protein structure prediction: multiple copy simulated annealing, AMBER energy functions, and a generalized born/solvent accessibility solvation model. Proteins 2002; 46:128-46. [PMID: 11746709 DOI: 10.1002/prot.10020] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
A theoretical and computational approach to ab initio structure prediction for polypeptides in water is described and applied to selected amino acid sequences for testing and preliminary validation. The method builds systematically on the extensive efforts applied to parameterization of molecular dynamics (MD) force fields, employs an empirically well-validated continuum dielectric model for solvation, and an eminently parallelizable approach to conformational search. The effective free energy of polypeptide chains is estimated from AMBER united atom potential functions, with internal degrees of freedom for both backbone and amino acid side chains explicitly treated. The hydration free energy of each structure is determined using the Generalized Born/Solvent Accessibility (GBSA) method, modified and reparameterized to include atom types consistent with the AMBER force field. The conformational search procedure employs a multiple copy, Monte Carlo simulated annealing (MCSA) protocol in full torsion angle space, applied iteratively on sets of structures of progressively lower free energy until a prediction of a structure with lowest effective free energy is obtained. Calibration tests for the effective energy function and search algorithm are performed on the alanine dipeptide, selected protein crystal structures, and united atom decoys on barnase, crambin, and six examples from the Rosetta set. Specific demonstration cases of the method are provided for the 8-mer sequence of Ala residues, a 12-residue peptide with longer side chains QLLKKLLQQLKQ, a de novo designed 16 residue peptide of sequence (AAQAA)3Y, a 15-residue sequence with a beta sheet motif, GEWTWDATKTFTVTE, and a 36 residue small protein, Villin headpiece. The Ala 8-mer readily formed an alpha-helix. An alpha-helix structure was predicted for the 16-mer, consistent with observed results from IR and CD spectroscopy and with the pattern in psi/straight phi angles of known protein structures. The predicted structure for the 12-mer, composed of a mix of helix and less regular elements of secondary structure, lies 2.65 A RMS from the observed crystal structure. Structure prediction for the 8-mer beta-motif resulted in form 4.50 A RMS from the crystal geometry. For Villin, the predicted native form is very close to the crystal structure, RMS values of 3.5 A (including sidechains), and 1.01 A (main chain only). The methodology permits a detailed analysis of the molecular forces which dominate various segments of the predicted folding trajectory. Analysis of the results in terms of internal torsional, electrostatic and van der Waals and the electrostatic and non-electrostatic contributions to hydration, including the hydrophobic effect, is presented.
Collapse
Affiliation(s)
- Yongxing Liu
- Chemistry Department and Molecular Biophysics Program, Wesleyan University, Middletown, Connecticut 06457, USA
| | | |
Collapse
|
25
|
Affiliation(s)
- J G Saven
- Department of Chemistry, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| |
Collapse
|
26
|
Liwo A, Czaplewski C, Pillardy J, Scheraga HA. Cumulant-based expressions for the multibody terms for the correlation between local and electrostatic interactions in the united-residue force field. J Chem Phys 2001. [DOI: 10.1063/1.1383989] [Citation(s) in RCA: 207] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
27
|
Bastolla U, Farwer J, Knapp EW, Vendruscolo M. How to guarantee optimal stability for most representative structures in the Protein Data Bank. Proteins 2001; 44:79-96. [PMID: 11391771 DOI: 10.1002/prot.1075] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
We proposed recently an optimization method to derive energy parameters for simplified models of protein folding. The method is based on the maximization of the thermodynamic average of the overlap between protein native structures and a Boltzmann ensemble of alternative structures. Such a condition enforces protein models whose ground states are most similar to the corresponding native states. We present here an extensive testing of the method for a simple residue-residue contact energy function and for alternative structures generated by threading. The optimized energy function guarantees high stability and a well-correlated energy landscape to most representative structures in the PDB database. Failures in the recognition of the native structure can be attributed to the neglect of interactions between different chains in oligomeric proteins or with cofactors. When these are taken into account, only very few X-ray structures are not recognized. Most of them are short inhibitors or fragments and one is a structure that presents serious inconsistencies. Finally, we discuss the reasons that make NMR structures more difficult to recognizeCopyright 2001 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- U Bastolla
- Free University of Berlin, Department of Biology, Chemistry and Pharmacy, Institute of Chemistry, Berlin, Germany.
| | | | | | | |
Collapse
|
28
|
Pillardy J, Czaplewski C, Liwo A, Wedemeyer WJ, Lee J, Ripoll DR, Arłukowicz P, Ołdziej S, Arnautova YA, Scheraga HA. Development of Physics-Based Energy Functions that Predict Medium-Resolution Structures for Proteins of the α, β, and α/β Structural Classes. J Phys Chem B 2001. [DOI: 10.1021/jp0111012] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Jarosław Pillardy
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Cezary Czaplewski
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Adam Liwo
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - William J. Wedemeyer
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Jooyoung Lee
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Daniel R. Ripoll
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Piotr Arłukowicz
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Stanisław Ołdziej
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Yelena A. Arnautova
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland, and Cornell Theory Center, Ithaca, New York 14853-3801
| |
Collapse
|
29
|
Vendruscolo M. Assessment of the quality of energy functions for protein folding by using a criterion derived with the help of the noisy go model. J Biol Phys 2001; 27:205-15. [PMID: 23345744 PMCID: PMC3456590 DOI: 10.1023/a:1013152026788] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We propose a simple criterion based on the Z-scoreto assess the quality of energy functions for protein folding: one should obtain Z>-10 for the equilibrium ensembleat about native conditions. We derive this criterionby studying a Go model with random errors added to the native interactions. The dependence of the Z-score on the thermodynamic parameters,including the noise, can be precisely obtained in this case,as the ground state of the model is known exactly.We apply this criterion to rapidly rule out two otherwise promisingpairwise energy approximations.The advantage of adopting the present criterionis that it is not necessary to know the ground state of an energy function to assess its quality. It is sufficient to compute the Z-scorefrom a single equilibrium simulation at around the folding temperature.
Collapse
Affiliation(s)
- M. Vendruscolo
- Oxford Centre for Molecular Sciences, New Chemistry Laboratory, University of Oxford, Oxford, OX1 3QT UK
| |
Collapse
|
30
|
Abstract
Models in computational biology, such as those used in binding, docking, and folding, are often empirical and have adjustable parameters. Because few of these models are yet fully predictive, the problem may be nonoptimal choices of parameters. We describe an algorithm called ENPOP (energy function parameter optimization) that improves-and sometimes optimizes-the parameters for any given model and for any given search strategy that identifies the stable state of that model. ENPOP iteratively adjusts the parameters simultaneously to move the model global minimum energy conformation for each of m different molecules as close as possible to the true native conformations, based on some appropriate measure of structural error. A proof of principle is given for two very different test problems. The first involves three different two-dimensional model protein molecules having 12 to 37 monomers and four parameters in common. The parameters converge to the values used to design the model native structures. The second problem involves nine bumpy landscapes, each having between 4 and 12 degrees of freedom. For the three adjustable parameters, the globally optimal values are known in advance. ENPOP converges quickly to the correct parameter set.
Collapse
Affiliation(s)
- J B Rosen
- Computer Science and Engineering Department, University of California at San Diego, San Diego, California 92093 USA
| | | | | | | |
Collapse
|
31
|
|
32
|
Bastolla U, Vendruscolo M, Knapp EW. A statistical mechanical method to optimize energy functions for protein folding. Proc Natl Acad Sci U S A 2000; 97:3977-81. [PMID: 10760269 PMCID: PMC18127 DOI: 10.1073/pnas.97.8.3977] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present a method for deriving energy functions for protein folding by maximizing the thermodynamic average of the overlap with the native state. The method has been tested by using the pairwise contact approximation of the energy function and generating alternative structures by threading sequences over a database of 1, 169 structures. With the derived energy function, most native structures: (i) have minimal energy and (ii) are thermodynamically rather stable, and (iii) the corresponding energy landscapes are smooth. Precisely, 92% of the 1,013 x-ray structures are stabilized. Most failures can be attributed to the neglect of interactions between chains forming polychain proteins and of interactions with cofactors. When these are considered, only nine cases remain unexplained. In contrast, 38% of NMR structures are not assigned properly.
Collapse
Affiliation(s)
- U Bastolla
- Freie Universität Berlin, Department of Biology, Chemistry and Pharmacy, Takustrasse 6, D-14195 Berlin, Germany
| | | | | |
Collapse
|
33
|
Abstract
We present an analysis of the assumptions behind some of the most commonly used methods for evaluating the goodness of the fit between a sequence and a structure. Our studies on a lattice model show that methods based on statistical considerations are easy to use and can capture some of the features of protein-like sequences and their corresponding native states, but unfortunately are incapable of recognizing, with certainty, the native-like conformation of a sequence among a set of decoys. Meanwhile, an optimization method, entailing the determination of the parameters of an effective free energy of interaction, is much more reliable in recognizing the native state of a sequence. However, the statistical method is shown to perform quite well in tests of protein design.
Collapse
Affiliation(s)
- R I Dima
- Department of Physics and Center for Materials Physics, The Pennsylvania State University, University Park 16802, USA.
| | | | | |
Collapse
|
34
|
Leckband DE, Kuhl TL, Wang HK, Müller W, Herron J, Ringsdorf H. Force probe measurements of antibody-antigen interactions. Methods 2000; 20:329-40. [PMID: 10694455 DOI: 10.1006/meth.1999.0926] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The surface force apparatus has been used to quantify directly the forces that govern the interactions between proteins and ligands. In this work, we describe the measured interactions between the antigen fluorescein and the Fab' fragment of the monoclonal 4-4-20 anti-fluorescyl IgG antibody. Here we first describe the use of the surface force apparatus to demonstrate directly the impact of the charge composition in the region of the antibody binding site on the antibody interactions. Several approaches are described for immobilizing antigens, antibodies, and proteins in general for direct force measurements. The measured force profiles presented are accompanied by an extensive discussion of protocols used to analyze the force-distance curves and to interpret them in terms of the antibody structure. In addition to long-range electrostatic forces, we also consider short-range forces that can affect the strength of adhesion between the Fab' and immobilized fluorescein. The latter investigations demonstrate the influence of interfacial properties on the recognition of surface-bound antigens.
Collapse
Affiliation(s)
- D E Leckband
- Department of Chemical Engineering and Center for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | | | | | | | | | |
Collapse
|
35
|
Vendruscolo M, Najmanovich R, Domany E. Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading? Proteins 2000; 38:134-48. [PMID: 10656261 DOI: 10.1002/(sici)1097-0134(20000201)38:2<134::aid-prot3>3.0.co;2-a] [Citation(s) in RCA: 95] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
We present a method to derive contact energy parameters from large sets of proteins. The basic requirement on which our method is based is that for each protein in the database the native contact map has lower energy than all its decoy conformations that are obtained by threading. Only when this condition is satisfied one can use the proposed energy function for fold identification. Such a set of parameters can be found (by perceptron learning) if Mp, the number of proteins in the database, is not too large. Other aspects that influence the existence of such a solution are the exact definition of contact and the value of the critical distance Rc, below which two residues are considered to be in contact. Another important novel feature of our approach is its ability to determine whether an energy function of some suitable proposed form can or cannot be parameterized in a way that satisfies our basic requirement. As a demonstration of this, we determine the region in the (Rc, Mp) plane in which the problem is solvable, i.e., we can find a set of contact parameters that stabilize simultaneously all the native conformations. We show that for large enough databases the contact approximation to the energy cannot stabilize all the native folds even against the decoys obtained by gapless threading.
Collapse
Affiliation(s)
- M Vendruscolo
- Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot, Israel.
| | | | | |
Collapse
|
36
|
|
37
|
Vendruscolo M, Mirny LA, Shakhnovich EI, Domany E. Comparison of two optimization methods to derive energy parameters for protein folding: Perceptron andZ score. Proteins 2000. [DOI: 10.1002/1097-0134(20001101)41:2<192::aid-prot40>3.0.co;2-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
38
|
|
39
|
|
40
|
Vekhter B, Berry RS. Simulation of mutation: Influence of a “side group” on global minimum structure and dynamics of a protein model. J Chem Phys 1999. [DOI: 10.1063/1.479678] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
41
|
Liwo A, Lee J, Ripoll DR, Pillardy J, Scheraga HA. Protein structure prediction by global optimization of a potential energy function. Proc Natl Acad Sci U S A 1999; 96:5482-5. [PMID: 10318909 PMCID: PMC21885 DOI: 10.1073/pnas.96.10.5482] [Citation(s) in RCA: 122] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An approach based exclusively on finding the global minimum of an appropriate potential energy function has been used to predict the unknown structures of five globular proteins with sizes ranging from 89 to 140 amino acid residues. Comparison of the computed lowest-energy structures of two of them (HDEA and MarA) with the crystal structures, released by the Protein Data Bank after the predictions were made, shows that large fragments (61 residues) of both proteins were predicted with rms deviations of 4.2 and 6.0 A for the Calpha atoms, for HDEA and MarA, respectively. This represents 80% and 53% of the observed structures of HDEA and MarA, respectively. Similar rms deviations were obtained for approximately 60-residue fragments of the other three proteins. These results constitute an important step toward the prediction of protein structure based solely on global optimization of a potential energy function for a given amino acid sequence.
Collapse
Affiliation(s)
- A Liwo
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | | | | | |
Collapse
|
42
|
Abstract
By following a consistent line of physical reasoning, some fundamental understanding about the foldability of proteins has been achieved. In recent years, this has led to the development of a number of successful algorithms for optimizing potential energy functions for folding protein models. The differences between the folding mechanisms of simple, contact-based lattice proteins and more traditional, realistic protein models, however, still call for further development of the potentials in addition to the optimization approaches.
Collapse
Affiliation(s)
- M H Hao
- Boehringer Ingelheim Pharmaceuticals Inc. R6-5, 900 Ridgebury Road, PO Box 368, Ridgefield, CT 06877, USA
| | | |
Collapse
|
43
|
Lee J, Liwo A, Scheraga HA. Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. Proc Natl Acad Sci U S A 1999; 96:2025-30. [PMID: 10051588 PMCID: PMC26730 DOI: 10.1073/pnas.96.5.2025] [Citation(s) in RCA: 123] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The conformational space annealing (CSA) method for global optimization has been applied to the 10-55 fragment of the B-domain of staphylococcal protein A (protein A) and to a 75-residue protein, apo calbindin D9K (PDB ID code), by using the UNRES off-lattice united-residue force field. Although the potential was not calibrated with these two proteins, the native-like structures were found among the low-energy conformations, without the use of threading or secondary-structure predictions. This is because the CSA method can find many distinct families of low-energy conformations. Starting from random conformations, the CSA method found that there are two families of low-energy conformations for each of the two proteins, the native-like fold and its mirror image. The CSA method converged to the same low-energy folds in all cases studied, as opposed to other optimization methods. It appears that the CSA method with the UNRES force field, which is based on the thermodynamic hypothesis, can be used in prediction of protein structures in real time.
Collapse
Affiliation(s)
- J Lee
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | |
Collapse
|
44
|
Vendruscolo M, Domany E. Pairwise contact potentials are unsuitable for protein folding. J Chem Phys 1998. [DOI: 10.1063/1.477748] [Citation(s) in RCA: 115] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
45
|
Mirny LA, Shakhnovich EI. Protein structure prediction by threading. Why it works and why it does not. J Mol Biol 1998; 283:507-26. [PMID: 9769221 DOI: 10.1006/jmbi.1998.2092] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We developed a novel Monte Carlo threading algorithm which allows gaps and insertions both in the template structure and threaded sequence. The algorithm is able to find the optimal sequence-structure alignment and sample suboptimal alignments. Using our algorithm we performed sequence-structure alignments for a number of examples for three protein folds (ubiquitin, immunoglobulin and globin) using both "ideal" set of potentials (optimized to provide the best Z-score for a given protein) and more realistic knowledge-based potentials. Two physically different scenarios emerged. If a template structure is similar to the native one (within 2 A RMS), then (i) the optimal threading alignment is correct and robust with respect to deviations of the potential from the "ideal" one; (ii) suboptimal alignments are very similar to the optimal one; (iii) as Monte Carlo temperature decreases a sharp cooperative transition to the optimal alignment is observed. In contrast, if the template structure is only moderately close to the native structure (RMS greater than 3.5 A), then (i) the optimal alignment changes dramatically when an "ideal" potential is substituted by the real one; (ii) the structures of suboptimal alignments are very different from the optimal one, reducing the reliability of the alignment; (iii) the transition to the apparently optimal alignment is non-cooperative. In the intermediate cases when the RMS between the template and the native conformations is in the range between 2 A and 3.5 A, the success of threading alignment may depend on the quality of potentials used. These results are rationalized in terms of a threading free energy landscape. Possible ways to overcome the fundamental limitations of threading are discussed briefly.
Collapse
Affiliation(s)
- L A Mirny
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
| | | |
Collapse
|
46
|
Derreumaux P. Finding the low-energy forms of avian pancreatic polypeptide with the diffusion-process-controlled Monte Carlo method. J Chem Phys 1998. [DOI: 10.1063/1.476708] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
47
|
Abstract
The folding of single-domain globular proteins exhibits the character of first-order or two-state thermodynamics. The origin of such high cooperativity in relatively small polymer systems is still not well understood. Recently, the statistical mechanics of protein folding has been studied extensively with simple protein models such as short cubic-lattice chains with contact-based interactions. While many valuable insights about protein folding were gained with such models, some concerns have also arisen, viz. that they lack the character of protein backbones whose interactions would limit the folding patterns of proteins. Here, a comparative study of the conventional cubic-lattice chain model and a fine-grained more realistic lattice protein model with both backbone and side-chain interactions is carried out. It is found that, even though both types of models exhibit a cooperative two-state folding transition to the native structure with optimized force fields, the character and origin of cooperativity of the two models are different. In the simple contact-based model, the free-energy barrier occurs at the low end of the energy scale, and the cooperativity arises from a concerted formation of native contacts among many residues in a compact state. In the other more complicated model, the free-energy barrier occurs in the intermediate energy region, and the folding cooperativity arises from collective orientational arrangements of locally structured units in semi-open conformational states. On the basis of these results, two limiting molecular mechanisms for protein folding emerge, which can be used for analyzing the folding process of real proteins.
Collapse
Affiliation(s)
- M H Hao
- Baker Laboratory of Chemistry, Cornell University, Ithaca, NY, 14853-1301, USA
| | | |
Collapse
|
48
|
Koretke KK, Luthey-Schulten Z, Wolynes PG. Self-consistently optimized energy functions for protein structure prediction by molecular dynamics. Proc Natl Acad Sci U S A 1998; 95:2932-7. [PMID: 9501193 PMCID: PMC19672 DOI: 10.1073/pnas.95.6.2932] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The protein energy landscape theory is used to obtain optimal energy functions for protein structure prediction via simulated annealing. The analysis here takes advantage of a more complete statistical characterization of the protein energy landscape and thereby improves on previous approximations. This schema partially takes into account correlations in the energy landscape. It also incorporates the relationships between folding dynamics and characteristic energy scales that control the collapse of the proteins and modulate rigidity of short-range interactions. Simulated annealing for the optimal energy functions, which are associative memory hamiltonians using a database of folding patterns, generally leads to quantitatively correct structures. In some cases the algorithm achieves "creativity," i.e., structures result that are better than any homolog in the database.
Collapse
Affiliation(s)
- K K Koretke
- School of Chemical Sciences, University of Illinois, Urbana, IL 61801, USA
| | | | | |
Collapse
|
49
|
Abstract
We use two simple models and the energy landscape perspective to study protein folding kinetics. A major challenge has been to use the landscape perspective to interpret experimental data, which requires ensemble averaging over the microscopic trajectories usually observed in such models. Here, because of the simplicity of the model, this can be achieved. The kinetics of protein folding falls into two classes: multiple-exponential and two-state (single-exponential) kinetics. Experiments show that two-state relaxation times have "chevron plot" dependences on denaturant and non-Arrhenius dependences on temperature. We find that HP and HP+ models can account for these behaviors. The HP model often gives bumpy landscapes with many kinetic traps and multiple-exponential behavior, whereas the HP+ model gives more smooth funnels and two-state behavior. Multiple-exponential kinetics often involves fast collapse into kinetic traps and slower barrier climbing out of the traps. Two-state kinetics often involves entropic barriers where conformational searching limits the folding speed. Transition states and activation barriers need not define a single conformation; they can involve a broad ensemble of the conformations searched on the way to the native state. We find that unfolding is not always a direct reversal of the folding process.
Collapse
Affiliation(s)
- H S Chan
- Department of Pharmaceutical Chemistry, University of California, San Francisco 94143-1204, USA.
| | | |
Collapse
|
50
|
Huber T, Torda AE. Protein fold recognition without Boltzmann statistics or explicit physical basis. Protein Sci 1998; 7:142-9. [PMID: 9514269 PMCID: PMC2143813 DOI: 10.1002/pro.5560070115] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We present a fast method for finding optimal parameters for a low-resolution (threading) force field intended to distinguish correct from incorrect folds for a given protein sequence. In contrast to other methods, the parameterization uses information from >10(7) misfolded structures as well as a set of native sequence-structure pairs. In addition to testing the resulting force field's performance on the protein sequence threading problem, results are shown that characterize the number of parameters necessary for effective structure recognition.
Collapse
Affiliation(s)
- T Huber
- Research School of Chemistry, Australian National University, Canberra
| | | |
Collapse
|