51
|
Cui Y, Wong WH, Bornberg-Bauer E, Chan HS. Recombinatoric exploration of novel folded structures: a heteropolymer-based model of protein evolutionary landscapes. Proc Natl Acad Sci U S A 2002; 99:809-14. [PMID: 11805332 PMCID: PMC117387 DOI: 10.1073/pnas.022240299] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The role of recombination in evolution is compared with that of point mutations (substitutions) in the context of a simple, polymer physics-based model mapping between sequence (genotype) and conformational (phenotype) spaces. Crossovers and point mutations of lattice chains with a hydrophobic polar code are investigated. Sequences encoding for a single ground-state conformation are considered viable and used as model proteins. Point mutations lead to diffusive walks on the evolutionary landscape, whereas crossovers can "tunnel" through barriers of diminished fitness. The degree to which crossovers allow for more efficient sequence and structural exploration depends on the relative rates of point mutations versus that of crossovers and the dispersion in fitness that characterizes the ruggedness of the evolutionary landscape. The probability that a crossover between a pair of viable sequences results in viable sequences is an order of magnitude higher than random, implying that a sequence's overall propensity to encode uniquely is embodied partially in local signals. Consistent with this observation, certain hydrophobicity patterns are significantly more favored than others among fragments (i.e., subsequences) of sequences that encode uniquely, and examples reminiscent of autonomous folding units in real proteins are found. The number of structures explored by both crossovers and point mutations is always substantially larger than that via point mutations alone, but the corresponding numbers of sequences explored can be comparable when the evolutionary landscape is rugged. Efficient structural exploration requires intermediate nonextreme ratios between point-mutation and crossover rates.
Collapse
Affiliation(s)
- Yan Cui
- Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| | | | | | | |
Collapse
|
52
|
Plotkin SS, Onuchic JN. Structural and energetic heterogeneity in protein folding. I. Theory. J Chem Phys 2002. [DOI: 10.1063/1.1449866] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
53
|
Srinivas G, Bagchi B. Foldability and the funnel of HP-36 protein sequence: Use of hydropathy scale in protein folding. J Chem Phys 2002. [DOI: 10.1063/1.1467341] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
54
|
Broglia RA, Tiana G. Reading the three-dimensional structure of lattice model-designed proteins from their amino acid sequence. Proteins 2001; 45:421-7. [PMID: 11746689 DOI: 10.1002/prot.1158] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
While all the information required for the folding of a protein is contained in its amino acid sequence, one has not yet learned how to extract this information to predict the detailed, biological active, three-dimensional structure of a protein whose sequence is known. Using insight obtained from lattice model simulations of the folding of small proteins (fewer than 100 residues), in particular of the fact that this phenomenon is essentially controlled by conserved contacts (Mirny et al., Proc Natl Acad Sci USA 1995;92:1282) among (few) strongly interacting ("hot") amino acids (Tiana et al., J Chem Phys 1998;108:757-761), which also stabilize local elementary structures formed early in the folding process and leading to the (postcritical) folding core when they assemble together (Broglia et al., Proc Natl Acad Sci USA 1998;95:12930, Broglia & Tiana, J Chem Phys 2001;114:7267), we have worked out a successful strategy for reading the three-dimensional structure of lattice model-designed proteins from the knowledge of only their amino acid sequence and of the contact energies among the amino acids.
Collapse
Affiliation(s)
- R A Broglia
- Dipartimento di Fisica, Università di Milano, Milano, Italy.
| | | |
Collapse
|
55
|
Fink TM, Ball RC. How many conformations can a protein remember? PHYSICAL REVIEW LETTERS 2001; 87:198103. [PMID: 11690459 DOI: 10.1103/physrevlett.87.198103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2000] [Indexed: 05/23/2023]
Abstract
We show that a protein can be trained to recognize multiple conformations, analogous to an associative memory, and provide capacity calculations based on energy fluctuations and information theory. Unlike the linear capacity of a Hopfield network, the number of conformations which can be remembered by a protein sequence depends on the size of the amino acid alphabet as lnA, independent of protein length. This admits the possibility of certain proteins, such as prions, evolving to fold to independent stable conformations, as well as novel possibilities for protein and heteropolymer design.
Collapse
Affiliation(s)
- T M Fink
- Theory of Condensed Matter, Cavendish Laboratory, Cambridge CB3 0HE, United Kingdom.
| | | |
Collapse
|
56
|
Bastolla U, Farwer J, Knapp EW, Vendruscolo M. How to guarantee optimal stability for most representative structures in the Protein Data Bank. Proteins 2001; 44:79-96. [PMID: 11391771 DOI: 10.1002/prot.1075] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
We proposed recently an optimization method to derive energy parameters for simplified models of protein folding. The method is based on the maximization of the thermodynamic average of the overlap between protein native structures and a Boltzmann ensemble of alternative structures. Such a condition enforces protein models whose ground states are most similar to the corresponding native states. We present here an extensive testing of the method for a simple residue-residue contact energy function and for alternative structures generated by threading. The optimized energy function guarantees high stability and a well-correlated energy landscape to most representative structures in the PDB database. Failures in the recognition of the native structure can be attributed to the neglect of interactions between different chains in oligomeric proteins or with cofactors. When these are taken into account, only very few X-ray structures are not recognized. Most of them are short inhibitors or fragments and one is a structure that presents serious inconsistencies. Finally, we discuss the reasons that make NMR structures more difficult to recognizeCopyright 2001 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- U Bastolla
- Free University of Berlin, Department of Biology, Chemistry and Pharmacy, Institute of Chemistry, Berlin, Germany.
| | | | | | | |
Collapse
|
57
|
Tiana G, Broglia RA, Shakhnovich EI. Energy profile of the space of model protein sequences. J Biol Phys 2001; 27:147-59. [PMID: 23345740 PMCID: PMC3456584 DOI: 10.1023/a:1013151530254] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A numerical study of the energy landscape of the space of model proteinsequences is carried out. As a consequence of the heterogeneity of thecontact energies among amino acids, the energy landscape displays a veryrough profile, a behaviour typical of frustrated systems. This givesraise to a hierarchical clustering of low-energy sequences and can have evolutionary consequences.
Collapse
Affiliation(s)
- G Tiana
- The Niels Bohr Institute, University of Copenhagen, Bledgamsvej 16, 2100 Copenhagen, Denmark ; Department of Chemistry, Harvard University, 16 Oxford st., Cambridge, MA USA
| | | | | |
Collapse
|
58
|
Shimada J, Kussell EL, Shakhnovich EI. The folding thermodynamics and kinetics of crambin using an all-atom Monte Carlo simulation. J Mol Biol 2001; 308:79-95. [PMID: 11302709 DOI: 10.1006/jmbi.2001.4586] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We present a novel Monte Carlo simulation of protein folding, in which all heavy atoms are represented as interacting hard spheres. This model includes all degrees of freedom relevant to folding, all side-chain and backbone torsions, and uses a Go potential. In this study, we focus on the 46 residue alpha/beta protein crambin and two of its structural components, the helix and helix hairpin. For a wide range of temperatures, we recorded multiple folding events of these three structures from random coils to native conformations that differ by less than 1 A C(alpha) dRMS from their crystal structure coordinates. The thermodynamics and kinetic mechanism of the helix-coil transition obtained from our simulation shows excellent agreement with currently available experimental and molecular dynamics data. Based on insights obtained from folding its smaller structural components, a possible folding mechanism for crambin is proposed. We observed that the folding occurs via a cooperative, first order-like process, and that many folding pathways to the native state exist. One particular sequence of events constitutes a "fast-folding" pathway where kinetic traps are avoided. At very low temperatures, a kinetic trap arising from the incorrect packing of side-chains was observed. These results demonstrate that folding to the native state can be observed in a reasonable amount of time on desktop computers even when an all-atom representation is used, provided the energetics sufficiently stabilize the native state.
Collapse
Affiliation(s)
- J Shimada
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
| | | | | |
Collapse
|
59
|
Pereira de Araújo AF. Sequence rotation in N-dimensional space and the folding of hydrophobic protein models: Surpassing the diagonal unfolded state approximation. J Chem Phys 2001. [DOI: 10.1063/1.1329347] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
60
|
Plotkin SS, Onuchic JN. Investigation of routes and funnels in protein folding by free energy functional methods. Proc Natl Acad Sci U S A 2000; 97:6509-14. [PMID: 10841554 PMCID: PMC18640 DOI: 10.1073/pnas.97.12.6509] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We use a free energy functional theory to elucidate general properties of heterogeneously ordering, fast folding proteins, and we test our conclusions with lattice simulations. We find that both structural and energetic heterogeneity can lower the free energy barrier to folding. Correlating stronger contact energies with entropically likely contacts of a given native structure lowers the barrier, and anticorrelating the energies has the reverse effect. Designing in relatively mild energetic heterogeneity can eliminate the barrier completely at the transition temperature. Sequences with native energies tuned to fold uniformly, as well as sequences tuned to fold reliably by a single or a few routes, are rare. Sequences with weak native energetic heterogeneity are more common; their folding kinetics is more strongly determined by properties of the native structure. Sequences with different distributions of stability throughout the protein may still be good folders to the same structure. A measure of folding route narrowness is introduced that correlates with rate and that can give information about the intrinsic biases in ordering arising from native topology. This theoretical framework allows us to investigate systematically the coupled effects of energy and topology in protein folding and to interpret recent experiments that investigate these effects.
Collapse
Affiliation(s)
- S S Plotkin
- Department of Physics, University of California at San Diego, La Jolla, CA 92093-5003, USA.
| | | |
Collapse
|
61
|
Buchler NEG, Goldstein RA. Surveying determinants of protein structure designability across different energy models and amino-acid alphabets: A consensus. J Chem Phys 2000. [DOI: 10.1063/1.480893] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
62
|
Dinner AR, Verosub E, Karplus M. Use of a quantitative structure-property relationship to design larger model proteins that fold rapidly. PROTEIN ENGINEERING 1999; 12:909-17. [PMID: 10585496 DOI: 10.1093/protein/12.11.909] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
A quantitative structure-property relationship (QSPR) was used to design model protein sequences that fold repeatedly and relatively rapidly to stable target structures. The specific model was a 125-residue heteropolymer chain subject to Monte Carlo dynamics on a simple cubic lattice. The QSPR was derived from an analysis of a database of 200 sequences by a statistical method that uses a genetic algorithm to select the sequence attributes that are most important for folding and a neural network to determine the corresponding functional dependence of folding ability on the chosen attributes. The QSPR depends on the number of anti-parallel sheet contacts, the energy gap between the native state and quasi-continuous part of the spectrum and the total energy of the contacts between surface residues. Two Monte Carlo procedures were used in series to optimize both the target structures and the sequences. We generated 20 fully optimized sequences and 60 partially optimized control sequences and tested each for its ability to fold in dynamic MC simulations. Although sequences in which either the number of anti-parallel sheet contacts or the energy of the surface residues is non-optimal are capable of folding almost as well as fully optimized ones, sequences in which only the energy gap is optimized fold markedly more slowly. Implications of the results for the design of proteins are discussed.
Collapse
Affiliation(s)
- A R Dinner
- Department of Chemistry and Chemical Biology and Committee on Higher Degrees in Biophysics, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA and Laboratoire de Chimie Biophysique, Institut le Bel, Université Louis Pasteur, 4
| | | | | |
Collapse
|
63
|
Verkhivker GM, Rejto PA, Bouzida D, Arthurs S, Colson AB, Freer ST, Gehlhaar DK, Larson V, Luty BA, Marrone T, Rose PW. Towards understanding the mechanisms of molecular recognition by computer simulations of ligand-protein interactions. J Mol Recognit 1999; 12:371-89. [PMID: 10611647 DOI: 10.1002/(sici)1099-1352(199911/12)12:6<371::aid-jmr479>3.0.co;2-o] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The thermodynamic and kinetic aspects of molecular recognition for the methotrexate (MTX)-dihydrofolate reductase (DHFR) ligand-protein system are investigated by the binding energy landscape approach. The impact of 'hot' and 'cold' errors in ligand mutations on the thermodynamic stability of the native MTX-DHFR complex is analyzed, and relationships between the molecular recognition mechanism and the degree of ligand optimization are discussed. The nature and relative stability of intermediates and thermodynamic phases on the ligand-protein association pathway are studied, providing new insights into connections between protein folding and molecular recognition mechanisms, and cooperativity of ligand-protein binding. The results of kinetic docking simulations are rationalized based on the thermodynamic properties determined from equilibrium simulations and the shape of the underlying binding energy landscape. We show how evolutionary ligand selection for a receptor active site can produce well-optimized ligand-protein systems such as MTX-DHFR complex with the thermodynamically stable native structure and a direct transition mechanism of binding from unbound conformations to the unique native structure.
Collapse
Affiliation(s)
- G M Verkhivker
- Agouron Pharmaceuticals Inc., 3301 North Torrey Pines Court, La Jolla, CA 92037-1022, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
64
|
Pereira De Araújo AF. Folding protein models with a simple hydrophobic energy function: the fundamental importance of monomer inside/outside segregation. Proc Natl Acad Sci U S A 1999; 96:12482-7. [PMID: 10535948 PMCID: PMC22956 DOI: 10.1073/pnas.96.22.12482] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/1999] [Indexed: 11/18/2022] Open
Abstract
The present study explores a "hydrophobic" energy function for folding simulations of the protein lattice model. The contribution of each monomer to conformational energy is the product of its "hydrophobicity" and the number of contacts it makes, i.e., E(h,c) = -Sigma N/i=1 c(i)h(i) = -(h.c) is the negative scalar product between two vectors in N-dimensional cartesian space: h = (h1,., hN), which represents monomer hydrophobicities and is sequence-dependent; and c = (c(1),., c(N)), which represents the number of contacts made by each monomer and is conformation-dependent. A simple theoretical analysis shows that restrictions are imposed concomitantly on both sequences and native structures if the stability criterion for protein-like behavior is to be satisfied. Given a conformation with vector c, the best sequence is a vector h on the direction upon which the projection of c - c is maximal, where c is the diagonal vector with components equal to c, the average number of contacts per monomer in the unfolded state. Best native conformations are suggested to be not maximally compact, as assumed in many studies, but the ones with largest variance of contacts among its monomers, i.e., with monomers tending to occupy completely buried or completely exposed positions. This inside/outside segregation is reflected on an apolar/polar distribution on the corresponding sequence. Monte Carlo simulations in two dimensions corroborate this general scheme. Sequences targeted to conformations with large contact variances folded cooperatively with thermodynamics of a two-state transition. Sequences targeted to maximally compact conformations, which have lower contact variance, were either found to have degenerate ground state or to fold with much lower cooperativity.
Collapse
Affiliation(s)
- A F Pereira De Araújo
- Departamento de Biologia Celular, International Center of Condensed Matter Physics, Universidade de Brasília, Brasília-DF 70910-900, Brazil.
| |
Collapse
|
65
|
Shea JE, Onuchic JN, Brooks CL. Exploring the origins of topological frustration: design of a minimally frustrated model of fragment B of protein A. Proc Natl Acad Sci U S A 1999; 96:12512-7. [PMID: 10535953 PMCID: PMC22965 DOI: 10.1073/pnas.96.22.12512] [Citation(s) in RCA: 186] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Topological frustration in an energetically unfrustrated off-lattice model of the helical protein fragment B of protein A from Staphylococcus aureus was investigated. This G-type model exhibited thermodynamic and kinetic signatures of a well-designed two-state folder with concurrent collapse and folding transitions and single exponential kinetics at the transition temperature. Topological frustration is determined in the absence of energetic frustration by the distribution of Fersht phi values. Topologically unfrustrated systems present a unimodal distribution sharply peaked at intermediate phi, whereas highly frustrated systems display a bimodal distribution peaked at low and high phi values. The distribution of phi values in protein A was determined both thermodynamically and kinetically. Both methods yielded a unimodal distribution centered at phi = 0.3 with tails extending to low and high phi values, indicating the presence of a small amount of topological frustration. The contacts with high phi values were located in the turn regions between helices I and II and II and III, intimating that these hairpins are in large part required in the transition state. Our results are in good agreement with all-atom simulations of protein A, as well as lattice simulations of a three- letter code 27-mer (which can be compared with a 60-residue helical protein). The relatively broad unimodal distribution of phi values obtained from the all-atom simulations and that from the minimalist model for the same native fold suggest that the structure of the transition state ensemble is determined mostly by the protein topology and not energetic frustration.
Collapse
Affiliation(s)
- J E Shea
- Department of Molecular Biology, TPC6, The Scripps Research Institute, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
66
|
Buchler NEG, Goldstein RA. Universal correlation between energy gap and foldability for the random energy model and lattice proteins. J Chem Phys 1999. [DOI: 10.1063/1.479951] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
67
|
Bornberg-Bauer E, Chan HS. Modeling evolutionary landscapes: mutational stability, topology, and superfunnels in sequence space. Proc Natl Acad Sci U S A 1999; 96:10689-94. [PMID: 10485887 PMCID: PMC17944 DOI: 10.1073/pnas.96.19.10689] [Citation(s) in RCA: 196] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Random mutations under neutral or near-neutral conditions are studied by considering plausible evolutionary trajectories on "neutral nets"-i.e., collections of sequences (genotypes) interconnected via single-point mutations encoding for the same ground-state structure (phenotype). We use simple exact lattice models for the mapping between sequence and conformational spaces. Densities of states based on model intrachain interactions are determined by exhaustive conformational enumeration. We compare results from two very different interaction schemes to ascertain robustness of the conclusions. In both models, sequences in a majority of neutral nets center around a single "prototype sequence" of maximum mutational stability, tolerating the largest number of neutral mutations. General analytical considerations show that these topologies by themselves lead to higher steady-state evolutionary populations at prototype sequences. On average, native thermodynamic stability increases toward a maximum at the prototype sequence, resulting in funnel-like arrangements of native stabilities in sequence space. These observations offer a unified perspective on sequence design, native stability, and mutational stability of proteins. These principles are generalizable from native stability to any measure of fitness provided that its variation with respect to mutations is essentially smooth.
Collapse
Affiliation(s)
- E Bornberg-Bauer
- Theoretical Bioinformatics Group, German Cancer Research Centre Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
| | | |
Collapse
|
68
|
Lednev IK, Karnoup AS, Sparrow MC, Asher SA. α-Helix Peptide Folding and Unfolding Activation Barriers: A Nanosecond UV Resonance Raman Study. J Am Chem Soc 1999. [DOI: 10.1021/ja991382f] [Citation(s) in RCA: 158] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Igor K. Lednev
- Contribution form the Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Anton S. Karnoup
- Contribution form the Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Mark C. Sparrow
- Contribution form the Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Sanford A. Asher
- Contribution form the Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| |
Collapse
|
69
|
Dima RI, Banavar JR, Cieplak M, Maritan A. Statistical mechanics of protein-like heteropolymers. Proc Natl Acad Sci U S A 1999; 96:4904-7. [PMID: 10220391 PMCID: PMC21789 DOI: 10.1073/pnas.96.9.4904] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A strategy is outlined for obtaining the free energy of a typical designed heteropolymer. The design procedure considers the probability that the target conformation is occupied in comparison with all the other conformations that could house the given sequence. Numerical calculations on lattice heteropolymer models are presented to illustrate the key physical principles.
Collapse
Affiliation(s)
- R I Dima
- Department of Physics and Center for Materials Physics, 104 Davey Laboratory, Pennsylvania State University, University Park, PA 16802, USA.
| | | | | | | |
Collapse
|
70
|
|
71
|
Dokholyan NV, Buldyrev SV, Stanley HE, Shakhnovich EI. Discrete molecular dynamics studies of the folding of a protein-like model. FOLDING & DESIGN 1999; 3:577-87. [PMID: 9889167 DOI: 10.1016/s1359-0278(98)00072-8] [Citation(s) in RCA: 238] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
BACKGROUND Many attempts have been made to resolve in time the folding of model proteins in computer simulations. Different computational approaches have emerged. Some of these approaches suffer from insensitivity to the geometrical properties of the proteins (lattice models), whereas others are computationally heavy (traditional molecular dynamics). RESULTS We used the recently proposed approach of Zhou and Karplus to study the folding of a protein model based on the discrete time molecular dynamics algorithm. We show that this algorithm resolves with respect to time the folding <--> unfolding transition. In addition, we demonstrate the ability to study the core of the model protein. CONCLUSIONS The algorithm along with the model of interresidue interactions can serve as a tool for studying the thermodynamics and kinetics of protein models.
Collapse
Affiliation(s)
- N V Dokholyan
- Center for Polymer Studies, Physics Department, Boston University, MA 02215, USA.
| | | | | | | |
Collapse
|
72
|
|
73
|
|
74
|
Kaffe-Abramovich T, Unger R. A simple model for evolution of proteins towards the global minimum of free energy. FOLDING & DESIGN 1998; 3:389-99. [PMID: 9806940 DOI: 10.1016/s1359-0278(98)00052-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
BACKGROUND Proteins seem to have their native structure in a global minimum of free energy. No mechanism is known, however, for ensuring this property. Furthermore, computational complexity studies suggest that such a mechanism is not feasible. These seemingly contradictory observations can be reconciled by the suggestion that evolutionary selection can yield proteins whose native conformation is in the global minimum of free energy. The aim of this study is to investigate such evolutionary processes in a simple model of protein folding. RESULTS Three possible evolutionary processes are explored.First, if the free energy of the chain is kept below a fixed threshold there is no improvement towards the global minimum. Second, if free energy is minimized directly, sequences emerge whose native conformation is in the global minimum of free energy. Third, when evolutionary pressure is applied within a small set of close homologs, sequences emerge whose functional conformation is in the global minimum of free energy. CONCLUSIONS Although minimizing free energy does select for sequences whose functional conformation is in the global free energy minimum, we argue that for most proteins, which typically have free energy values of only 5-15 kcal/mol, such evolutionary pressure cannot be considered biologically plausible. In contrast, by repeatedly forcing sequences to avoid drifting towards competing "non-native" conformations, sequences emerge whose native conformation becomes very close to the global minimum of free energy. We argue that such a mechanism is both efficient and biologically plausible.
Collapse
|
75
|
Abstract
We investigate the folding of a 125-bead heteropolymer model for proteins subject to Monte Carlo dynamics on a simple cubic lattice. Detailed study of a few sequences revealed a folding mechanism consisting of a rapid collapse followed by a slow search for a stable core that served as the transition state for folding to a near-native intermediate. Rearrangement from the intermediate to the native state slowed folding further because it required breaking native-like local structure between surface monomers so that those residues could condense onto the core. We demonstrate here the generality of this mechanism by a statistical analysis of a 200 sequence database using a method that employs a genetic algorithm to pick the sequence attributes that are most important for folding and an artificial neural network to derive the corresponding functional dependence of folding ability on the chosen sequence attributes [quantitative structure-property relationships (QSPRs)]. QSPRs that use three sequence attributes yielded substantially more accurate predictions than those that use only one. The results suggest that efficient search for the core is dependent on both the native state's overall stability and its amount of kinetically accessible, cooperative structure, whereas rearrangement from the intermediate is facilitated by destabilization of contacts between surface monomers. Implications for folding and design are discussed.
Collapse
Affiliation(s)
- A R Dinner
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | | | | |
Collapse
|
76
|
Dokholyan NV, Buldyrev SV, Stanley HE, Shakhnovich EI. Discrete molecular dynamics studies of the folding of a protein-like model. ACTA ACUST UNITED AC 1998. [DOI: 10.1016/s1359-0278%2898%2900072-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
77
|
Klimov DK, Thirumalai D. Linking rates of folding in lattice models of proteins with underlying thermodynamic characteristics. J Chem Phys 1998. [DOI: 10.1063/1.477012] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
78
|
|
79
|
Affiliation(s)
- Jae-Min Shin
- Molecular Design Laboratory, Hanhyo Institutes of Technology, 461-6 Chunmin-Dong, Yoosung-Ku, Taejun 305-390, Korea
| | - Won Seok Oh
- Molecular Design Laboratory, Hanhyo Institutes of Technology, 461-6 Chunmin-Dong, Yoosung-Ku, Taejun 305-390, Korea
| |
Collapse
|
80
|
Shakhnovich EI. Protein design: a perspective from simple tractable models. FOLDING & DESIGN 1998; 3:R45-58. [PMID: 9562552 DOI: 10.1016/s1359-0278(98)00021-2] [Citation(s) in RCA: 141] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Recent progress in computational approaches to protein design builds on advances in statistical mechanical protein folding theory. Here, the number of sequences folding into a given conformation is evaluated and a simple condition for a protein model's designability is outlined.
Collapse
Affiliation(s)
- EI Shakhnovich
- Harvard University Department of Chemistry and Chemical Biology 12 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
81
|
|
82
|
Abstract
The validity of the thermodynamic hypothesis of protein folding was explored by simulating the evolution of protein sequences. Simple models of lattice proteins were allowed to evolve by random point mutations subject to the constraint that they fold into a predetermined native structure with a Monte Carlo folding algorithm. We employed a simple analytical approach to compute the probability of violation of the thermodynamic hypothesis as a function of the size of the protein, the fraction of the total number of possible conformations which are kinetically accessible, and the roughness of the free-energy landscape. It was found that even if the folding is under kinetic control, the sequence will evolve so that the native state is most often the state of minimum free energy.
Collapse
Affiliation(s)
- S Govindarajan
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | | |
Collapse
|
83
|
Mirny LA, Abkevich VI, Shakhnovich EI. How evolution makes proteins fold quickly. Proc Natl Acad Sci U S A 1998; 95:4976-81. [PMID: 9560213 PMCID: PMC20198 DOI: 10.1073/pnas.95.9.4976] [Citation(s) in RCA: 150] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Sequences of fast-folding model proteins (48 residues long on a cubic lattice) were generated by an evolution-like selection toward fast folding. We find that fast-folding proteins exhibit a specific folding mechanism in which all transition state conformations share a smaller subset of common contacts (folding nucleus). Acceleration of folding was accompanied by dramatic strengthening of interactions in the folding nucleus whereas average energy of nonnucleus interactions remained largely unchanged. Furthermore, the residues involved in the nucleus are the most conserved ones within families of evolved sequences. Our results imply that for each protein structure there is a small number of conserved positions that are key determinants of fast folding into that structure. This conjecture was tested on two protein superfamilies: the first having the classical monophosphate binding fold (CMBF; 98 families) and the second having type-III repeat fold (47 families). For each superfamily, we discovered a few positions that exhibit very strong and statistically significant "conservatism of conservatism"-amino acids in those positions are conserved within every family whereas the actual types of amino acids varied from family to family. Those amino acids are in spatial contact with each other. The experimental data of Serrano and coworkers [Lopez-Hernandez, E. & Serrano, L. (1996) Fold. Des. (London) 1, 43-55]. for one of the proteins of the CMBF superfamily (CheY) show that residues identified this way indeed belong to the folding nucleus. Further analysis revealed deep connections between nucleation in CMBF proteins and their function.
Collapse
Affiliation(s)
- L A Mirny
- Harvard University, Department of Chemistry and Chemical Biology, 12 Oxford Street, Cambridge MA 02138, USA
| | | | | |
Collapse
|
84
|
Gutin A, Sali A, Abkevich V, Karplus M, Shakhnovich EI. Temperature dependence of the folding rate in a simple protein model: Search for a “glass” transition. J Chem Phys 1998. [DOI: 10.1063/1.476053] [Citation(s) in RCA: 69] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
85
|
Irbäck A, Sandelin E. Local interactions and protein folding: A model study on the square and triangular lattices. J Chem Phys 1998. [DOI: 10.1063/1.475605] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
86
|
Tiana G, Broglia RA, Roman HE, Vigezzi E, Shakhnovich E. Folding and misfolding of designed proteinlike chains with mutations. J Chem Phys 1998. [DOI: 10.1063/1.475435] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
87
|
Du R, Pande VS, Grosberg AY, Tanaka T, Shakhnovich ES. On the transition coordinate for protein folding. J Chem Phys 1998. [DOI: 10.1063/1.475393] [Citation(s) in RCA: 434] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
88
|
Abstract
The energy landscape theory of protein folding is a statistical description of a protein's potential surface. It assumes that folding occurs through organizing an ensemble of structures rather than through only a few uniquely defined structural intermediates. It suggests that the most realistic model of a protein is a minimally frustrated heteropolymer with a rugged funnel-like landscape biased toward the native structure. This statistical description has been developed using tools from the statistical mechanics of disordered systems, polymers, and phase transitions of finite systems. We review here its analytical background and contrast the phenomena in homopolymers, random heteropolymers, and protein-like heteropolymers that are kinetically and thermodynamically capable of folding. The connection between these statistical concepts and the results of minimalist models used in computer simulations is discussed. The review concludes with a brief discussion of how the theory helps in the interpretation of results from fast folding experiments and in the practical task of protein structure prediction.
Collapse
Affiliation(s)
- J N Onuchic
- Department of Physics, University of California at San Diego, La Jolla 92093-0319, USA
| | | | | |
Collapse
|
89
|
Abstract
We model the evolution of simple lattice proteins as a random walk in a fitness landscape, where the fitness represents the ability of the protein to fold. At higher selective pressure, the evolutionary trajectories are confined to neutral networks where the native structure is conserved and the dynamics are non self-averaging and nonexponential. The optimizability of the corresponding native structure has a strong effect on the size of these neutral networks and thus on the nature of the evolutionary process.
Collapse
Affiliation(s)
- S Govindarajan
- Department of Chemistry, University of Michigan, Ann Arbor 48109-1055, USA
| | | |
Collapse
|
90
|
Abstract
The thermodynamic properties of a 46-mer beta-barrel protein model are investigated using Langevin dynamics and the histogram analysis method. By obtaining the density of states distribution and using the methods of statistical mechanics, we are able to identify the thermodynamic transitions for this model protein and characterize the nature of these transitions. Consistent with an earlier study of this model, we find that the transition from a random coil state to a manifold of collapsed but nonnative states is a continuous transition, and the transition from the manifold of collapsed states to the native state is first order-like. However, our calculations indicate that the folding transition is only weakly first order. Most importantly, we are able to characterize the free energy surface of the protein model, as well as the processes of compaction and native structure formation, from a statistical point of view. We also examined the thermodynamic transition state. By combining the earlier kinetic analysis for the same protein model, we provide a more complete description of this model protein and propose possible further modifications of the model to improve its stability and foldability.
Collapse
Affiliation(s)
- Z Guo
- Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037, USA
| | | |
Collapse
|
91
|
Abstract
Molecular evolution may be considered as a walk in a multidimensional fitness landscape, where the fitness at each point is associated with features such as the function, stability, and survivability of these molecules. We present a simple model for the evolution of protein sequences on a landscape with a precisely defined fitness function. We use simple lattice models to represent protein structures, with the ability of a protein sequence to fold into the structure with lowest energy, quantified as the foldability, representing the fitness of the sequence. The foldability of the sequence is characterized based on the spin glass model of protein folding. We consider evolution as a walk in this foldability landscape and study the nature of the landscape and the resulting dynamics. Selective pressure is explicitly included in this model in the form of a minimum foldability requirement. We find that different native structures are not evenly distributed in interaction space, with similar structures and structures with similar optimal foldabilities clustered together. Evolving proteins marginally fulfill the selective criteria of foldability. As the selective pressure is increased, evolutionary trajectories become increasingly confined to "neutral networks," where the sequence and the interactions can be significantly changed while a constant structure is maintained.
Collapse
Affiliation(s)
- S Govindarajan
- Department of Chemistry, University of Michigan, Ann Arbor 48109-1055, USA
| | | |
Collapse
|
92
|
Scalley ML, Baker D. Protein folding kinetics exhibit an Arrhenius temperature dependence when corrected for the temperature dependence of protein stability. Proc Natl Acad Sci U S A 1997; 94:10636-40. [PMID: 9380687 PMCID: PMC23430 DOI: 10.1073/pnas.94.20.10636] [Citation(s) in RCA: 138] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/1997] [Accepted: 08/01/1997] [Indexed: 02/05/2023] Open
Abstract
The anomalous temperature dependence of protein folding has received considerable attention. Here we show that the temperature dependence of the folding of protein L becomes extremely simple when the effects of temperature on protein stability are corrected for; the logarithm of the folding rate is a linear function of 1/T on constant stability contours in the temperature-denaturant plane. This convincingly demonstrates that the anomalous temperature dependence of folding derives from the temperature dependence of the interactions that stabilize proteins, rather than from the super Arrhenius temperature dependence predicted for the configurational diffusion constant on a rough energy landscape. However, because of the limited temperature range accessible to experiment, the results do not rule out models with higher order temperature dependences. The significance of the slope of the stability-corrected Arrhenius plots is discussed.
Collapse
Affiliation(s)
- M L Scalley
- Department of Biochemistry 357350, University of Washington, Seattle, WA 98195, USA
| | | |
Collapse
|
93
|
Sosnick TR, Shtilerman MD, Mayne L, Englander SW. Ultrafast signals in protein folding and the polypeptide contracted state. Proc Natl Acad Sci U S A 1997; 94:8545-50. [PMID: 9238013 PMCID: PMC23003 DOI: 10.1073/pnas.94.16.8545] [Citation(s) in RCA: 123] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/04/1997] [Indexed: 02/04/2023] Open
Abstract
To test the significance of ultrafast protein folding signals (<<1 msec), we studied cytochrome c (Cyt c) and two Cyt c fragments with major C-terminal segments deleted. The fragments remain unfolded under all conditions and so could be used to define the unfolded baselines for protein fluorescence and circular dichroism (CD) as a function of denaturant concentration. When diluted from high to low denaturant in kinetic folding experiments, the fragments readjust to their new baseline values in a "burst phase" within the mixing dead time. The fragment burst phase reflects a contraction of the polypeptide from a more extended unfolded condition at high denaturant to a more contracted unfolded condition in the poorer, low denaturant solvent. Holo Cyt c exhibits fluorescence and CD burst phase signals that are essentially identical to the fragment signals over the whole range of final denaturant concentrations, evidently reflecting the same solvent-dependent, relatively nonspecific contraction and not the formation of a specific folding intermediate. The significance of fast folding signals in Cyt c and other proteins is discussed in relation to the hypothesis of an initial rate-limiting search-nucleation-collapse step in protein folding [Sosnick, T. R., Mayne, L. & Englander, S. W. (1996) Proteins Struct. Funct. Genet. 24, 413-426].
Collapse
Affiliation(s)
- T R Sosnick
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA 19104-6059, USA
| | | | | | | |
Collapse
|
94
|
Shah NK, Rejto PA, Verkhivker GM. Structural consensus in ligand-protein docking identifies recognition peptide motifs that bind streptavidin. Proteins 1997. [DOI: 10.1002/(sici)1097-0134(199707)28:3<421::aid-prot11>3.0.co;2-j] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
95
|
Abstract
Recently, protein-folding models have advanced to the point where folding simulations of protein-like chains of reasonable length (up to 125 amino acids) are feasible, and the major physical features of folding proteins, such as cooperativity in thermodynamics and nucleation mechanisms in kinetics, can be reproduced. This has allowed deep insight into the physical mechanism of folding, including the solution of the so-called 'Levinthal paradox'.
Collapse
Affiliation(s)
- E I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
96
|
|
97
|
Mirny LA, Shakhnovich EI. How to derive a protein folding potential? A new approach to an old problem. J Mol Biol 1996; 264:1164-79. [PMID: 9000638 DOI: 10.1006/jmbi.1996.0704] [Citation(s) in RCA: 230] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
In this paper we introduce a novel method of deriving a pairwise potential for protein folding. The potential is obtained by an optimization procedure that simultaneously maximizes thermodynamic stability for all proteins in the database. When applied to the representative dataset of proteins and with the energy function taken in pairwise contact approximation, our potential scored somewhat better than existing ones. However, the discrimination of the native structure from decoys is still not strong enough to make the potential useful for ab initio folding. Our results suggest that the problem lies with pairwise amino acid contact approximation and/or simplified presentation of proteins rather than with the derivation of potential. We argue that more detail of protein structure and energetics should be taken into account to achieve energy gaps. The suggested method is general enough to allow us to systematically derive parameters for more sophisticated energy functions. The internal control of validity for the potential derived by our method is convergence to a unique solution upon addition of new proteins to the database. The method is tested on simple model systems where sequences are designed, using the preset "true" potential, to have low energy in a dataset of structures. Our procedure is able to recover the potential with correlation r approximately 91% with the true one and we were able to fold all model structures using the recovered potential. Other statistical knowledge-based approaches were tested using this model and the results indicate that they also can recover the true potential with high degree of accuracy.
Collapse
Affiliation(s)
- L A Mirny
- Harvard University, Department of Chemistry, Cambridge, MA 02138, USA
| | | |
Collapse
|
98
|
Rejto PA, Verkhivker GM. Unraveling principles of lead discovery: from unfrustrated energy landscapes to novel molecular anchors. Proc Natl Acad Sci U S A 1996; 93:8945-50. [PMID: 8799133 PMCID: PMC38574 DOI: 10.1073/pnas.93.17.8945] [Citation(s) in RCA: 72] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
The search for novel leads is a critical step in the drug discovery process. Computational approaches to identify new lead molecules have focused on discovering complete ligands by evaluating the binding affinity of a large number of candidates, a task of considerable complexity. A new computational method is introduced in this work based on the premise that the primary molecular recognition event in the protein binding site may be accomplished by small core fragments that serve as molecular anchors, providing a structurally stable platform that can be subsequently tailored into complete ligands. To fulfill its role, we show that an effective molecular anchor must meet both the thermodynamic requirement of relative energetic stability of a single binding mode and its consistent kinetic accessibility, which may be measured by the structural consensus of multiple docking simulations. From a large number of candidates, this technique is able to identify known core fragments responsible for primary recognition by the FK506 binding protein (FKBP-12), along with a diverse repertoire of novel molecular cores. By contrast, absolute energetic criteria for selecting molecular anchors are found to be promiscuous. A relationship between a minimum frustration principle of binding energy landscapes and receptor-specific molecular anchors in their role as "recognition nuclei" is established, thereby unraveling a mechanism of lead discovery and providing a practical route to receptor-biased computational combinatorial chemistry.
Collapse
Affiliation(s)
- P A Rejto
- Agouron Pharmaceuticals Inc., San Diego, CA 92121, USA
| | | |
Collapse
|
99
|
Abstract
Rapid photochemical electron injection into unfolded ferricytochrome c titrated with 2.3 to 4.6 M guanidine hydrochloride (GuHCL) at pH 7 and 40 degrees C produced unfolded ferrocytochrome, which then converted to the folded protein. Two folding phases were observed: a fast process with a time constant of 40 microseconds (4.6 M GuHCL), and a slower phase with a rate constant of 90 +/- 20 per second (2.3 M GuHCL). The activation free energy for the slow step varied linearly with GuHCL concentration; the rate constant, extrapolated to aqueous solution, was 7600 per second. Electron-transfer methods can bridge the nanosecond to millisecond measurement time gap for protein folding.
Collapse
Affiliation(s)
- T Pascher
- Beckman Institute, California Institute of Technology, Pasadena, 91125, USA
| | | | | | | |
Collapse
|
100
|
Abstract
Proteins fold to unique compact native structures. Perhaps other polymers could be designed to fold in similar ways. The chemical nature of the monomer "alphabet" determines the "energy matrix" of monomer interactions-which defines the folding code, the relationship between sequence and structure. We study two properties of energy matrices using two-dimensional lattice models: uniqueness, the number of sequences that fold to only one structure, and encodability, the number of folds that are unique lowest-energy structures of certain monomer sequences. For the simplest model folding code, involving binary sequences of H (hydrophobic) and P (polar) monomers, only a small fraction of sequences fold uniquely, and not all structures can be encoded. Adding strong repulsive interactions results in a folding code with more sequences folding uniquely and more designable folds. Some theories suggest that the quality of a folding code depends only on the number of letters in the monomer alphabet, but we find that the energy matrix itself can be at least as important as the size of the alphabet. Certain multi-letter codes, including some with 20 letters, may be less physical or protein-like than codes with smaller numbers of letters because they neglect correlations among inter-residue interactions, treat only maximally compact conformations, or add arbitrary energies to the energy matrix.
Collapse
Affiliation(s)
- H S Chan
- Department of Pharmaceutical Chemistry, University of California, San Francisco 94143-1204, USA
| | | |
Collapse
|