51
|
Abstract
Protein folding should be complex. Proteins organize themselves into specific three-dimensional structures, through a myriad of conformational changes. The classical view of protein folding describes this process as a nearly sequential series of discrete intermediates. In contrast, the energy landscape theory of folding considers folding as the progressive organization of an ensemble of partially folded structures through which the protein passes on its way to the natively folded structure. As a result of evolution, proteins have a rugged funnel-like landscape biased toward the native structure. Connecting theory and simulations of minimalist models with experiments has completely revolutionized our understanding of the underlying mechanisms that control protein folding.
Collapse
Affiliation(s)
- José Nelson Onuchic
- Center for Theoretical Biological Physics, Department of Physics, University of California at San Diego, La Jolla 92093, USA.
| | | |
Collapse
|
52
|
Abstract
The fastest simple, kinetically two-state protein folds a million times more rapidly than the slowest. Here we review many recent theories of protein folding kinetics in terms of their ability to qualitatively rationalize, if not quantitatively predict, this fundamental experimental observation.
Collapse
Affiliation(s)
- Blake Gillespie
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, Santa Barbara, California 93106, USA.
| | | |
Collapse
|
53
|
Yang S, Cho SS, Levy Y, Cheung MS, Levine H, Wolynes PG, Onuchic JN. Domain swapping is a consequence of minimal frustration. Proc Natl Acad Sci U S A 2004; 101:13786-91. [PMID: 15361578 PMCID: PMC518834 DOI: 10.1073/pnas.0403724101] [Citation(s) in RCA: 153] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The same energy landscape principles associated with the folding of proteins into their monomeric conformations should also describe how these proteins oligomerize into domain-swapped conformations. We tested this hypothesis by using a simplified model for the epidermal growth factor receptor pathway substrate 8 src homology 3 domain protein, both of whose monomeric and domain-swapped structures have been solved. The model, which we call the symmetrized Gō-type model, incorporates only information regarding the monomeric conformation in an energy function for the dimer to predict the domain-swapped conformation. A striking preference for the correct domain-swapped structure was observed, indicating that overall monomer topology is a main determinant of the structure of domain-swapped dimers. Furthermore, we explore the free energy surface for domain swapping by using our model to characterize the mechanism of oligomerization.
Collapse
Affiliation(s)
- Sichun Yang
- Center for Theoretical Biological Physics and Department of Physics, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | | | | | | | | | | | | |
Collapse
|
54
|
Abstract
Computational protein design continues to experience a variety of methodological advances. Several improvements have been suggested for the objective functions used to quantify sequence/structure compatibility. Disparate design strategies based upon dead-end elimination, simulated annealing and statistical design have each recently yielded striking successes involving de novo designed proteins with sizes on the order of 100 residues or greater. Such methods may be used to design new proteins, as well as to redesign natural proteins to facilitate structural and biophysical studies.
Collapse
Affiliation(s)
- Sheldon Park
- Makineni Theoretical Laboratories and Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, Pennsylvania 19104, USA
| | | | | |
Collapse
|
55
|
Klepeis JL, Floudas CA, Morikis D, Tsokos CG, Lambris JD. Design of Peptide Analogues with Improved Activity Using a Novel de Novo Protein Design Approach. Ind Eng Chem Res 2004. [DOI: 10.1021/ie0340995] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- J. L. Klepeis
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, Department of Chemical and Environmental Engineering, University of California at Riverside, Riverside, California 92093-0359, and Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| | - C. A. Floudas
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, Department of Chemical and Environmental Engineering, University of California at Riverside, Riverside, California 92093-0359, and Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| | - D. Morikis
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, Department of Chemical and Environmental Engineering, University of California at Riverside, Riverside, California 92093-0359, and Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| | - C. G. Tsokos
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, Department of Chemical and Environmental Engineering, University of California at Riverside, Riverside, California 92093-0359, and Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| | - J. D. Lambris
- Department of Chemical Engineering, Princeton University, Princeton, New Jersey 08544-5263, Department of Chemical and Environmental Engineering, University of California at Riverside, Riverside, California 92093-0359, and Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104
| |
Collapse
|
56
|
Fujitsuka Y, Takada S, Luthey-Schulten ZA, Wolynes PG. Optimizing physical energy functions for protein folding. Proteins 2004; 54:88-103. [PMID: 14705026 DOI: 10.1002/prot.10429] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
We optimize a physical energy function for proteins with the use of the available structural database and perform three benchmark tests of the performance: (1) recognition of native structures in the background of predefined decoy sets of Levitt, (2) de novo structure prediction using fragment assembly sampling, and (3) molecular dynamics simulations. The energy parameter optimization is based on the energy landscape theory and uses a Monte Carlo search to find a set of parameters that seeks the largest ratio deltaE(s)/DeltaE for all proteins in a training set simultaneously. Here, deltaE(s) is the stability gap between the native and the average in the denatured states and DeltaE is the energy fluctuation among these states. Some of the energy parameters optimized are found to show significant correlation with experimentally observed quantities: (1) In the recognition test, the optimized function assigns the lowest energy to either the native or a near-native structure among many decoy structures for all the proteins studied. (2) Structure prediction with the fragment assembly sampling gives structure models with root mean square deviation less than 6 A in one of the top five cluster centers for five of six proteins studied. (3) Structure prediction using molecular dynamics simulation gives poorer performance, implying the importance of having a more precise description of local structures. The physical energy function solely inferred from a structural database neither utilizes sequence information from the family of the target nor the outcome of the secondary structure prediction but can produce the correct native fold for many small proteins.
Collapse
Affiliation(s)
- Yoshimi Fujitsuka
- Graduate School of Science and Technology, Kobe University, Nada, Kobe, Japan
| | | | | | | |
Collapse
|
57
|
Khatun J, Khare SD, Dokholyan NV. Can Contact Potentials Reliably Predict Stability of Proteins? J Mol Biol 2004; 336:1223-38. [PMID: 15037081 DOI: 10.1016/j.jmb.2004.01.002] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2003] [Revised: 01/08/2004] [Accepted: 01/08/2004] [Indexed: 11/17/2022]
Abstract
The simplest approximation of interaction potential between amino acid residues in proteins is the contact potential, which defines the effective free energy of a protein conformation by a set of amino acid contacts formed in this conformation. Finding a contact potential capable of predicting free energies of protein states across a variety of protein families will aid protein folding and engineering in silico on a computationally tractable time-scale. We test the ability of contact potentials to accurately and transferably (across various protein families) predict stability changes of proteins upon mutations. We develop a new methodology to determine the contact potentials in proteins from experimental measurements of changes in protein's thermodynamic stabilities (DeltaDeltaG) upon mutations. We apply our methodology to derive sets of contact interaction parameters for a hierarchy of interaction models including solvation and multi-body contact parameters. We test how well our models reproduce experimental measurements by statistical tests. We evaluate the maximum accuracy of predictions obtained by using contact potentials and the correlation between parameters derived from different data-sets of experimental (DeltaDeltaG) values. We argue that it is impossible to reach experimental accuracy and derive fully transferable contact parameters using the contact models of potentials. However, contact parameters may yield reliable predictions of DeltaDeltaG for datasets of mutations confined to the same amino acid positions in the sequence of a single protein.
Collapse
Affiliation(s)
- Jainab Khatun
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | | |
Collapse
|
58
|
Dirks RM, Lin M, Winfree E, Pierce NA. Paradigms for computational nucleic acid design. Nucleic Acids Res 2004; 32:1392-403. [PMID: 14990744 PMCID: PMC390280 DOI: 10.1093/nar/gkh291] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The design of DNA and RNA sequences is critical for many endeavors, from DNA nanotechnology, to PCR-based applications, to DNA hybridization arrays. Results in the literature rely on a wide variety of design criteria adapted to the particular requirements of each application. Using an extensively studied thermodynamic model, we perform a detailed study of several criteria for designing sequences intended to adopt a target secondary structure. We conclude that superior design methods should explicitly implement both a positive design paradigm (optimize affinity for the target structure) and a negative design paradigm (optimize specificity for the target structure). The commonly used approaches of sequence symmetry minimization and minimum free-energy satisfaction primarily implement negative design and can be strengthened by introducing a positive design component. Surprisingly, our findings hold for a wide range of secondary structures and are robust to modest perturbation of the thermodynamic parameters used for evaluating sequence quality, suggesting the feasibility and ongoing utility of a unified approach to nucleic acid design as parameter sets are refined further. Finally, we observe that designing for thermodynamic stability does not determine folding kinetics, emphasizing the opportunity for extending design criteria to target kinetic features of the energy landscape.
Collapse
Affiliation(s)
- Robert M Dirks
- Chemistry Department, California Institute of Technology, Pasadena, CA 91125, USA
| | | | | | | |
Collapse
|
59
|
Kuhlman B, Baker D. Exploring folding free energy landscapes using computational protein design. Curr Opin Struct Biol 2004; 14:89-95. [PMID: 15102454 DOI: 10.1016/j.sbi.2004.01.002] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Recent advances in computational protein design have allowed exciting new insights into the sequence dependence of protein folding free energy landscapes. Whereas most previous studies have examined the sequence dependence of protein stability and folding kinetics by characterizing naturally occurring proteins and variants of these proteins that contain a small number of mutations, it is now possible to generate and characterize computationally designed proteins that differ significantly from naturally occurring proteins in sequence and/or structure. These computer-generated proteins provide insights into the determinants of protein structure, stability and folding, and make it possible to disentangle the properties of proteins that are the consequence of natural selection from those that reflect the fundamental physical chemistry of polypeptide chains.
Collapse
Affiliation(s)
- Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA.
| | | |
Collapse
|
60
|
Kuhlman B, Dantas G, Ireton GC, Varani G, Stoddard BL, Baker D. Design of a novel globular protein fold with atomic-level accuracy. Science 2003; 302:1364-8. [PMID: 14631033 DOI: 10.1126/science.1089427] [Citation(s) in RCA: 1139] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
A major challenge of computational protein design is the creation of novel proteins with arbitrarily chosen three-dimensional structures. Here, we used a general computational strategy that iterates between sequence design and structure prediction to design a 93-residue alpha/beta protein called Top7 with a novel sequence and topology. Top7 was found experimentally to be folded and extremely stable, and the x-ray crystal structure of Top7 is similar (root mean square deviation equals 1.2 angstroms) to the design model. The ability to design a new protein fold makes possible the exploration of the large regions of the protein universe not yet observed in nature.
Collapse
Affiliation(s)
- Brian Kuhlman
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | | | | | | | | | | |
Collapse
|
61
|
|