1
|
Nixon C, Lim SA, Sternke M, Barrick D, Harms MJ, Marqusee S. The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins. Protein Sci 2024; 33:e5011. [PMID: 38747388 PMCID: PMC11094778 DOI: 10.1002/pro.5011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 04/02/2024] [Accepted: 04/23/2024] [Indexed: 05/19/2024]
Abstract
A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Collapse
Affiliation(s)
- Charlotte Nixon
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Shion A. Lim
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Matt Sternke
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Doug Barrick
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Michael J. Harms
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Susan Marqusee
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- Department of ChemistryUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- California Institute for Quantitative Biosciences (QB3)BerkeleyCaliforniaUSA
| |
Collapse
|
2
|
Lim SA, Bolin ER, Marqusee S. Tracing a protein's folding pathway over evolutionary time using ancestral sequence reconstruction and hydrogen exchange. eLife 2018; 7:38369. [PMID: 30204082 PMCID: PMC6158009 DOI: 10.7554/elife.38369] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 09/09/2018] [Indexed: 12/15/2022] Open
Abstract
The conformations populated during protein folding have been studied for decades; yet, their evolutionary importance remains largely unexplored. Ancestral sequence reconstruction allows access to proteins across evolutionary time, and new methods such as pulsed-labeling hydrogen exchange coupled with mass spectrometry allow determination of folding intermediate structures at near amino-acid resolution. Here, we combine these techniques to monitor the folding of the ribonuclease H family along the evolutionary lineages of T. thermophilus and E. coli RNase H. All homologs and ancestral proteins studied populate a similar folding intermediate despite being separated by billions of years of evolution. Even though this conformation is conserved, the pathway leading to it has diverged over evolutionary time, and rational mutations can alter this trajectory. Our results demonstrate that evolutionary processes can affect the energy landscape to preserve or alter specific features of a protein’s folding pathway.
Collapse
Affiliation(s)
- Shion An Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States
| | - Eric Richard Bolin
- Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States.,Biophysics Graduate Program, University of California, Berkeley, Berkeley, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, United States.,Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States.,Department of Chemistry, University of California, Berkeley, Berkeley, United States.,Chan Zuckerberg Biohub, San Francisco, United States
| |
Collapse
|
3
|
Lim SA, Marqusee S. The burst-phase folding intermediate of ribonuclease H changes conformation over evolutionary history. Biopolymers 2018; 109:e23086. [PMID: 29152711 PMCID: PMC6047922 DOI: 10.1002/bip.23086] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 10/23/2017] [Accepted: 10/30/2017] [Indexed: 11/06/2022]
Abstract
The amino acid sequence encodes the energy landscape of a protein. Therefore, we expect evolutionary mutations to change features of the protein energy landscape, including the conformations adopted by a polypeptide as it folds to its native state. Ribonucleases H (RNase H) from Escherichia coli and Thermus thermophilus both fold via a partially folded intermediate in which the core region of the protein (helices A-D and strands 4-5) is structured. Strand 1, however, uniquely contributes to the T. thermophilus RNase H folding intermediate (Icore+1 ), but not the E. coli RNase H intermediate (Icore ) (Rosen & Marqusee, PLoS One 2015). We explore the origin of this difference by characterizing the folding intermediate of seven ancestral RNases H spanning the evolutionary history of these two homologs. Using fragment models with or without strand 1 and FRET probes to characterize the folding intermediate of each ancestor, we find a distinct evolutionary trend across the family-the involvement of strand 1 in the folding intermediate is an ancestral feature that is maintained in the thermophilic lineage and is gradually lost in the mesophilic lineage. Evolutionary sequence changes indeed modulate the conformations present on the folding landscape and altered the folding trajectory of RNase H.
Collapse
Affiliation(s)
- Shion An Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
4
|
Postic G, Ghouzam Y, Chebrek R, Gelly JC. An ambiguity principle for assigning protein structural domains. SCIENCE ADVANCES 2017; 3:e1600552. [PMID: 28097215 PMCID: PMC5235333 DOI: 10.1126/sciadv.1600552] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 11/28/2016] [Indexed: 05/20/2023]
Abstract
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object-in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our "multipartitioning" approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
Collapse
Affiliation(s)
- Guillaume Postic
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Corresponding author. (G.P.); (J.-C.G.)
| | - Yassine Ghouzam
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
| | - Romain Chebrek
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
| | - Jean-Christophe Gelly
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Corresponding author. (G.P.); (J.-C.G.)
| |
Collapse
|
5
|
Abstract
Protein structure prediction and protein docking prediction are two related problems in molecular biology. We suggest the use of multiple docking in the process of protein structure prediction. Once reliable structural models are predicted to disjoint fragments of the protein target sequence, a combinatorial assembly may be used to predict their native arrangement. Here, we present CombDock, a combinatorial docking algorithm for the structural units assembly problem. We have tested the algorithm on various examples using both domains and domain substructures as input. Inaccurate models of the structural units were also used, to test the robustness of the algorithm. The algorithm was able to predict a near-native arrangement of the input structural units in almost all of the cases, showing that the combinatorial approach succeeds in overcoming the inexact shape complementarity caused by the inaccuracy of the models.
Collapse
Affiliation(s)
- Yuval Inbar
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel,
| | - Haim J. Wolfson
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ruth Nussinov
- Sackler Institute of Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel and Basic Research Program, SAIC-Frederick Inc., Laboratory of Experimental and Computational Biology, NCI - FCRDC, Bldg 469, Rm 151, Frederick, MD 21702, USA
| |
Collapse
|
6
|
Gangadhara BN, Laine JM, Kathuria SV, Massi F, Matthews CR. Clusters of branched aliphatic side chains serve as cores of stability in the native state of the HisF TIM barrel protein. J Mol Biol 2013; 425:1065-81. [PMID: 23333740 DOI: 10.1016/j.jmb.2013.01.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Revised: 01/01/2013] [Accepted: 01/03/2013] [Indexed: 01/31/2023]
Abstract
Imidazole-3-glycerol phosphate synthase is a heterodimeric allosteric enzyme that catalyzes consecutive reactions in imidazole biosynthesis through its HisF and HisH subunits. The unusually slow unfolding reaction of the isolated HisF TIM barrel domain from the thermophilic bacteria, Thermotoga maritima, enabled an NMR-based site-specific analysis of the main-chain hydrogen bonds that stabilize its native conformation. Very strong protection against exchange with solvent deuterium in the native state was found in a subset of buried positions in α-helices and pervasively in the underlying β-strands associated with a pair of large clusters of isoleucine, leucine and valine (ILV) side chains located in the α7(βα)8(βα)1-2 and α2(βα)3-6β7 segments of the (βα)8 barrel. The most densely packed region of the large cluster, α3(βα)4-6β7, correlates closely with the core of stability previously observed in computational, protein engineering and NMR dynamics studies, demonstrating a key role for this cluster in determining the thermodynamic and structural properties of the native state of HisF. When considered with the results of previous studies where ILV clusters were found to stabilize the hydrogen-bonded networks in folding intermediates for other TIM barrel proteins, it appears that clusters of branched aliphatic side chains can serve as cores of stability across the entire folding reaction coordinate of one of the most common motifs in biology.
Collapse
Affiliation(s)
- Basavanapura N Gangadhara
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, 364 Plantation Street, Worcester, MA 01605, USA
| | | | | | | | | |
Collapse
|
7
|
Ratcliff K, Marqusee S. Identification of residual structure in the unfolded state of ribonuclease H1 from the moderately thermophilic Chlorobium tepidum: comparison with thermophilic and mesophilic homologues. Biochemistry 2010; 49:5167-75. [PMID: 20491485 DOI: 10.1021/bi1001097] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Ribonucleases H from organisms that grow at different temperatures demonstrate a variable change in heat capacity upon unfolding (DeltaC degrees (P)) [Ratcliff, K., et al. (2009) Biochemistry 48, 5890-5898]. This DeltaC degrees (P) has been shown to correlate with a tolerance to higher temperatures and residual structure in the unfolded state of the thermophilic proteins. In the RNase H from Thermus thermophilus, the low DeltaC degrees (P) has been shown to arise from the same region as the folding core of the protein, and mutagenic studies have shown that loss of a hydrophobic residue in this region can disrupt this residual unfolded state structure and result in a return to a more mesophile-like DeltaC degrees (P) [Robic, S., et al. (2002) Protein Sci. 11, 381-389; Robic, S., et al. (2003) Proc. Natl. Acad. Sci. U.S.A. 100, 11345-11349]. To understand further how residual structure in the unfolded state is encoded in the sequences of these thermophilic proteins, we subjected the RNase H from Chlorobium tepidum to similar studies. Analysis of new chimeric proteins reveals that like T. thermophilus RNase H, the folding core of C. tepidum RNase H plays an important role in the unfolded state of this protein. Mutagenesis studies, based on both a computational investigation of the hydrophobic networks in the core region and comparisons with similar studies on T. thermophilus RNase H, identify new residues involved in this residual structure and suggest that the residual structure in the unfolded state of C. tepidum RNase H is more restricted than that of T. thermophilus. We conclude that while the folding core region determines the thermophilic-like behavior of this family of proteins, the residue-specific details vary.
Collapse
Affiliation(s)
- Kathleen Ratcliff
- Biophysics Graduate Group, University of California, Berkeley, Berkeley, California 94720, USA
| | | |
Collapse
|
8
|
Zheng Z, Sosnick TR. Protein vivisection reveals elusive intermediates in folding. J Mol Biol 2010; 397:777-88. [PMID: 20144618 PMCID: PMC2838964 DOI: 10.1016/j.jmb.2010.01.056] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2009] [Revised: 01/19/2010] [Accepted: 01/25/2010] [Indexed: 10/19/2022]
Abstract
Although most folding intermediates escape detection, their characterization is crucial to the elucidation of folding mechanisms. Here, we outline a powerful strategy to populate partially unfolded intermediates: A buried aliphatic residue is substituted with a charged residue (e.g., Leu-->Glu(-)) to destabilize and unfold a specific region of the protein. We applied this strategy to ubiquitin, reversibly trapping a folding intermediate in which the beta5-strand is unfolded. The intermediate refolds to a native-like structure upon charge neutralization under mildly acidic conditions. Characterization of the trapped intermediate using NMR and hydrogen exchange methods identifies a second folding intermediate and reveals the order and free energies of the two major folding events on the native side of the rate-limiting step. This general strategy may be combined with other methods and have broad applications in the study of protein folding and other reactions that require trapping of high-energy states.
Collapse
Affiliation(s)
- Zhongzhou Zheng
- Department of Biochemistry and Molecular Biology, University of Chicago, 929 E. 57th St., Chicago, IL 60637
| | - Tobin R. Sosnick
- Department of Biochemistry and Molecular Biology, University of Chicago, 929 E. 57th St., Chicago, IL 60637
- Institute for Biophysical Dynamics, Computation Institute, University of Chicago, 929 E. 57th St., Chicago, IL 60637
| |
Collapse
|
9
|
Mohan PK, Hosur RV. pH dependent unfolding characteristics of DLC8 dimer: Residue level details from NMR. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2008; 1784:1795-803. [DOI: 10.1016/j.bbapap.2008.07.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2008] [Revised: 06/30/2008] [Accepted: 07/04/2008] [Indexed: 10/21/2022]
|
10
|
The high-resolution NMR structure of the early folding intermediate of the Thermus thermophilus ribonuclease H. J Mol Biol 2008; 384:531-9. [PMID: 18848567 DOI: 10.1016/j.jmb.2008.09.044] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2008] [Revised: 09/16/2008] [Accepted: 09/17/2008] [Indexed: 11/23/2022]
Abstract
Elucidation of the high-resolution structures of folding intermediates is a necessary but difficult step toward the ultimate understanding of the mechanism of protein folding. Here, using hydrogen-exchange-directed protein engineering, we populated the folding intermediate of the Thermus thermophilus ribonuclease H, which forms before the rate-limiting transition state, by removing the unfolded regions of the intermediate, including an alpha-helix and two beta-strands (51 folded residues). Using multidimensional NMR, we solved the structure of this intermediate mimic to an atomic resolution (backbone rmsd, 0.51 A). It has a native-like backbone topology and shows some local deviations from the native structure, revealing that the structure of the folded region of an early folding intermediate can be as well defined as the native structure. The topological parameters calculated from the structures of the intermediate mimic and the native state predict that the intermediate should fold on a millisecond time scale or less and form much faster than the native state. Other factors that may lead to the slow folding of the native state and the accumulation of the intermediate before the rate-limiting transition state are also discussed.
Collapse
|
11
|
Abstract
Two fundamentally different views of how proteins fold are now being debated. Do proteins fold through multiple unpredictable routes directed only by the energetically downhill nature of the folding landscape or do they fold through specific intermediates in a defined pathway that systematically puts predetermined pieces of the target native protein into place? It has now become possible to determine the structure of protein folding intermediates, evaluate their equilibrium and kinetic parameters, and establish their pathway relationships. Results obtained for many proteins have serendipitously revealed a new dimension of protein structure. Cooperative structural units of the native protein, called foldons, unfold and refold repeatedly even under native conditions. Much evidence obtained by hydrogen exchange and other methods now indicates that cooperative foldon units and not individual amino acids account for the unit steps in protein folding pathways. The formation of foldons and their ordered pathway assembly systematically puts native-like foldon building blocks into place, guided by a sequential stabilization mechanism in which prior native-like structure templates the formation of incoming foldons with complementary structure. Thus the same propensities and interactions that specify the final native state, encoded in the amino-acid sequence of every protein, determine the pathway for getting there. Experimental observations that have been interpreted differently, in terms of multiple independent pathways, appear to be due to chance misfolding errors that cause different population fractions to block at different pathway points, populate different pathway intermediates, and fold at different rates. This paper summarizes the experimental basis for these three determining principles and their consequences. Cooperative native-like foldon units and the sequential stabilization process together generate predetermined stepwise pathways. Optional misfolding errors are responsible for 3-state and heterogeneous kinetic folding.
Collapse
Affiliation(s)
- S Walter Englander
- The Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia 19104-6059, USA.
| | | | | |
Collapse
|
12
|
Krishna MMG, Maity H, Rumbley JN, Englander SW. Branching in the sequential folding pathway of cytochrome c. Protein Sci 2007; 16:1946-56. [PMID: 17660254 PMCID: PMC2206985 DOI: 10.1110/ps.072922307] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Previous results indicate that the folding pathways of cytochrome c and other proteins progressively build the target native protein in a predetermined stepwise manner by the sequential formation and association of native-like foldon units. The present work used native state hydrogen exchange methods to investigate a structural anomaly in cytochrome c results that suggested the concerted folding of two segments that have little structural relationship in the native protein. The results show that the two segments, an 18-residue omega loop and a 10-residue helix, are able to unfold and refold independently, which allows a branch point in the folding pathway. The pathway that emerges assembles native-like foldon units in a linear sequential manner when prior native-like structure can template a single subsequent foldon, and optional pathway branching is seen when prior structure is able to support the folding of two different foldons.
Collapse
Affiliation(s)
- Mallela M G Krishna
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104-6059, USA.
| | | | | | | |
Collapse
|
13
|
Krishna MMG, Englander SW. A unified mechanism for protein folding: predetermined pathways with optional errors. Protein Sci 2007; 16:449-64. [PMID: 17322530 PMCID: PMC2203325 DOI: 10.1110/ps.062655907] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2006] [Revised: 12/11/2006] [Accepted: 12/11/2006] [Indexed: 10/23/2022]
Abstract
There is a fundamental conflict between two different views of how proteins fold. Kinetic experiments and theoretical calculations are often interpreted in terms of different population fractions folding through different intermediates in independent unrelated pathways (IUP model). However, detailed structural information indicates that all of the protein population folds through a sequence of intermediates predetermined by the foldon substructure of the target protein and a sequential stabilization principle. These contrary views can be resolved by a predetermined pathway--optional error (PPOE) hypothesis. The hypothesis is that any pathway intermediate can incorporate a chance misfolding error that blocks folding and must be reversed for productive folding to continue. Different fractions of the protein population will then block at different steps, populate different intermediates, and fold at different rates, giving the appearance of multiple unrelated pathways. A test of the hypothesis matches the two models against extensive kinetic folding results for hen lysozyme which have been widely cited in support of independent parallel pathways. The PPOE model succeeds with fewer fitting constants. The fitted PPOE reaction scheme leads to known folding behavior, whereas the IUP properties are contradicted by experiment. The appearance of a conflict with multipath theoretical models seems to be due to their different focus, namely on multitrack microscopic behavior versus cooperative macroscopic behavior. The integration of three well-documented principles in the PPOE model (cooperative foldons, sequential stabilization, optional errors) provides a unifying explanation for how proteins fold and why they fold in that way.
Collapse
Affiliation(s)
- Mallela M G Krishna
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104-6059, USA.
| | | |
Collapse
|
14
|
Gebhard LG, Risso VA, Santos J, Ferreyra RG, Noguera ME, Ermácora MR. Mapping the Distribution of Conformational Information Throughout a Protein Sequence. J Mol Biol 2006; 358:280-8. [PMID: 16510154 DOI: 10.1016/j.jmb.2006.01.095] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2005] [Revised: 01/26/2006] [Accepted: 01/27/2006] [Indexed: 12/01/2022]
Abstract
The three-dimensional structure of protein is encoded in the sequence, but many amino acid residues carry no essential conformational information, and the identity of those that are structure-determining is elusive. By circular permutation and terminal deletion, we produced and purified 25 Bacillus licheniformis beta-lactamase (ESBL) variants that lack 5-21 contiguous residues each, and collectively have 82% of the sequence and 92% of the non-local atom-atom contacts eliminated. Circular dichroism and size-exclusion chromatography showed that most of the variants form conformationally heterogeneous mixtures, but by measuring catalytic constants, we found that all populate, to a greater or lesser extent, conformations with the essential features of the native fold. This suggests that no segment of the ESBL sequence is essential to the structure as a whole, which is congruent with the notion that local information and modular organization can impart most of the tertiary fold specificity and cooperativity.
Collapse
Affiliation(s)
- Leopoldo G Gebhard
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 180, (1876) Bernal, Buenos Aires, Argentina
| | | | | | | | | | | |
Collapse
|
15
|
Protein misfolding: optional barriers, misfolded intermediates, and pathway heterogeneity. J Mol Biol 2004; 343:1095-109. [PMID: 15476824 DOI: 10.1016/j.jmb.2004.08.098] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2004] [Revised: 08/20/2004] [Accepted: 08/26/2004] [Indexed: 10/26/2022]
Abstract
To investigate the character and role of misfolded intermediates in protein folding, a recombinant cytochrome c without the normally blocking histidine to heme misligation was studied. Folding remains heterogeneous as in the wild-type protein. Half of the population folds relatively rapidly to the native state in a two-state manner. The other half collapses (fluorescence quenching) and forms a full complement of helix (CD) with the same rate and denaturant dependence as the fast folding fraction but then is blocked and reaches the native structure (695nm absorbance) much more slowly. The factors that transiently block folding are not intrinsic to the folding process but depend on ambient conditions, including protein aggregation (f(concentration)), N terminus to heme misligation (f(pH)), and proline mis-isomerization (f(U state equilibration time)). The misfolded intermediate populated by the slowly folding fraction was characterized by hydrogen exchange pulse labeling. It is very advanced with all of the native-like elements fairly stably formed but not the final Met80-S to heme iron ligation, similar to a previously studied molten globule form induced by low pH. To complete final native state acquisition, some small back unfolding is required (error repair) but the misfolded intermediate does not revisit the U state before proceeding to N. These properties show that the intermediate is a normal on-pathway form that contains, in addition, adventitious misfolding errors that transiently block its forward progress. Related observations for other proteins (partially misfolded intermediates, pathway heterogeneity) might be similarly explained in terms of the optional insertion of error-dependent barriers into a classical folding pathway.
Collapse
|
16
|
Maity H, Maity M, Englander SW. How Cytochrome c Folds, and Why: Submolecular Foldon Units and their Stepwise Sequential Stabilization. J Mol Biol 2004; 343:223-33. [PMID: 15381432 DOI: 10.1016/j.jmb.2004.08.005] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2004] [Revised: 07/15/2004] [Accepted: 08/03/2004] [Indexed: 10/26/2022]
Abstract
Native state hydrogen exchange experiments have shown that the cytochrome c (Cyt c) protein consists of five cooperative folding-unfolding units, called foldons. These are named, in the order of increasing unfolding free energy, the nested-Yellow, Red, Yellow, Green, and Blue foldons. Previous results suggest that these units unfold in a stepwise sequential way so that each higher energy partially unfolded form includes all of the previously unfolded lower free energy units. If this is so, then selectively destabilizing any given foldon should equally destabilize each subsequent unfolding step above it in the unfolding ladder but leave the lower ones before it unaffected. To perform this test, we introduced the mutation Glu62Gly, which deletes a salt link in the Yellow unit and destabilizes the protein by 0.8 kcal/mol. Native state hydrogen exchange and other experiments show that the stability of the Yellow unit and the states above it in the free energy ladder are destabilized by about the same amount while the lower lying states are unaffected. These results help to confirm the sequential stepwise nature of the Cyt c unfolding pathway and therefore a similar refolding pathway. The steps in the pathway are dictated by the concerted folding-unfolding property of the individual unit foldons; the order of steps is determined by the sequential stabilization of progressively added foldons in the native context. Much related information for Cyt c strongly conforms with this mechanism. Its generality is supported by available information for other proteins.
Collapse
Affiliation(s)
- Haripada Maity
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | | | | |
Collapse
|
17
|
Kim MY, Maier CS, Reed DJ, Deinzer ML. Conformational changes in chemically modified Escherichia coli thioredoxin monitored by H/D exchange and electrospray ionization mass spectrometry. Protein Sci 2002; 11:1320-9. [PMID: 12021431 PMCID: PMC2373629 DOI: 10.1110/ps.3140102] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Abstract
Hydrogen/deuterium (H/D) exchange in combination with electrospray ionization mass spectrometry and near-ultraviolet (UV) circular dichroism (CD) was used to study the conformational properties and thermal unfolding of Escherichia coli thioredoxin and its Cys32-alkylated derivatives in 1% acetic acid (pH 2.7). Thermal unfolding of oxidized (Oxi) and reduced (Red) -thioredoxin (TRX) and Cys-32-ethylglutathionyl (GS-ethyl-TRX) and Cys-32-ethylcysteinyl (Cys-ethyl-TRX), which are derivatives of Red-TRX, follow apparent EX1 kinetics as charge-state envelopes, H/D mass spectral exchange profiles, and near-UV CD appear to support a two-state folding/unfolding model. Minor mass peaks in the H/D exchange profiles and nonsuperimposable MS- and CD-derived melting curves, however, suggest the participation of unfolding intermediates leading to the conclusion that the two-state model is an oversimplification of the process. The relative stabilities as measured by melting temperatures by both CD and mass spectral charge states are, Oxi-TRX, GS-ethyl-TRX, Cys-ethyl-TRX, and Red-TRX. The introduction of the Cys-32-ethylglutathionyl group provides extra stabilization that results from additional hydrogen bonding interactions between the ethylglutathionyl group and the protein. Near-UV CD data show that the local environment near the active site is perturbed to almost an identical degree regardless of whether alkylation at Cys-32 is by the ethylglutathionyl group, or the smaller, nonhydrogen-bonding ethylcysteinyl group. Mass spectral data, however, indicate a tighter structure for GS-ethyl-TRX.
Collapse
Affiliation(s)
- Moo-Young Kim
- Department of Chemistry, Oregon State University, Corvallis, Oregon 97331, USA
| | | | | | | |
Collapse
|
18
|
Robic S, Berger JM, Marqusee S. Contributions of folding cores to the thermostabilities of two ribonucleases H. Protein Sci 2002; 11:381-9. [PMID: 11790848 PMCID: PMC2373436 DOI: 10.1110/ps.38602] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
To investigate the contribution of the folding cores to the thermodynamic stability of RNases H, we used rational design to create two chimeras composed of parts of a thermophilic and a mesophilic RNase H. Each chimera combines the folding core from one parent protein and the remaining parts of the other. Both chimeras form active, well-folded RNases H. Stability curves, based on CD-monitored chemical denaturations, show that the chimera with the thermophilic core is more stable, has a higher midpoint of thermal denaturation, and a lower change in heat capacity (DeltaCp) upon unfolding than the chimera with the mesophilic core. A possible explanation for the low DeltaCp of both the parent thermophilic RNase H and the chimera with the thermophilic core is the residual structure in the denatured state. On the basis of the studied parameters, the chimera with the thermophilic core resembles a true thermophilic protein. Our results suggest that the folding core plays an essential role in conferring thermodynamic parameters to RNases H.
Collapse
Affiliation(s)
- Srebrenka Robic
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | | | | |
Collapse
|
19
|
Rumbley J, Hoang L, Mayne L, Englander SW. An amino acid code for protein folding. Proc Natl Acad Sci U S A 2001; 98:105-12. [PMID: 11136249 PMCID: PMC14552 DOI: 10.1073/pnas.98.1.105] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2000] [Indexed: 11/18/2022] Open
Abstract
Direct structural information obtained for many proteins supports the following conclusions. The amino acid sequences of proteins can stabilize not only the final native state but also a small set of discrete partially folded native-like intermediates. Intermediates are formed in steps that use as units the cooperative secondary structural elements of the native protein. Earlier intermediates guide the addition of subsequent units in a process of sequential stabilization mediated by native-like tertiary interactions. The resulting stepwise self-assembly process automatically constructs a folding pathway, whether linear or branched. These conclusions are drawn mainly from hydrogen exchange-based methods, which can depict the structure of infinitesimally populated folding intermediates at equilibrium and kinetic intermediates with subsecond lifetimes. Other kinetic studies show that the polypeptide chain enters the folding pathway after an initial free-energy-uphill conformational search. The search culminates by finding a native-like topology that can support forward (native-like) folding in a free-energy-downhill manner. This condition automatically defines an initial transition state, the search for which sets the maximum possible (two-state) folding rate. It also extends the sequential stabilization strategy, which depends on a native-like context, to the first step in the folding process. Thus the native structure naturally generates its own folding pathway. The same amino acid code that translates into the final equilibrium native structure-by virtue of propensities, patterning, secondary structural cueing, and tertiary context-also produces its kinetic accessibility.
Collapse
Affiliation(s)
- J Rumbley
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104-6059, USA
| | | | | | | |
Collapse
|
20
|
Fischer KF, Marqusee S. A rapid test for identification of autonomous folding units in proteins. J Mol Biol 2000; 302:701-12. [PMID: 10986128 DOI: 10.1006/jmbi.2000.4049] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The structure of a protein is dictated by a large number of weak interactions that cooperatively stabilize the native state. Usually, excised fragments smaller than a domain have little if any residual structure. When autonomous units of structure are found within domains, this challenges common assumptions about the cooperativity of protein structure. Such autonomous folding units (AFUs) are of wide interest and have applications in protein engineering and as simple model systems for studying the determinants of stability and specificity. A new method of identifying AFUs within proteins is presented here. The rapid autonomous fragment test (RAFT) identifies AFUs based on analysis of inter-residue contacts present in the three-dimensional structure of a protein. RAFT is fast enough to mine the entire PDB for AFUs and provide a library of potential small stable folds. We show that RAFT is able to predict whether a protein fragment will be structured if isolated from its parent domain.
Collapse
Affiliation(s)
- K F Fischer
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720-3206, USA
| | | |
Collapse
|
21
|
Chamberlain AK, Marqusee S. Comparison of equilibrium and kinetic approaches for determining protein folding mechanisms. ADVANCES IN PROTEIN CHEMISTRY 2000; 53:283-328. [PMID: 10751947 DOI: 10.1016/s0065-3233(00)53006-x] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- A K Chamberlain
- Oxford Centre for Molecular Sciences, New Chemistry Lab, Oxford, United Kingdom
| | | |
Collapse
|
22
|
Abstract
A physical theory of protein secondary structure is proposed and tested by performing exceedingly simple Monte Carlo simulations. In essence, secondary structure propensities are predominantly a consequence of two competing local effects, one favoring hydrogen bond formation in helices and turns, the other opposing the attendant reduction in sidechain conformational entropy on helix and turn formation. These sequence specific biases are densely dispersed throughout the unfolded polypeptide chain, where they serve to preorganize the folding process and largely, but imperfectly, anticipate the native secondary structure.
Collapse
Affiliation(s)
- R Srinivasan
- Department of Biophysics, Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore, MD 21205, USA
| | | |
Collapse
|