1
|
Abstract
Repeat proteins are made with tandem copies of similar amino acid stretches that fold into elongated architectures. These proteins constitute excellent model systems to investigate how evolution relates to structure, folding, and function. Here, we propose a scheme to map evolutionary information at the sequence level to a coarse-grained model for repeat-protein folding and use it to investigate the folding of thousands of repeat proteins. We model the energetics by a combination of an inverse Potts-model scheme with an explicit mechanistic model of duplications and deletions of repeats to calculate the evolutionary parameters of the system at the single-residue level. These parameters are used to inform an Ising-like model that allows for the generation of folding curves, apparent domain emergence, and occupation of intermediate states that are highly compatible with experimental data in specific case studies. We analyzed the folding of thousands of natural Ankyrin repeat proteins and found that a multiplicity of folding mechanisms are possible. Fully cooperative all-or-none transitions are obtained for arrays with enough sequence-similar elements and strong interactions between them, while noncooperative element-by-element intermittent folding arose if the elements are dissimilar and the interactions between them are energetically weak. Additionally, we characterized nucleation-propagation and multidomain folding mechanisms. We show that the global stability and cooperativity of the repeating arrays can be predicted from simple sequence scores.
Collapse
|
2
|
Galpern EA, Freiberger MI, Ferreiro DU. Large Ankyrin repeat proteins are formed with similar and energetically favorable units. PLoS One 2020; 15:e0233865. [PMID: 32579546 PMCID: PMC7314423 DOI: 10.1371/journal.pone.0233865] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 05/13/2020] [Indexed: 11/19/2022] Open
Abstract
Ankyrin containing proteins are one of the most abundant repeat protein families present in all extant organisms. They are made with tandem copies of similar amino acid stretches that fold into elongated architectures. Here, we built and curated a dataset of 200 thousand proteins that contain 1.2 million Ankyrin regions and characterize the abundance, structure and energetics of the repetitive regions in natural proteins. We found that there is a continuous roughly exponential variety of array lengths with an exceptional frequency at 24 repeats. We described that individual repeats are seldom interrupted with long insertions and accept few deletions, in line with the known tertiary structures. We found that longer arrays are made up of repeats that are more similar to each other than shorter arrays, and display more favourable folding energy, hinting at their evolutionary origin. The array distributions show that there is a physical upper limit to the size of an array of repeats of about 120 copies, consistent with the limit found in nature. The identity patterns within the arrays suggest that they may have originated by sequential copies of more than one Ankyrin unit.
Collapse
Affiliation(s)
- Ezequiel A. Galpern
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - María I. Freiberger
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Diego U. Ferreiro
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
- * E-mail:
| |
Collapse
|
3
|
Lowe AR, Perez-Riba A, Itzhaki LS, Main ERG. PyFolding: Open-Source Graphing, Simulation, and Analysis of the Biophysical Properties of Proteins. Biophys J 2019; 114:516-521. [PMID: 29414697 DOI: 10.1016/j.bpj.2017.11.3779] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Revised: 11/21/2017] [Accepted: 11/27/2017] [Indexed: 11/25/2022] Open
Abstract
For many years, curve-fitting software has been heavily utilized to fit simple models to various types of biophysical data. Although such software packages are easy to use for simple functions, they are often expensive and present substantial impediments to applying more complex models or for the analysis of large data sets. One field that is reliant on such data analysis is the thermodynamics and kinetics of protein folding. Over the past decade, increasingly sophisticated analytical models have been generated, but without simple tools to enable routine analysis. Consequently, users have needed to generate their own tools or otherwise find willing collaborators. Here we present PyFolding, a free, open-source, and extensible Python framework for graphing, analysis, and simulation of the biophysical properties of proteins. To demonstrate the utility of PyFolding, we have used it to analyze and model experimental protein folding and thermodynamic data. Examples include: 1) multiphase kinetic folding fitted to linked equations, 2) global fitting of multiple data sets, and 3) analysis of repeat protein thermodynamics with Ising model variants. Moreover, we demonstrate how PyFolding is easily extensible to novel functionality beyond applications in protein folding via the addition of new models. Example scripts to perform these and other operations are supplied with the software, and we encourage users to contribute notebooks and models to create a community resource. Finally, we show that PyFolding can be used in conjunction with Jupyter notebooks as an easy way to share methods and analysis for publication and among research teams.
Collapse
Affiliation(s)
- Alan R Lowe
- London Centre for Nanotechnology, University College London, London, United Kingdom; Department of Structural and Molecular Biology, University College London, London, United Kingdom; Department of Biological Sciences, Birkbeck College, University of London, London, United Kingdom.
| | - Albert Perez-Riba
- Department of Pharmacology, University of Cambridge, Cambridge, United Kingdom
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Cambridge, United Kingdom
| | - Ewan R G Main
- School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom.
| |
Collapse
|
4
|
Detailing Protein Landscapes under Pressure. Biophys J 2016; 111:2339-2341. [PMID: 27926834 DOI: 10.1016/j.bpj.2016.10.038] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Revised: 10/17/2016] [Accepted: 10/27/2016] [Indexed: 11/23/2022] Open
|
5
|
Parra RG, Espada R, Verstraete N, Ferreiro DU. Structural and Energetic Characterization of the Ankyrin Repeat Protein Family. PLoS Comput Biol 2015; 11:e1004659. [PMID: 26691182 PMCID: PMC4687027 DOI: 10.1371/journal.pcbi.1004659] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 11/10/2015] [Indexed: 11/21/2022] Open
Abstract
Ankyrin repeat containing proteins are one of the most abundant solenoid folds. Usually implicated in specific protein-protein interactions, these proteins are readily amenable for design, with promising biotechnological and biomedical applications. Studying repeat protein families presents technical challenges due to the high sequence divergence among the repeating units. We developed and applied a systematic method to consistently identify and annotate the structural repetitions over the members of the complete Ankyrin Repeat Protein Family, with increased sensitivity over previous studies. We statistically characterized the number of repeats, the folding of the repeat-arrays, their structural variations, insertions and deletions. An energetic analysis of the local frustration patterns reveal the basic features underlying fold stability and its relation to the functional binding regions. We found a strong linear correlation between the conservation of the energetic features in the repeat arrays and their sequence variations, and discuss new insights into the organization and function of these ubiquitous proteins. Some natural proteins are formed with repetitions of similar amino acid stretches. Ankyrin-repeat proteins constitute one of the most abundant families of this class of proteins that serve as model systems to analyze how variations in sequences exert effects in structures and biological functions. We present an in-depth analysis of the ankyrin repeat protein family, characterizing the variations in the repeating arrays both at the structural and energetic level. We introduce a consistent annotation for the repeat characteristics and describe how the structural differences are related to the sequences by their underlying energetic signatures.
Collapse
Affiliation(s)
- R. Gonzalo Parra
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Rocío Espada
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Nina Verstraete
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Diego U. Ferreiro
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
- * E-mail:
| |
Collapse
|
6
|
Aksel T, Barrick D. Direct observation of parallel folding pathways revealed using a symmetric repeat protein system. Biophys J 2015; 107:220-32. [PMID: 24988356 DOI: 10.1016/j.bpj.2014.04.058] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Revised: 04/09/2014] [Accepted: 04/11/2014] [Indexed: 11/26/2022] Open
Abstract
Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule.
Collapse
Affiliation(s)
- Tural Aksel
- Deparment of Biochemistry, Stanford University School of Medicine, Stanford, California
| | - Doug Barrick
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland.
| |
Collapse
|
7
|
Espada R, Parra RG, Mora T, Walczak AM, Ferreiro DU. Capturing coevolutionary signals inrepeat proteins. BMC Bioinformatics 2015; 16:207. [PMID: 26134293 PMCID: PMC4489039 DOI: 10.1186/s12859-015-0648-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2015] [Accepted: 06/16/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The analysis of correlations of amino acid occurrences in globular domains has led to the development of statistical tools that can identify native contacts - portions of the chains that come to close distance in folded structural ensembles. Here we introduce a direct coupling analysis for repeat proteins - natural systems for which the identification of folding domains remains challenging. RESULTS We show that the inherent translational symmetry of repeat protein sequences introduces a strong bias in the pair correlations at precisely the length scale of the repeat-unit. Equalizing for this bias in an objective way reveals true co-evolutionary signals from which local native contacts can be identified. Importantly, parameter values obtained for all other interactions are not significantly affected by the equalization. We quantify the robustness of the procedure and assign confidence levels to the interactions, identifying the minimum number of sequences needed to extract evolutionary information in several repeat protein families. CONCLUSIONS The overall procedure can be used to reconstruct the interactions at distances larger than repeat-pairs, identifying the characteristics of the strongest couplings in each family, and can be applied to any system that appears translationally symmetric.
Collapse
Affiliation(s)
- Rocío Espada
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina.,Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - R Gonzalo Parra
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Thierry Mora
- Laboratoire de physique statistique, CNRS, UPMC and École normale supérieure, 24 rue Lhomond, Paris, 75005, France
| | | | - Diego U Ferreiro
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| |
Collapse
|
8
|
Naiyer A, Hassan MI, Islam A, Sundd M, Ahmad F. Structural characterization of MG and pre-MG states of proteins by MD simulations, NMR, and other techniques. J Biomol Struct Dyn 2015; 33:2267-84. [PMID: 25586676 DOI: 10.1080/07391102.2014.999354] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Almost all proteins fold via a number of partially structured intermediates such as molten globule (MG) and pre-molten globule states. Understanding the structure of these intermediates at atomic level is often a challenge, as these states are observed under extreme conditions of pH, temperature, and chemical denaturants. Furthermore, several other processes such as chemical modification, site-directed mutagenesis (or point mutation), and cleavage of covalent bond of natural proteins often lead to MG like partially unfolded conformation. However, the dynamic nature of proteins in these states makes them unsuitable for most structure determination at atomic level. Intermediate states studied so far have been characterized mostly by circular dichroism, fluorescence, viscosity, dynamic light scattering measurements, dye binding, infrared techniques, molecular dynamics simulations, etc. There is a limited amount of structural data available on these intermediate states by nuclear magnetic resonance (NMR) and hence there is a need to characterize these states at the molecular level. In this review, we present characterization of equilibrium intermediates by biophysical techniques with special reference to NMR.
Collapse
Affiliation(s)
- Abdullah Naiyer
- a Centre for Interdisciplinary Research in Basic Sciences , Jamia Millia Islamia , Jamia Nagar, New Delhi - 110025 , India
| | | | | | | | | |
Collapse
|
9
|
Folding pathway of a multidomain protein depends on its topology of domain connectivity. Proc Natl Acad Sci U S A 2014; 111:15969-74. [PMID: 25267632 DOI: 10.1073/pnas.1406244111] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
How do the folding mechanisms of multidomain proteins depend on protein topology? We addressed this question by developing an Ising-like structure-based model and applying it for the analysis of free-energy landscapes and folding kinetics of an example protein, Escherichia coli dihydrofolate reductase (DHFR). DHFR has two domains, one comprising discontinuous N- and C-terminal parts and the other comprising a continuous middle part of the chain. The simulated folding pathway of DHFR is a sequential process during which the continuous domain folds first, followed by the discontinuous domain, thereby avoiding the rapid decrease in conformation entropy caused by the association of the N- and C-terminal parts during the early phase of folding. Our simulated results consistently explain the observed experimental data on folding kinetics and predict an off-pathway structural fluctuation at equilibrium. For a circular permutant for which the topological complexity of wild-type DHFR is resolved, the balance between energy and entropy is modulated, resulting in the coexistence of the two folding pathways. This coexistence of pathways should account for the experimentally observed complex folding behavior of the circular permutant.
Collapse
|
10
|
Abstract
Biomolecules are the prime information processing elements of living matter. Most of these inanimate systems are polymers that compute their own structures and dynamics using as input seemingly random character strings of their sequence, following which they coalesce and perform integrated cellular functions. In large computational systems with finite interaction-codes, the appearance of conflicting goals is inevitable. Simple conflicting forces can lead to quite complex structures and behaviors, leading to the concept of frustration in condensed matter. We present here some basic ideas about frustration in biomolecules and how the frustration concept leads to a better appreciation of many aspects of the architecture of biomolecules, and especially how biomolecular structure connects to function by means of localized frustration. These ideas are simultaneously both seductively simple and perilously subtle to grasp completely. The energy landscape theory of protein folding provides a framework for quantifying frustration in large systems and has been implemented at many levels of description. We first review the notion of frustration from the areas of abstract logic and its uses in simple condensed matter systems. We discuss then how the frustration concept applies specifically to heteropolymers, testing folding landscape theory in computer simulations of protein models and in experimentally accessible systems. Studying the aspects of frustration averaged over many proteins provides ways to infer energy functions useful for reliable structure prediction. We discuss how frustration affects folding mechanisms. We review here how the biological functions of proteins are related to subtle local physical frustration effects and how frustration influences the appearance of metastable states, the nature of binding processes, catalysis and allosteric transitions. In this review, we also emphasize that frustration, far from being always a bad thing, is an essential feature of biomolecules that allows dynamics to be harnessed for function. In this way, we hope to illustrate how Frustration is a fundamental concept in molecular biology.
Collapse
|
11
|
Abstract
In this issue of Structure, Tsytlonok and colleagues describe the folding landscape of the giant HEAT-repeat protein PR65/A (a molecular adaptor of protein phosphatase 2A) by using experimental and theoretical methods. Both approaches agree in suggesting the presence of parallel folding pathways with several intermediates.
Collapse
|
12
|
Tsytlonok M, Craig PO, Sivertsson E, Serquera D, Perrett S, Best RB, Wolynes PG, Itzhaki LS. Complex energy landscape of a giant repeat protein. Structure 2013; 21:1954-65. [PMID: 24120762 DOI: 10.1016/j.str.2013.08.028] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Revised: 08/21/2013] [Accepted: 08/22/2013] [Indexed: 01/10/2023]
Abstract
Here, we reveal a remarkable complexity in the unfolding of giant HEAT-repeat protein PR65/A, a molecular adaptor for the heterotrimeric PP2A phosphatases. The repeat array ruptures at multiple sites, leading to intermediate states with noncontiguous folded subdomains. There is a dominant sequence of unfolding, which reflects a nonuniform stability distribution across the repeat array and can be rationalized by theoretical models accounting for heterogeneous contact density in the folded structure. Unfolding of certain intermediates is, however, competitive, leading to parallel unfolding pathways. The low-stability, central repeats sample unfolded conformations under physiological conditions, suggesting how folding directs function: certain regions present rigid motifs for molecular recognition, whereas others have the flexibility with which to broaden the search area, as in the fly-casting mechanism. Partial unfolding of PR65/A also impacts catalysis by altering the proximity of bound catalytic subunit and substrate. Thus, the repeat array orchestrates the assembly and activity of PP2A.
Collapse
Affiliation(s)
- Maksym Tsytlonok
- MRC Cancer Cell Unit, Hutchison/MRC Research Centre, Hills Road, Cambridge CB2 0XZ, UK; Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Parra RG, Espada R, Sánchez IE, Sippl MJ, Ferreiro DU. Detecting repetitions and periodicities in proteins by tiling the structural space. J Phys Chem B 2013; 117:12887-97. [PMID: 23758291 PMCID: PMC3807821 DOI: 10.1021/jp402105j] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
![]()
The
notion of energy landscapes provides conceptual tools for understanding
the complexities of protein folding and function. Energy landscape
theory indicates that it is much easier to find sequences that satisfy
the “Principle of Minimal Frustration” when the folded
structure is symmetric (Wolynes, P. G. Symmetry and the Energy Landscapes
of Biomolecules. Proc. Natl. Acad. Sci. U.S.A.1996, 93, 14249–14255). Similarly,
repeats and structural mosaics may be fundamentally related to landscapes
with multiple embedded funnels. Here we present analytical tools to
detect and compare structural repetitions in protein molecules. By
an exhaustive analysis of the distribution of structural repeats using
a robust metric, we define those portions of a protein molecule that
best describe the overall structure as a tessellation of basic units.
The patterns produced by such tessellations provide intuitive representations
of the repeating regions and their association toward higher order
arrangements. We find that some protein architectures can be described
as nearly periodic, while in others clear separations between repetitions
exist. Since the method is independent of amino acid sequence information,
we can identify structural units that can be encoded by a variety
of distinct amino acid sequences.
Collapse
Affiliation(s)
- R Gonzalo Parra
- Protein Physiology Lab, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN , Buenos Aires, Argentina
| | | | | | | | | |
Collapse
|
14
|
Tandem-repeat proteins: regularity plus modularity equals design-ability. Curr Opin Struct Biol 2013; 23:622-31. [PMID: 23831287 DOI: 10.1016/j.sbi.2013.06.011] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Revised: 06/13/2013] [Accepted: 06/14/2013] [Indexed: 12/16/2022]
Abstract
Researchers in the field of rational protein design face a significant challenge, which arises from the two defining and inter-related features of typical globular protein structures, namely topological complexity and cooperativity. In striking contrast to globular proteins, tandem repeat proteins, such as ankyrin, tetratricopeptide and leucine-rich repeats, have regular, modular, linearly arrayed structures which makes it especially straightforward to dissect and redesign their properties. Here we review what we have learnt about the biophysics of natural repeat proteins and recent progress in applying that knowledge to engineer the thermodynamics, folding pathways and molecular recognition properties of tandem repeat proteins, and we discuss the wealth of possibilities presented for the extension of this modular construction process to build new molecules for use in medicine and biotechnology.
Collapse
|
15
|
Modulation of folding kinetics of repeat proteins: interplay between intra- and interdomain interactions. Biophys J 2012; 103:1555-65. [PMID: 23062348 DOI: 10.1016/j.bpj.2012.08.018] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2012] [Revised: 07/06/2012] [Accepted: 08/02/2012] [Indexed: 12/31/2022] Open
Abstract
Repeat proteins have unique elongated structures that, unlike globular proteins, are quite modular. Despite their simple one-dimensional structure, repeat proteins exhibit intricate folding behavior with a complexity similar to that of globular proteins. Therefore, repeat proteins allow one to quantify fundamental aspects of the biophysics of protein folding. One important feature of repeat proteins is the interfaces between the repeating units. In particular, the distribution of stabilities within and between the repeats was previously suggested to affect their folding characteristics. In this study, we explore how the interface affects folding kinetics and cooperativity by investigating two families of repeat proteins, namely, the Ankyrin and tetratricopeptide repeat proteins, which differ in the number of interfacial contacts that are formed between their units as well as in their folding behavior. By using simple topology-based models, we show that modulating the energetic strength of the interface relative to that of the repeat itself can drastically change the protein stability, folding rate, and cooperativity. By further dissecting the interfacial contacts into several subsets, we isolated the effects of each of these groups on folding kinetics. Our study highlights the importance of interface connectivity in determining the folding behavior.
Collapse
|
16
|
Jenik M, Parra RG, Radusky LG, Turjanski A, Wolynes PG, Ferreiro DU. Protein frustratometer: a tool to localize energetic frustration in protein molecules. Nucleic Acids Res 2012; 40:W348-51. [PMID: 22645321 PMCID: PMC3394345 DOI: 10.1093/nar/gks447] [Citation(s) in RCA: 121] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The frustratometer is an energy landscape theory-inspired algorithm that aims at quantifying the location of frustration manifested in protein molecules. Frustration is a useful concept for gaining insight to the proteins biological behavior by analyzing how the energy is distributed in protein structures and how mutations or conformational changes shift the energetics. Sites of high local frustration often indicate biologically important regions involved in binding or allostery. In contrast, minimally frustrated linkages comprise a stable folding core of the molecule that is conserved in conformational changes. Here, we describe the implementation of these ideas in a webserver freely available at the National EMBNet node-Argentina, at URL: http://lfp.qb.fcen.uba.ar/embnet/.
Collapse
Affiliation(s)
- Michael Jenik
- Protein Physiology Laboratory, Departamento de Química Biológica-CONICET, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires C1428EGA, Argentina
| | | | | | | | | | | |
Collapse
|
17
|
Itzhaki LS, Lowe AR. From artificial antibodies to nanosprings: the biophysical properties of repeat proteins. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 747:153-66. [PMID: 22949117 DOI: 10.1007/978-1-4614-3229-6_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
In this chapter we review recent studies of repeat proteins, a class of proteins consisting of tandem arrays of small structural motifs that stack approximately linearly to produce elongated structures. We discuss the observation that, despite lacking the long-range tertiary interactions that are thought to be the hallmark of globular protein stability, repeat proteins can be as stable and as co-orperatively folded as their globular counterparts. The symmetry inherent in the structures of repeat arrays, however, means there can be many partly folded species (whether it be intermediates or transition states) that have similar stabilities. Consequently they do have distinct folding properties compared with globular proteins and these are manifest in their behaviour both at equilibrium and under kinetic conditions. Thus, when studying repeat proteins one appears to be probing a moving target: a relatively small perturbation, by mutation for example, can result in a shift to a different intermediate or transition state. The growing literature on these proteins illustrates how their modular architecture can be adapted to a remarkable array of biological and physical roles, both in vivo and in vitro. Further, their simple architecture makes them uniquely amenable to redesign-of their stability, folding and function-promising exciting possibilities for future research.
Collapse
Affiliation(s)
- Laura S Itzhaki
- Department of Chemistry, University of Cambridge, Cambridge, UK.
| | | |
Collapse
|
18
|
DeVries I, Ferreiro DU, Sánchez IE, Komives EA. Folding kinetics of the cooperatively folded subdomain of the IκBα ankyrin repeat domain. J Mol Biol 2011; 408:163-76. [PMID: 21329696 DOI: 10.1016/j.jmb.2011.02.021] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2010] [Revised: 02/03/2011] [Accepted: 02/09/2011] [Indexed: 11/15/2022]
Abstract
The ankyrin repeat (AR) domain of IκBα consists of a cooperative folding unit of roughly four ARs (AR1-AR4) and of two weakly folded repeats (AR5 and AR6). The kinetic folding mechanism of the cooperative subdomain, IκBα(67-206), was analyzed using rapid mixing techniques. Despite its apparent architectural simplicity, IκBα(67-206) displays complex folding kinetics, with two sequential on-pathway high-energy intermediates. The effect of mutations to or away from the consensus sequences of ARs on folding behavior was analyzed, particularly the GXTPLHLA motif, which have not been examined in detail previously. Mutations toward the consensus generally resulted in an increase in folding stability, whereas mutations away from the consensus resulted in decreased overall stability. We determined the free energy change upon mutation for three sequential transition state ensembles along the folding route for 16 mutants. We show that folding initiates with the formation of the interface of the outer helices of AR3 and AR4, and then proceeds to consolidate structure in these repeats. Subsequently, AR1 and AR2 fold in a concerted way in a single kinetic step. We show that this mechanism is robust to the presence of AR5 and AR6 as they do not strongly affect the folding kinetics. Overall, the protein appears to fold on a rather smooth energy landscape, where the folding mechanism conforms a one-dimensional approximation. However, we note that the AR does not necessarily act as a single folding element.
Collapse
Affiliation(s)
- Ingrid DeVries
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0378, USA
| | | | | | | |
Collapse
|
19
|
Lee W, Zeng X, Zhou HX, Bennett V, Yang W, Marszalek PE. Full reconstruction of a vectorial protein folding pathway by atomic force microscopy and molecular dynamics simulations. J Biol Chem 2010; 285:38167-72. [PMID: 20870713 DOI: 10.1074/jbc.m110.179697] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
During co-translational folding, the nascent polypeptide chain is extruded sequentially from the ribosome exit tunnel and is [corrected] under severe conformational constraints [corrected] dictated by the one-dimensional geometry of the tunnel. [corrected] How do such vectorial constraints impact the folding pathway? Here, we combine single-molecule atomic force spectroscopy and steered molecular dynamics simulations to examine protein folding in the presence of one-dimensional constraints that are similar to those imposed on the nascent polypeptide chain. The simulations exquisitely reproduced the experimental unfolding and refolding force extension relationships and led to the full reconstruction of the vectorial folding pathway of a large polypeptide, the 253-residue consensus ankyrin repeat protein, NI6C. We show that fully stretched and then relaxed NI6C starts folding by the formation of local secondary structures, followed by the nucleation of three N-terminal repeats. This rate-limiting step is then followed by the vectorial and sequential folding of the remaining repeats. However, after partial unfolding, when allowed to refold, the C-terminal repeats successively regain structures without any nucleation step by using the intact N-terminal repeats as a template. These results suggest a pathway for the co-translational folding of repeat proteins and have implications for mechanotransduction.
Collapse
Affiliation(s)
- Whasil Lee
- Center for Biologically Inspired Materials and Material Systems and Department of Mechanical Engineering and Materials Science, Duke University, Durham, North Carolina 27708, USA
| | | | | | | | | | | |
Collapse
|
20
|
|
21
|
Mor A, Haran G, Levy Y. Characterization of the unfolded state of repeat proteins. HFSP JOURNAL 2008; 2:405-15. [PMID: 19436472 PMCID: PMC2633173 DOI: 10.2976/1.3021145] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2008] [Accepted: 10/20/2008] [Indexed: 11/19/2022]
Abstract
The unfolded state ensemble of proteins has been described as a structurally featureless state. While this approach is supported by the fact that many unfolded proteins follow the scaling law behavior of a random coil, there is evidence that the unfolded states of various proteins are stabilized by native or non-native interactions. Recently, the existence of extensive non-native structure was reported for a repeat protein, which resulted in a scaling law exponent that is significantly smaller than that of a random polymer [Cortajarena et al., J. Mol. Biol. 382(1), 203-212 (2008)]. It was concluded that the high compactness of this protein stems from a significant fraction of interacting PP(II) helical segments in the unfolded state. In this study, we aim at providing possible molecular understanding of this anomalous compactness of the unfolded state and to investigate its origin. Using a hierarchy of computational models, we ask whether in general the unfolded state of a repeat protein is likely to be intrinsically more compact than the unfolded state of globular proteins, or whether this phenomenon depends mostly on the occurrence of a specific sequence that promotes PP(II) conformations. Our results suggest that the formation of the PP(II) conformation is indeed essential, yet the recurring sequence of repeat proteins promotes the interactions between these PP(II) segments and the formation of non-native interactions in the unfolded state.
Collapse
Affiliation(s)
- Amit Mor
- Department of Structural Biology, Weizmann Institute
of Science, Rehovot, 76100, Israel
| | - Gilad Haran
- Department of Chemical Physics, Weizmann Institute
of Science, Rehovot, 76100, Israel
| | - Yaakov Levy
- Department of Structural Biology, Weizmann Institute
of Science, Rehovot, 76100, Israel
| |
Collapse
|
22
|
Abstract
The complexity of the mechanisms by which proteins fold has been shown by many studies to be governed by their native-state topologies. This was manifested in the ability of the native topology-based model to capture folding mechanisms and the success of folding rate predictions based on various topological measures, such as the contact order. However, while the finer details of topological complexity have been thoroughly examined and related to folding kinetics, simpler characteristics of the protein, such as its overall shape, have been largely disregarded. In this study, we investigated the folding of proteins with an unusual elongated geometry that differs substantially from the common globular structure. To study the effect of the elongation degree on the folding kinetics, we used repeat proteins, which become more elongated as they include more repeating units. Some of these have apparently anomalous experimental folding kinetics, with rates that are often less than expected on the basis of rates for globular proteins possessing similar topological complexity. Using experimental folding rates and a larger set of rates obtained from simulations, we have shown that as the protein becomes increasingly elongated, its folding kinetics becomes slower and deviates more from the rate expected on the basis of topology measures fitted for globular proteins. The observed slow kinetics is a result of a more complex pathway in which stable intermediates composed of several consecutive repeats can appear. We thus propose a novel measure, an elongation-sensitive contact order, that takes into account both the extent of elongation and the topological complexity of the protein. This new measure resolves the apparent discrimination between the folding of globular and elongated repeat proteins. Our study extends the current capabilities of folding-rate predictions by unifying the kinetics of repeat and globular proteins.
Collapse
Affiliation(s)
- Tzachi Hagai
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | | |
Collapse
|