1
Galpern EA, Freiberger MI, Ferreiro DU. Large Ankyrin repeat proteins are formed with similar and energetically favorable units. PLoS One 2020; 15:e0233865. [PMID: 32579546 PMCID: PMC7314423 DOI: 10.1371/journal.pone.0233865] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 02/04/2020] [Accepted: 05/13/2020] [Indexed: 11/19/2022] Open
Abstract
Ankyrin-containing proteins are one of the most abundant repeat protein families present in all extant organisms. They are made of tandem copies of similar amino acid stretches that fold into elongated architectures. Here, we built and curated a dataset of 200 thousand proteins that contain 1.2 million Ankyrin regions and characterized the abundance, structure and energetics of the repetitive regions in natural proteins. We found that there is a continuous, roughly exponential variety of array lengths with an exceptional frequency at 24 repeats. We found that individual repeats are seldom interrupted by long insertions and accept few deletions, in line with the known tertiary structures. We found that longer arrays are made up of repeats that are more similar to each other than those of shorter arrays, and display more favourable folding energy, hinting at their evolutionary origin. The array distributions show that there is a physical upper limit to the size of an array of repeats of about 120 copies, consistent with the limit found in nature. The identity patterns within the arrays suggest that they may have originated by sequential copies of more than one Ankyrin unit.
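The claim that repeats in longer arrays are more similar to each other can be made concrete with a mean pairwise identity score. The following is a minimal sketch, not the authors' pipeline: it assumes the repeats of one array are already aligned to equal length with `-` marking gaps, and the function names `pairwise_identity` and `mean_array_identity` are hypothetical.

```python
from itertools import combinations


def pairwise_identity(a, b):
    """Fraction of aligned, non-gap positions where two repeats share a residue."""
    # Assumption: a and b are pre-aligned strings of equal length; '-' is a gap.
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return sum(x == y for x, y in pairs) / len(pairs)


def mean_array_identity(repeats):
    """Average identity over all unordered pairs of repeats in one array."""
    scores = [pairwise_identity(a, b) for a, b in combinations(repeats, 2)]
    # An array with fewer than two repeats has no pairs to compare.
    return sum(scores) / len(scores) if scores else 0.0
```

Computing this score per array and grouping arrays by their number of repeats would be one way to compare short and long arrays under these assumptions.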
Affiliation(s)
- Ezequiel A. Galpern
  - Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICET), Universidad de Buenos Aires, Buenos Aires, Argentina
- María I. Freiberger
  - Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICET), Universidad de Buenos Aires, Buenos Aires, Argentina
- Diego U. Ferreiro
  - Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICET), Universidad de Buenos Aires, Buenos Aires, Argentina
2
On the folding of a structurally complex protein to its metastable active state. Proc Natl Acad Sci U S A 2018; 115:1998-2003. [PMID: 29343647 DOI: 10.1073/pnas.1708173115] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 02/06/2023] Open
Abstract
For successful protease inhibition, the reactive center loop (RCL) of the two-domain serine protease inhibitor, α1-antitrypsin (α1-AT), needs to remain exposed in a metastable active conformation. The α1-AT RCL is sequestered in a β-sheet in the stable latent conformation. Thus, to be functional, α1-AT must always fold to a metastable conformation while avoiding folding to a stable conformation. We explore the structural basis of this choice using folding simulations of coarse-grained structure-based models of the two α1-AT conformations. Our simulations capture the key features of folding experiments performed on both conformations. The simulations also show that the free energy barrier to fold to the latent conformation is much larger than the barrier to fold to the active conformation. An entropically stabilized on-pathway intermediate lowers the barrier for folding to the active conformation. In this intermediate, the RCL is in an exposed configuration, and only one of the two α1-AT domains is folded. In contrast, early conversion of the RCL into a β-strand increases the coupling between the two α1-AT domains in the transition state and creates a larger barrier for folding to the latent conformation. Thus, unlike what happens in several proteins, where separate regions promote folding and function, the structure of the RCL, formed early during folding, determines both the conformational and the functional fate of α1-AT. Further, the short 12-residue RCL modulates the free energy barrier and the folding cooperativity of the large 370-residue α1-AT. Finally, we suggest experiments to test the predicted folding mechanism for the latent state.
3
Abstract
Structural domains are believed to be modules within proteins that can fold and function independently. Some proteins show tandem repetitions of apparent modular structure that do not fold independently, but rather co-operate in stabilizing structural forms that comprise several repeat-units. For many natural repeat-proteins, it has been shown that weak energetic links between repeats lead to the breakdown of co-operativity and the appearance of folding sub-domains within an apparently regular repeat array. The quasi-1D architecture of repeat-proteins is crucial in detailing how the local energetic balances can modulate the folding dynamics of these proteins, which can be related to the physiological behaviour of these ubiquitous biological systems.
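How a weak link between repeats can decouple a quasi-1D array can be illustrated with a nearest-neighbour Ising-like model, in which each repeat is either folded or unfolded and adjacent folded repeats gain an interface energy. The sketch below is an illustration under stated assumptions, not the model used in the cited work: the energy values, the `RT` constant (about 298 K, in kcal/mol) and the function name `folded_probabilities` are all hypothetical, and the partition function is computed by brute-force enumeration, so it only suits short arrays.

```python
import math
from itertools import product

RT = 0.593  # kcal/mol near 298 K (assumed temperature)


def folded_probabilities(intrinsic, interfaces, rt=RT):
    """Equilibrium probability that each repeat is folded.

    intrinsic[i]  : free energy of folding repeat i on its own (kcal/mol)
    interfaces[i] : free energy of the interface between folded repeats
                    i and i + 1 (negative values are stabilising)
    """
    n = len(intrinsic)
    if len(interfaces) != n - 1:
        raise ValueError("need exactly one interface per adjacent repeat pair")
    z = 0.0                     # partition function
    folded_weight = [0.0] * n   # summed Boltzmann weight with repeat i folded
    for state in product((0, 1), repeat=n):  # 0 = unfolded, 1 = folded
        g = sum(gi for gi, s in zip(intrinsic, state) if s)
        g += sum(gij for k, gij in enumerate(interfaces)
                 if state[k] and state[k + 1])
        w = math.exp(-g / rt)
        z += w
        for i, s in enumerate(state):
            if s:
                folded_weight[i] += w
    return [fw / z for fw in folded_weight]
```

Comparing an array with uniform interfaces, for example `folded_probabilities([1.0] * 4, [-3.0] * 3)`, against one with a weakened central interface, `folded_probabilities([1.0] * 4, [-3.0, 0.0, -3.0])`, is one way to explore how the two halves begin to behave as separate sub-domains in this toy setting.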
4
Using natural sequences and modularity to design common and novel protein topologies. Curr Opin Struct Biol 2016; 38:26-36. [PMID: 27270240 DOI: 10.1016/j.sbi.2016.05.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 03/01/2016] [Revised: 05/13/2016] [Accepted: 05/18/2016] [Indexed: 02/07/2023]
Abstract
Protein design is still a challenging undertaking, often requiring multiple attempts or iterations for success. Typically, the source of failure is unclear, and scoring metrics appear similar between successful and failed cases. Nevertheless, the use of sequence statistics, modularity and symmetry from natural proteins, combined with computational design both at the coarse-grained and atomistic levels is propelling a new wave of design efforts to success. Here we highlight recent examples of design, showing how the wealth of natural protein sequence and topology data may be leveraged to reduce the search space and increase the likelihood of achieving desired outcomes.
5
Hutton RD, Wilkinson J, Faccin M, Sivertsson EM, Pelizzola A, Lowe AR, Bruscolini P, Itzhaki LS. Mapping the Topography of a Protein Energy Landscape. J Am Chem Soc 2015; 137:14610-25. [PMID: 26561984 DOI: 10.1021/jacs.5b07370] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Indexed: 12/15/2022]
Abstract
Protein energy landscapes are highly complex, yet the vast majority of states within them tend to be invisible to experimentalists. Here, using site-directed mutagenesis and exploiting the simplicity of tandem-repeat protein structures, we delineate a network of these states and the routes between them. We show that our target, gankyrin, a 226-residue 7-ankyrin-repeat protein, can access two alternative (un)folding pathways. We resolve intermediates as well as transition states, constituting a comprehensive series of snapshots that map early and late stages of the two pathways and show both to be polarized such that the repeat array progressively unravels from one end of the molecule or the other. Strikingly, we find that the protein folds via one pathway but unfolds via a different one. The origins of this behavior can be rationalized using the numerical results of a simple statistical mechanics model that allows us to visualize the equilibrium behavior as well as single-molecule folding/unfolding trajectories, thereby filling in the gaps that are not accessible to direct experimental observation. Our study highlights the complexity of repeat-protein folding arising from their symmetrical structures; at the same time, however, this structural simplicity enables us to dissect the complexity and thereby map the precise topography of the energy landscape in full breadth and remarkable detail. That we can recapitulate the key features of the folding mechanism by computational analysis of the native structure alone will help toward the ultimate goal of designed amino-acid sequences with made-to-measure folding mechanisms, the Holy Grail of protein folding.
Affiliation(s)
- Richard D Hutton
  - Hutchison/MRC Research Centre, Hills Road, Cambridge CB2 0XZ, U.K.
- James Wilkinson
  - Hutchison/MRC Research Centre, Hills Road, Cambridge CB2 0XZ, U.K.
- Mauro Faccin
  - ICTEAM, Université Catholique de Louvain, Euler Building 4, Avenue Lemaître, B-1348 Louvain-la-Neuve, Belgium
- Elin M Sivertsson
  - Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, U.K.
- Alessandro Pelizzola
  - Dipartimento di Scienza Applicata e Tecnologia, CNISM, and Center for Computational Studies, Politecnico di Torino, Corso Duca degli Abruzzi 24, I-10129 Torino, Italy; INFN, Sezione di Torino, via Pietro Giuria 1, I-10125 Torino, Italy; Human Genetics Foundation (HuGeF), Via Nizza 52, I-10126 Torino, Italy
- Alan R Lowe
  - Institute of Structural and Molecular Biology and London Centre for Nanotechnology, University College London and Birkbeck College, London WC1E 7HX, U.K.
- Pierpaolo Bruscolini
  - Departamento de Física Teórica and Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Universidad de Zaragoza, c/Mariano Esquillor s/n, 50018 Zaragoza, Spain
- Laura S Itzhaki
  - Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, U.K.
6
Aksel T, Barrick D. Direct observation of parallel folding pathways revealed using a symmetric repeat protein system. Biophys J 2014; 107:220-32. [PMID: 24988356 DOI: 10.1016/j.bpj.2014.04.058] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Received: 02/24/2014] [Revised: 04/09/2014] [Accepted: 04/11/2014] [Indexed: 11/26/2022] Open
Abstract
Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule.
Affiliation(s)
- Tural Aksel
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California
- Doug Barrick
  - T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland
7
Rennoll-Bankert KE, Garcia-Garcia JC, Sinclair SH, Dumler JS. Chromatin-bound bacterial effector ankyrin A recruits histone deacetylase 1 and modifies host gene expression. Cell Microbiol 2015; 17:1640-52. [PMID: 25996657 DOI: 10.1111/cmi.12461] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Received: 03/24/2015] [Revised: 05/11/2015] [Accepted: 05/17/2015] [Indexed: 11/29/2022]
Abstract
Control of host epigenetics is becoming evident as a mechanism by which symbionts and pathogens survive. Anaplasma phagocytophilum, an obligate intracellular bacterium, down-regulates multiple host defence genes where histone deacetylase 1 (HDAC1) binds and histone 3 is deacetylated at their promoters, including the NADPH oxidase component, CYBB. How HDAC1 is targeted to defence gene promoters is unknown. Ankyrin A (AnkA), an A. phagocytophilum type IV secretion system effector, enters the granulocyte nucleus, binds stretches of AT-rich DNA and alters transcription of antimicrobial defence genes, including down-regulation of CYBB. Here we found AnkA binds to a predicted matrix attachment region in the proximal CYBB promoter. Using the CYBB promoter as a model of cis-gene silencing, we interrogated the mechanism of AnkA-mediated CYBB repression. The N-terminus of AnkA was critical for nuclear localization, the central ANK repeats and C-terminus were important for DNA binding, and most promoter activity localized to the central ANK repeats. Furthermore, a direct interaction between AnkA and HDAC1 was detected at the CYBB promoter, and was critical for AnkA-mediated CYBB repression. This novel microbial manipulation of host chromatin and gene expression provides important evidence of the direct effects that prokaryotic nuclear effectors can exert over host transcription and function.
Affiliation(s)
- Kristen E Rennoll-Bankert
  - Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, USA
- Sara H Sinclair
  - Department of Pathology, University of Maryland School of Medicine, Baltimore, MD, USA; Cellular and Molecular Medicine Program, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- J Stephen Dumler
  - Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, USA; Department of Pathology, University of Maryland School of Medicine, Baltimore, MD, USA; Division of Medical Microbiology, Department of Pathology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
8
Folding pathway of a multidomain protein depends on its topology of domain connectivity. Proc Natl Acad Sci U S A 2014; 111:15969-74. [PMID: 25267632 DOI: 10.1073/pnas.1406244111] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Indexed: 01/03/2023] Open
Abstract
How do the folding mechanisms of multidomain proteins depend on protein topology? We addressed this question by developing an Ising-like structure-based model and applying it for the analysis of free-energy landscapes and folding kinetics of an example protein, Escherichia coli dihydrofolate reductase (DHFR). DHFR has two domains, one comprising discontinuous N- and C-terminal parts and the other comprising a continuous middle part of the chain. The simulated folding pathway of DHFR is a sequential process during which the continuous domain folds first, followed by the discontinuous domain, thereby avoiding the rapid decrease in conformation entropy caused by the association of the N- and C-terminal parts during the early phase of folding. Our simulated results consistently explain the observed experimental data on folding kinetics and predict an off-pathway structural fluctuation at equilibrium. For a circular permutant for which the topological complexity of wild-type DHFR is resolved, the balance between energy and entropy is modulated, resulting in the coexistence of the two folding pathways. This coexistence of pathways should account for the experimentally observed complex folding behavior of the circular permutant.
9
Abstract
Biomolecules are the prime information processing elements of living matter. Most of these inanimate systems are polymers that compute their own structures and dynamics using as input seemingly random character strings of their sequence, following which they coalesce and perform integrated cellular functions. In large computational systems with finite interaction-codes, the appearance of conflicting goals is inevitable. Simple conflicting forces can lead to quite complex structures and behaviors, leading to the concept of frustration in condensed matter. We present here some basic ideas about frustration in biomolecules and how the frustration concept leads to a better appreciation of many aspects of the architecture of biomolecules, and especially how biomolecular structure connects to function by means of localized frustration. These ideas are simultaneously both seductively simple and perilously subtle to grasp completely. The energy landscape theory of protein folding provides a framework for quantifying frustration in large systems and has been implemented at many levels of description. We first review the notion of frustration from the areas of abstract logic and its uses in simple condensed matter systems. We then discuss how the frustration concept applies specifically to heteropolymers, testing folding landscape theory in computer simulations of protein models and in experimentally accessible systems. Studying the aspects of frustration averaged over many proteins provides ways to infer energy functions useful for reliable structure prediction. We discuss how frustration affects folding mechanisms. We review here how the biological functions of proteins are related to subtle local physical frustration effects and how frustration influences the appearance of metastable states, the nature of binding processes, catalysis and allosteric transitions. In this review, we also emphasize that frustration, far from being always a bad thing, is an essential feature of biomolecules that allows dynamics to be harnessed for function. In this way, we hope to illustrate how frustration is a fundamental concept in molecular biology.
10
González-Charro V, Rey A. Intermediates in the folding equilibrium of repeat proteins from the TPR family. Eur Biophys J 2014; 43:433-43. [DOI: 10.1007/s00249-014-0975-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 04/23/2014] [Revised: 06/20/2014] [Accepted: 07/03/2014] [Indexed: 11/29/2022]
11
Jernigan KK, Bordenstein SR. Ankyrin domains across the Tree of Life. PeerJ 2014; 2:e264. [PMID: 24688847 PMCID: PMC3932732 DOI: 10.7717/peerj.264] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Received: 08/28/2013] [Accepted: 01/15/2014] [Indexed: 11/20/2022] Open
Abstract
Ankyrin (ANK) repeats are one of the most common amino acid sequence motifs that mediate interactions between proteins of myriad sizes, shapes and functions. We assess their widespread abundance in Bacteria and Archaea for the first time and demonstrate in Bacteria that lifestyle, rather than phylogenetic history, is a predictor of ANK repeat abundance. Unrelated organisms that forge facultative and obligate symbioses with eukaryotes show enrichment for ANK repeats in comparison to free-living bacteria. The reduced genomes of obligate intracellular bacteria remarkably contain a higher fraction of ANK repeat proteins than other lifestyles, and the number of ANK repeats in each protein is augmented in comparison to other bacteria. Taken together, these results reevaluate the concept that ANK repeats are signature features of eukaryotic proteins and support the hypothesis that intracellular bacteria broadly employ ANK repeats for structure-function relationships with the eukaryotic host cell.
Affiliation(s)
- Kristin K Jernigan
  - Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Seth R Bordenstein
  - Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America; Department of Pathology, Microbiology, and Immunology, Vanderbilt University, Nashville, Tennessee, United States of America
12
A disorder-induced domino-like destabilization mechanism governs the folding and functional dynamics of the repeat protein IκBα. PLoS Comput Biol 2013; 9:e1003403. [PMID: 24367251 PMCID: PMC3868533 DOI: 10.1371/journal.pcbi.1003403] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Received: 09/18/2013] [Accepted: 11/07/2013] [Indexed: 11/19/2022] Open
Abstract
The stability of the repeat protein IκBα, a transcriptional inhibitor in mammalian cells, is critical in the functioning of the NF-κB signaling module implicated in an array of cellular processes, including cell growth, disease, immunity and apoptosis. Structurally, IκBα is complex, with both ordered and disordered regions, thus posing a challenge to the available computational protocols to model its conformational behavior. Here, we introduce a simple procedure to model disorder in systems that undergo binding-induced folding that involves modulation of the contact map guided by equilibrium experimental observables in combination with an Ising-like Wako-Saitô-Muñoz-Eaton model. This one-step procedure alone is able to reproduce a variety of experimental observables, including ensemble thermodynamics (scanning calorimetry, pre-transitions, m-values) and kinetics (roll-over in chevron plot, intermediates and their identity), and is consistent with hydrogen-deuterium exchange measurements. We further capture the intricate distance-dynamics between the domains as measured by single-molecule FRET by combining the model predictions with simple polymer physics arguments. Our results reveal a unique mechanism at work in IκBα folding, wherein disorder in one domain initiates a domino-like effect partially destabilizing neighboring domains, thus highlighting the effect of symmetry-breaking at the level of primary sequences. The offshoot is a multi-state and a dynamic conformational landscape that is populated by increasingly partially folded ensembles upon destabilization. Our results provide, in a straightforward fashion, a rationale to the promiscuous binding and short intracellular half-life of IκBα evolutionarily engineered into it through repeats with variable stabilities and expand the functional repertoire of disordered regions in proteins.