1
|
Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter LL. Identification of a covert evolutionary pathway between two protein folds. Nat Commun 2023; 14:3177. [PMID: 37264049 DOI: 10.1038/s41467-023-38519-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 05/03/2023] [Indexed: 06/03/2023] Open
Abstract
Although homologous protein sequences are expected to adopt similar structures, some amino acid substitutions can interconvert α-helices and β-sheets. Such fold switching may have occurred over evolutionary history, but supporting evidence has been limited by the: (1) abundance and diversity of sequenced genes, (2) quantity of experimentally determined protein structures, and (3) assumptions underlying the statistical methods used to infer homology. Here, we overcome these barriers by applying multiple statistical methods to a family of ~600,000 bacterial response regulator proteins. We find that their homologous DNA-binding subunits assume divergent structures: helix-turn-helix versus α-helix + β-sheet (winged helix). Phylogenetic analyses, ancestral sequence reconstruction, and AlphaFold2 models indicate that amino acid substitutions facilitated a switch from helix-turn-helix into winged helix. This structural transformation likely expanded DNA-binding specificity. Our approach uncovers an evolutionary pathway between two protein folds and provides a methodology to identify secondary structure switching in other protein families.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
2
|
Sykes J, Holland BR, Charleston MA. A review of visualisations of protein fold networks and their relationship with sequence and function. Biol Rev Camb Philos Soc 2023; 98:243-262. [PMID: 36210328 PMCID: PMC10092621 DOI: 10.1111/brv.12905] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 09/08/2022] [Accepted: 09/09/2022] [Indexed: 01/12/2023]
Abstract
Proteins form arguably the most significant link between genotype and phenotype. Understanding the relationship between protein sequence and structure, and applying this knowledge to predict function, is difficult. One way to investigate these relationships is by considering the space of protein folds and how one might move from fold to fold through similarity, or potential evolutionary relationships. The many individual characterisations of fold space presented in the literature can tell us a lot about how well the current Protein Data Bank represents protein fold space, how convergence and divergence may affect protein evolution, how proteins affect the whole of which they are part, and how proteins themselves function. A synthesis of these different approaches and viewpoints seems the most likely way to further our knowledge of protein structure evolution and thus, facilitate improved protein structure design and prediction.
Collapse
Affiliation(s)
- Janan Sykes
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Barbara R Holland
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Michael A Charleston
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| |
Collapse
|
3
|
Reversible switching between two common protein folds in a designed system using only temperature. Proc Natl Acad Sci U S A 2023; 120:e2215418120. [PMID: 36669114 PMCID: PMC9942840 DOI: 10.1073/pnas.2215418120] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Naturally occurring metamorphic proteins have the ability to interconvert from one folded state to another through either a limited set of mutations or by way of a change in the local environment. Here, we show in a designed system that it is possible to switch reversibly between two of the most common monomeric folds employing only temperature changes. We demonstrate that a latent 3α state can be unmasked from an α/β-plait topology with a single V90T amino acid substitution, populating both forms simultaneously. The equilibrium between these two states exhibits temperature dependence, such that the 3α state is predominant (>90%) at 5 °C, while the α/β-plait fold is the major species (>90%) at 30 °C. We describe the structure and dynamics of these topologies, how mutational changes affect the temperature dependence, and the energetics and kinetics of interconversion. Additionally, we demonstrate how ligand-binding function can be tightly regulated by large amplitude changes in protein structure over a relatively narrow temperature range that is relevant to biology. The 3α/αβ switch thus represents a potentially useful approach for designing proteins that alter their fold topologies in response to environmental triggers. It may also serve as a model for computational studies of temperature-dependent protein stability and fold switching.
Collapse
|
4
|
Porter LL, Kim AK, Rimal S, Looger LL, Majumdar A, Mensh BD, Starich MR, Strub MP. Many dissimilar NusG protein domains switch between α-helix and β-sheet folds. Nat Commun 2022; 13:3802. [PMID: 35778397 PMCID: PMC9247905 DOI: 10.1038/s41467-022-31532-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 06/17/2022] [Indexed: 11/16/2022] Open
Abstract
Folded proteins are assumed to be built upon fixed scaffolds of secondary structure, α-helices and β-sheets. Experimentally determined structures of >58,000 non-redundant proteins support this assumption, though it has recently been challenged by ~100 fold-switching proteins. Though ostensibly rare, these proteins raise the question of how many uncharacterized proteins have shapeshifting-rather than fixed-secondary structures. Here, we use a comparative sequence-based approach to predict fold switching in the universally conserved NusG transcription factor family, one member of which has a 50-residue regulatory subunit experimentally shown to switch between α-helical and β-sheet folds. Our approach predicts that 24% of sequences in this family undergo similar α-helix ⇌ β-sheet transitions. While these predictions cannot be reproduced by other state-of-the-art computational methods, they are confirmed by circular dichroism and nuclear magnetic resonance spectroscopy for 10 out of 10 sequence-diverse variants. This work suggests that fold switching may be a pervasive mechanism of transcriptional regulation in all kingdoms of life.
Collapse
Affiliation(s)
- Lauren L Porter
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA.
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA.
| | - Allen K Kim
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Swechha Rimal
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Loren L Looger
- Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, 20147, USA
| | - Ananya Majumdar
- The Johns Hopkins University Biomolecular NMR Center, The Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Brett D Mensh
- Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, 20147, USA
| | - Mary R Starich
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Marie-Paule Strub
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA
| |
Collapse
|
5
|
Mishra S, Looger LL, Porter LL. A sequence-based method for predicting extant fold switchers that undergo α-helix ↔ β-strand transitions. Biopolymers 2021; 112:e23471. [PMID: 34498740 PMCID: PMC8545793 DOI: 10.1002/bip.23471] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 08/10/2021] [Accepted: 08/10/2021] [Indexed: 12/21/2022]
Abstract
Extant fold-switching proteins remodel their secondary structures and change their functions in response to cellular stimuli, regulating biological processes and affecting human health. Despite their biological importance, these proteins remain understudied. Predictive methods are needed to expedite the process of discovering and characterizing more of these shapeshifting proteins. Most previous approaches require a solved structure or all-atom simulations, greatly constraining their use. Here, we propose a high-throughput sequence-based method for predicting extant fold switchers that transition from α-helix in one conformation to β-strand in the other. This method leverages two previous observations: (a) α-helix ↔ β-strand prediction discrepancies from JPred4 are a robust predictor of fold switching, and (b) the fold-switching regions (FSRs) of some extant fold switchers have different secondary structure propensities when expressed by themselves (isolated FSRs) than when expressed within the context of their parent protein (contextualized FSRs). Combining these two observations, we ran JPred4 on 99-fold-switching proteins and found strong correspondence between predicted and experimentally observed α-helix ↔ β-strand discrepancies. To test the overall robustness of this finding, we randomly selected regions of proteins not expected to switch folds (single-fold proteins) and found significantly fewer predicted α-helix ↔ β-strand discrepancies. Combining these discrepancies with the overall percentage of predicted secondary structure, we developed a classifier to identify extant fold switchers (Matthews correlation coefficient of .71). Although this classifier had a high false-negative rate (7/17), its false-positive rate was very low (2/136), suggesting that it can be used to predict a subset of extant fold switchers from a multitude of available genomic sequences.
Collapse
Affiliation(s)
- Soumya Mishra
- National Library of MedicineNational Institutes of HealthBethesdaMarylandUSA
- Howard Hughes Medical Institute, Janelia Research CampusAshburnVirginiaUSA
| | - Loren L. Looger
- Howard Hughes Medical Institute, Janelia Research CampusAshburnVirginiaUSA
| | - Lauren L. Porter
- National Library of MedicineNational Institutes of HealthBethesdaMarylandUSA
- National Heart, Lung, and Blood InstituteNational Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
6
|
Porter LL. Predictable fold switching by the SARS-CoV-2 protein ORF9b. Protein Sci 2021; 30:1723-1729. [PMID: 33934422 PMCID: PMC8242659 DOI: 10.1002/pro.4097] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 12/27/2022]
Abstract
Extant fold-switching proteins remodel their secondary structures and change their functions in response to environmental stimuli. These shapeshifting proteins regulate biological processes and are associated with a number of diseases, including tuberculosis, cancer, Alzheimer's, and autoimmune disorders. Thus, predictive methods are needed to identify more fold-switching proteins, especially since all naturally occurring instances have been discovered by chance. In response to this need, two high-throughput predictive methods have recently been developed. Here we test them on ORF9b, a newly discovered fold switcher and potential therapeutic target from the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Promisingly, both methods correctly indicate that ORF9b switches folds. We then tested the same two methods on ORF9b1, the ORF9b homolog from SARS-CoV-1. Again, both methods predict that ORF9b1 switches folds, a finding consistent with experimental binding studies. Together, these results (a) demonstrate that protein fold switching can be predicted using high-throughput computational approaches and (b) suggest that fold switching might be a general characteristic of ORF9b homologs.
Collapse
Affiliation(s)
- Lauren L. Porter
- National Library of MedicineNational Center for Biotechnology Information, National Institutes of HealthBethesdaMarylandUSA
- National Heart, Lung, and Blood InstituteBiochemistry and Biophysics Center, National Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
7
|
Seifi B, Aina A, Wallin S. Structural fluctuations and mechanical stabilities of the metamorphic protein RfaH. Proteins 2020; 89:289-300. [PMID: 32996201 DOI: 10.1002/prot.26014] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 07/17/2020] [Accepted: 08/31/2020] [Indexed: 01/08/2023]
Abstract
RfaH is a compact two-domain bacterial transcription factor that functions both as a regulator of transcription and an enhancer of translation. Underpinning the dual functional roles of RfaH is a partial but dramatic fold switch, which completely transforms the ~50-amino acid C-terminal domain (CTD) from an all-α state to an all-β state. The fold switch of the CTD occurs when RfaH binds to RNA polymerase (RNAP), however, the details of how this structural transformation is triggered is not well understood. Here we use all-atom Monte Carlo simulations to characterize structural fluctuations and mechanical stability properties of the full-length RfaH and the CTD as an isolated fragment. In agreement with experiments, we find that interdomain contacts are crucial for maintaining a stable, all-α CTD in free RfaH. To probe mechanical properties, we use pulling simulations to measure the work required to inflict local deformations at different positions along the chain. The resulting mechanical stability profile reveals that free RfaH can be divided into a "rigid" part and a "soft" part, with a boundary that nearly coincides with the boundary between the two domains. We discuss the potential role of this feature for how fold switching may be triggered by interaction with RNAP.
Collapse
Affiliation(s)
- Bahman Seifi
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St Johns, Newfoundland, Canada
| | - Adekunle Aina
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St Johns, Newfoundland, Canada
| | - Stefan Wallin
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St Johns, Newfoundland, Canada
| |
Collapse
|
8
|
Tian P, Best RB. Exploring the sequence fitness landscape of a bridge between protein folds. PLoS Comput Biol 2020; 16:e1008285. [PMID: 33048928 PMCID: PMC7553338 DOI: 10.1371/journal.pcbi.1008285] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 08/24/2020] [Indexed: 12/15/2022] Open
Abstract
Most foldable protein sequences adopt only a single native fold. Recent protein design studies have, however, created protein sequences which fold into different structures apon changes of environment, or single point mutation, the best characterized example being the switch between the folds of the GA and GB binding domains of streptococcal protein G. To obtain further insight into the design of sequences which can switch folds, we have used a computational model for the fitness landscape of a single fold, built from the observed sequence variation of protein homologues. We have recently shown that such coevolutionary models can be used to design novel foldable sequences. By appropriately combining two of these models to describe the joint fitness landscape of GA and GB, we are able to describe the propensity of a given sequence for each of the two folds. We have successfully tested the combined model against the known series of designed GA/GB hybrids. Using Monte Carlo simulations on this landscape, we are able to identify pathways of mutations connecting the two folds. In the absence of a requirement for domain stability, the most frequent paths go via sequences in which neither domain is stably folded, reminiscent of the propensity for certain intrinsically disordered proteins to fold into different structures according to context. Even if the folded state is required to be stable, we find that there is nonetheless still a wide range of sequences which are close to the transition region and therefore likely fold switches, consistent with recent estimates that fold switching may be more widespread than had been thought.
Collapse
Affiliation(s)
- Pengfei Tian
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Robert B. Best
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, U.S.A
| |
Collapse
|
9
|
Banach M, Fabian P, Stapor K, Konieczny L, Roterman I. Structure of the Hydrophobic Core Determines the 3D Protein Structure-Verification by Single Mutation Proteins. Biomolecules 2020; 10:E767. [PMID: 32423068 PMCID: PMC7281683 DOI: 10.3390/biom10050767] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 05/08/2020] [Accepted: 05/12/2020] [Indexed: 02/06/2023] Open
Abstract
Four de novo proteins differing in single mutation positions, with a chain length of 56 amino acids, represent diverse 3D structures: monomeric 3α and 4β + α folds. The reason for this diversity is seen in the different structure of the hydrophobic core as a result of synergy leading to the generation of a system in which the polypeptide chain as a whole participates. On the basis of the fuzzy oil drop model, where the structure of the hydrophobic core is expressed by means of the hydrophobic distribution function in the form of a 3D Gaussian distribution, it has been shown that the composition of the hydrophobic core in these two structural forms is different. In addition, the use of a model to determine the structure of the early intermediate in the folding process allows to indicate differences in the polypeptide chain geometry, which, combined with the construction of a common hydrophobic nucleus as an effect of specific synergy, may indicate the reason for the diversity of the folding process of the polypeptide chain. The results indicate the need to take into account the presence of an external force field originating from the water environment and that its active impact on the formation of a hydrophobic core whose participation in the stabilization of the tertiary structure is fundamental.
Collapse
Affiliation(s)
- Mateusz Banach
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Lazarza 16, 31-533 Krakow, Poland;
| | - Piotr Fabian
- Institute of Computer Science, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland; (P.F.); (K.S.)
| | - Katarzyna Stapor
- Institute of Computer Science, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland; (P.F.); (K.S.)
| | - Leszek Konieczny
- Chair of Medical Biochemistry, Medical College, Jagiellonian University, Kopernika 7, 31-034 Krakow, Poland;
| | - Irena Roterman
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Lazarza 16, 31-533 Krakow, Poland;
| |
Collapse
|
10
|
|
11
|
Kumirov VK, Dykstra EM, Hall BM, Anderson WJ, Szyszka TN, Cordes MHJ. Multistep mutational transformation of a protein fold through structural intermediates. Protein Sci 2018; 27:1767-1779. [PMID: 30051937 DOI: 10.1002/pro.3488] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2018] [Revised: 07/24/2018] [Accepted: 07/25/2018] [Indexed: 12/24/2022]
Abstract
New protein folds may evolve from existing folds through metamorphic evolution involving a dramatic switch in structure. To mimic pathways by which amino acid sequence changes could induce a change in fold, we designed two folded hybrids of Xfaso 1 and Pfl 6, a pair of homologous Cro protein sequences with ~40% identity but different folds (all-α vs. α + β, respectively). Each hybrid, XPH1 or XPH2, is 85% identical in sequence to its parent, Xfaso 1 or Pfl 6, respectively; 55% identical to its noncognate parent; and ~70% identical to the other hybrid. XPH1 and XPH2 also feature a designed hybrid chameleon sequence corresponding to the C-terminal region, which switched from α-helical to β-sheet structure during Cro evolution. We report solution nuclear magnetic resonance (NMR) structures of XPH1 and XPH2 at 0.3 Å and 0.5 Å backbone root mean square deviation (RMSD), respectively. XPH1 retains a global fold generally similar to Xfaso 1, and XPH2 retains a fold similar to Pfl 6, as measured by TM-align scores (~0.7), DALI Z-scores (7-9), and backbone RMSD (2-3 Å RMSD for the most ordered regions). However, these scores also indicate significant deviations in structure. Most notably, XPH1 and XPH2 have different, and intermediate, secondary structure content relative to Xfaso 1 and Pfl 6. The multistep progression in sequence, from Xfaso 1 to XPH1 to XPH2 to Pfl 6, thus involves both abrupt and gradual changes in folding pattern. The plasticity of some protein folds may allow for "polymetamorphic" evolution through intermediate structures.
Collapse
Affiliation(s)
- Vlad K Kumirov
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Emily M Dykstra
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Branwen M Hall
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Taylor N Szyszka
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| |
Collapse
|
12
|
Kulkarni P, Solomon TL, He Y, Chen Y, Bryan PN, Orban J. Structural metamorphism and polymorphism in proteins on the brink of thermodynamic stability. Protein Sci 2018; 27:1557-1567. [PMID: 30144197 PMCID: PMC6194243 DOI: 10.1002/pro.3458] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 06/11/2018] [Indexed: 12/13/2022]
Abstract
The classical view of the structure-function paradigm advanced by Anfinsen in the 1960s is that a protein's function is inextricably linked to its three-dimensional structure and is encrypted in its amino acid sequence. However, it is now known that a significant fraction of the proteome consists of intrinsically disordered proteins (IDPs). These proteins populate a polymorphic ensemble of conformations rather than a unique structure but are still capable of performing biological functions. At the boundary, between well-ordered and inherently disordered states are proteins that are on the brink of stability, either weakly stable ordered systems or disordered but on the verge of being stable. In such marginal states, even relatively minor changes can significantly alter the energy landscape, leading to large-scale conformational remodeling. Some proteins on the edge of stability are metamorphic, with the capacity to switch from one fold topology to another in response to an environmental trigger (e.g., pH, temperature/salt, redox). Many IDPs, on the other hand, are marginally unstable such that small perturbations (e.g., phosphorylation, ligands) tip the balance over to a range of ordered, partially ordered, or even more disordered states. In general, the structural transitions described by metamorphic fold switches and polymorphic IDPs possess a number of common features including low or diminished stability, large-scale conformational changes, critical disordered regions, latent or attenuated binding sites, and expansion of function. We suggest that these transitions are, therefore, conceptually and mechanistically analogous, representing adjacent regions in the continuum of order/disorder transitions.
Collapse
Affiliation(s)
- Prakash Kulkarni
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Tsega L. Solomon
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Yanan He
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Yihong Chen
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Philip N. Bryan
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - John Orban
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
- Department of Chemistry and BiochemistryUniversity of MarylandCollege ParkMaryland20742
| |
Collapse
|
13
|
Abstract
It is commonly thought that each globular protein has a single 3D structure, or fold, that fosters its function. In contrast, recent studies have identified several fold-switching proteins whose secondary structures can be remodeled in response to cellular stimuli. Although thought to be rare, we found 96 literature-validated fold-switching proteins by exhaustively searching the database of protein structures [Protein Data Bank (PDB)]. Characterizing these proteins led us to hypothesize that their abundance may be underrepresented in the PDB. Thus, we developed a computational method that identifies fold-switching proteins and used it to estimate that 0.5–4% of PDB proteins switch folds. These results suggest that proteins switch folds with significant frequency, which has implications for cell biology, genomics, and human health. A central tenet of biology is that globular proteins have a unique 3D structure under physiological conditions. Recent work has challenged this notion by demonstrating that some proteins switch folds, a process that involves remodeling of secondary structure in response to a few mutations (evolved fold switchers) or cellular stimuli (extant fold switchers). To date, extant fold switchers have been viewed as rare byproducts of evolution, but their frequency has been neither quantified nor estimated. By systematically and exhaustively searching the Protein Data Bank (PDB), we found ∼100 extant fold-switching proteins. Furthermore, we gathered multiple lines of evidence suggesting that these proteins are widespread in nature. Based on these lines of evidence, we hypothesized that the frequency of extant fold-switching proteins may be underrepresented by the structures in the PDB. Thus, we sought to identify other putative extant fold switchers with only one solved conformation. To do this, we identified two characteristic features of our ∼100 extant fold-switching proteins, incorrect secondary structure predictions and likely independent folding cooperativity, and searched the PDB for other proteins with similar features. Reassuringly, this method identified dozens of other proteins in the literature with indication of a structural change but only one solved conformation in the PDB. Thus, we used it to estimate that 0.5–4% of PDB proteins switch folds. These results demonstrate that extant fold-switching proteins are likely more common than the PDB reflects, which has implications for cell biology, genomics, and human health.
Collapse
|
14
|
Aina A, Wallin S. Multisequence algorithm for coarse-grained biomolecular simulations: Exploring the sequence-structure relationship of proteins. J Chem Phys 2017; 147:095102. [DOI: 10.1063/1.4986933] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- A. Aina
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John’s, Newfoundland and Labrador A1B 3X7, Canada
| | - S. Wallin
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John’s, Newfoundland and Labrador A1B 3X7, Canada
| |
Collapse
|
15
|
Zimmermann MT, Tischer A, Whitten ST, Auton M. Structural origins of misfolding propensity in the platelet adhesive von Willebrand factor A1 domain. Biophys J 2016. [PMID: 26200876 DOI: 10.1016/j.bpj.2015.06.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The von Willebrand factor (VWF) A1 and A3 domains are structurally isomorphic yet exhibit distinct mechanisms of unfolding. The A1 domain, responsible for platelet adhesion to VWF in hemostasis, unfolds through a molten globule intermediate in an apparent three-state mechanism, while A3 unfolds by a classical two-state mechanism. Inspection of the sequences or structures alone does not elucidate the source of this thermodynamic conundrum; however, the three-state character of the A1 domain suggests that it has more than one cooperative substructure yielding two separate unfolding transitions not present in A3. We investigate the extent to which structural elements contributing to intermediate conformations can be identified using a residue-specific implementation of the structure-energy-equivalence-of-domains algorithm (SEED), which parses proteins of known structure into their constituent thermodynamically cooperative components using protein-group-specific, transfer free energies. The structural elements computed to contribute to the non-two-state character coincide with regions where Von Willebrand disease mutations induce misfolded molten globule conformations of the A1 domain. This suggests a mechanism for the regulation of rheological platelet adhesion to A1 based on cooperative flexibility of the α2 and α3 helices flanking the platelet GPIbα receptor binding interface.
Collapse
Affiliation(s)
- Michael T Zimmermann
- Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota
| | - Alexander Tischer
- Division of Hematology, Department of Internal Medicine, Mayo Clinic, Rochester, Minnesota
| | - Steven T Whitten
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas
| | - Matthew Auton
- Division of Hematology, Department of Internal Medicine, Mayo Clinic, Rochester, Minnesota.
| |
Collapse
|
16
|
Hoffmann J, Wrabl JO, Hilser VJ. The role of negative selection in protein evolution revealed through the energetics of the native state ensemble. Proteins 2016; 84:435-47. [PMID: 26800099 DOI: 10.1002/prot.24989] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Revised: 12/15/2015] [Accepted: 12/19/2015] [Indexed: 12/14/2022]
Abstract
Knowing the determinants of conformational specificity is essential for understanding protein structure, stability, and fold evolution. To address this issue, a novel statistical measure of energetic compatibility between sequence and structure was developed using an experimentally validated model of the energetics of the native state ensemble. This approach successfully matched sequences from a diverse subset of the human proteome to their respective folds. Unexpectedly, significant energetic compatibility between ostensibly unrelated sequences and structures was also observed. Interrogation of these matches revealed a general framework for understanding the origins of conformational specificity within a proteome: specificity is a complex function of both the ability of a sequence to adopt folds other than the native, and ability of a fold to accommodate sequences other than the native. The regional variation in energetic compatibility indicates that the compatibility is dominated by incompatibility of sequence for alternative fold segments, suggesting that evolution of protein sequences has involved substantial negative selection, with certain segments serving as "gatekeepers" that presumably prevent alternative structures. Beyond these global trends, a size dependence exists in the degree to which the energetic compatibility is determined from negative selection, with smaller proteins displaying more negative selection. This partially explains how short sequences can adopt unique folds, despite the higher probability in shorter proteins for small numbers of mutations to increase compatibility with other folds. In providing evolutionary ground rules for the thermodynamic relationship between sequence and fold, this framework imparts valuable insight for rational design of unique folds or fold switches.
Collapse
Affiliation(s)
- Jordan Hoffmann
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, 21218.,T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, 21218
| | - James O Wrabl
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, 21218.,T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, 21218
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, 21218.,T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, 21218
| |
Collapse
|
17
|
Eaton KV, Anderson WJ, Dubrava MS, Kumirov VK, Dykstra EM, Cordes MHJ. Studying protein fold evolution with hybrids of differently folded homologs. Protein Eng Des Sel 2015; 28:241-50. [PMID: 25991865 DOI: 10.1093/protein/gzv027] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Accepted: 04/20/2015] [Indexed: 11/13/2022] Open
Abstract
To study the sequence determinants governing protein fold evolution, we generated hybrid sequences from two homologous proteins with 40% identity but different folds: Pfl 6 Cro, which has a mixed α + β structure, and Xfaso 1 Cro, which has an all α-helical structure. First, we first examined eight chimeric hybrids in which the more structurally conserved N-terminal half of one protein was fused to the more structurally divergent C-terminal half of the other. None of these chimeras folded, as judged by circular dichroism spectra and thermal melts, suggesting that both halves have strong intrinsic preferences for the native global fold pattern, and/or that the interfaces between the halves are not readily interchangeable. Second, we examined 10 hybrids in which blocks of the structurally divergent C-terminal region were exchanged. These hybrids showed varying levels of thermal stability and suggested that the key residues in the Xfaso 1 C terminus specifying the all-α fold were concentrated near the end of helix 4 in Xfaso 1, which aligns to the end of strand 2 in Pfl 6. Finally, we generated hybrid substitutions for each individual residue in this critical region and measured thermal stabilities. The results suggested that R47 and V48 were the strongest factors that excluded formation of the α + β fold in the C-terminal region of Xfaso 1. In support of this idea, we found that the folding stability of one of the original eight chimeras could be rescued by back-substituting these two residues. Overall, the results show not only that the key factors for Cro fold specificity and evolution are global and multifarious, but also that some all-α Cro proteins have a C-terminal subdomain sequence within a few substitutions of switching to the α + β fold.
Collapse
Affiliation(s)
- Karen V Eaton
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew S Dubrava
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Vlad K Kumirov
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Emily M Dykstra
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| |
Collapse
|
18
|
Elber R. Two Is a Pair, Three Is a Network. Biophys J 2015; 108:22. [DOI: 10.1016/j.bpj.2014.11.1855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Revised: 11/12/2014] [Accepted: 11/12/2014] [Indexed: 11/28/2022] Open
|