1
|
Wayment-Steele HK, Ojoawo A, Otten R, Apitz JM, Pitsawong W, Hömberger M, Ovchinnikov S, Colwell L, Kern D. Predicting multiple conformations via sequence clustering and AlphaFold2. Nature 2024; 625:832-839. [PMID: 37956700 PMCID: PMC10808063 DOI: 10.1038/s41586-023-06832-9] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 11/03/2023] [Indexed: 11/15/2023]
Abstract
AlphaFold2 (ref. 1) has revolutionized structural biology by accurately predicting single structures of proteins. However, a protein's biological function often depends on multiple conformational substates2, and disease-causing point mutations often cause population changes within these substates3,4. We demonstrate that clustering a multiple-sequence alignment by sequence similarity enables AlphaFold2 to sample alternative states of known metamorphic proteins with high confidence. Using this method, named AF-Cluster, we investigated the evolutionary distribution of predicted structures for the metamorphic protein KaiB5 and found that predictions of both conformations were distributed in clusters across the KaiB family. We used nuclear magnetic resonance spectroscopy to confirm an AF-Cluster prediction: a cyanobacteria KaiB variant is stabilized in the opposite state compared with the more widely studied variant. To test AF-Cluster's sensitivity to point mutations, we designed and experimentally verified a set of three mutations predicted to flip KaiB from Rhodobacter sphaeroides from the ground to the fold-switched state. Finally, screening for alternative states in protein families without known fold switching identified a putative alternative state for the oxidoreductase Mpt53 in Mycobacterium tuberculosis. Further development of such bioinformatic methods in tandem with experiments will probably have a considerable impact on predicting protein energy landscapes, essential for illuminating biological function.
Collapse
Affiliation(s)
- Hannah K Wayment-Steele
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Adedolapo Ojoawo
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Renee Otten
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Treeline Biosciences, Watertown, MA, USA
| | - Julia M Apitz
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Warintra Pitsawong
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Biomolecular Discovery, Relay Therapeutics, Cambridge, MA, USA
| | - Marc Hömberger
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Treeline Biosciences, Watertown, MA, USA
| | | | - Lucy Colwell
- Google Research, Cambridge, MA, USA
- Cambridge University, Cambridge, UK
| | - Dorothee Kern
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA.
| |
Collapse
|
2
|
Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter LL. Identification of a covert evolutionary pathway between two protein folds. Nat Commun 2023; 14:3177. [PMID: 37264049 DOI: 10.1038/s41467-023-38519-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 05/03/2023] [Indexed: 06/03/2023] Open
Abstract
Although homologous protein sequences are expected to adopt similar structures, some amino acid substitutions can interconvert α-helices and β-sheets. Such fold switching may have occurred over evolutionary history, but supporting evidence has been limited by the: (1) abundance and diversity of sequenced genes, (2) quantity of experimentally determined protein structures, and (3) assumptions underlying the statistical methods used to infer homology. Here, we overcome these barriers by applying multiple statistical methods to a family of ~600,000 bacterial response regulator proteins. We find that their homologous DNA-binding subunits assume divergent structures: helix-turn-helix versus α-helix + β-sheet (winged helix). Phylogenetic analyses, ancestral sequence reconstruction, and AlphaFold2 models indicate that amino acid substitutions facilitated a switch from helix-turn-helix into winged helix. This structural transformation likely expanded DNA-binding specificity. Our approach uncovers an evolutionary pathway between two protein folds and provides a methodology to identify secondary structure switching in other protein families.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
3
|
Kim AK, Looger LL, Porter LL. A high-throughput predictive method for sequence-similar fold switchers. Biopolymers 2021; 112:e23416. [PMID: 33462801 PMCID: PMC8404102 DOI: 10.1002/bip.23416] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 12/09/2020] [Accepted: 12/11/2020] [Indexed: 12/02/2022]
Abstract
Although most experimentally characterized proteins with similar sequences assume the same folds and perform similar functions, an increasing number of exceptions is emerging. One class of exceptions comprises sequence-similar fold switchers, whose secondary structures shift from α-helix <-> β-sheet through a small number of mutations, a sequence insertion, or a deletion. Predictive methods for identifying sequence-similar fold switchers are desirable because some are associated with disease and/or can perform different functions in cells. Here, we use homology-based secondary structure predictions to identify sequence-similar fold switchers from their amino acid sequences alone. To do this, we predicted the secondary structures of sequence-similar fold switchers using three different homology-based secondary structure predictors: PSIPRED, JPred4, and SPIDER3. We found that α-helix <-> β-strand prediction discrepancies from JPred4 discriminated between the different conformations of sequence-similar fold switchers with high statistical significance (P < 1.8*10-19 ). Thus, we used these discrepancies as a classifier and found that they can often robustly discriminate between sequence-similar fold switchers and sequence-similar proteins that maintain the same folds (Matthews Correlation Coefficient of 0.82). We found that JPred4 is a more robust predictor of sequence-similar fold switchers because of (a) the curated sequence database it uses to produce multiple sequence alignments and (b) its use of sequence profiles based on Hidden Markov Models. Our results indicate that inconsistencies between JPred4 secondary structure predictions can be used to identify some sequence-similar fold switchers from their sequences alone. Thus, the negative information from inconsistent secondary structure predictions can potentially be leveraged to identify sequence-similar fold switchers from the broad base of genomic sequences.
Collapse
Affiliation(s)
- Allen K. Kim
- National Library of MedicineNational Institutes of HealthBethesdaMarylandUSA
- National Heart, Lung, and Blood InstituteNational Institutes of HealthBethesdaMarylandUSA
| | - Loren L. Looger
- Howard Hughes Medical InstituteJanelia Research CampusAshburnVirginiaUSA
| | - Lauren L. Porter
- National Library of MedicineNational Institutes of HealthBethesdaMarylandUSA
- National Heart, Lung, and Blood InstituteNational Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
4
|
Hall BM, Roberts SA, Cordes MHJ. Extreme divergence between one-to-one orthologs: the structure of N15 Cro bound to operator DNA and its relationship to the λ Cro complex. Nucleic Acids Res 2020; 47:7118-7129. [PMID: 31180482 PMCID: PMC6649833 DOI: 10.1093/nar/gkz507] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Revised: 05/25/2019] [Accepted: 06/01/2019] [Indexed: 11/13/2022] Open
Abstract
The gene cro promotes lytic growth of phages through binding of Cro protein dimers to regulatory DNA sites. Most Cro proteins are one-to-one orthologs, yet their sequence, structure and binding site sequences are quite divergent across lambdoid phages. We report the cocrystal structure of bacteriophage N15 Cro with a symmetric consensus site. We contrast this complex with an orthologous structure from phage λ, which has a dissimilar binding site sequence and a Cro protein that is highly divergent in sequence, dimerization interface and protein fold. The N15 Cro complex has less DNA bending and smaller DNA-induced changes in protein structure. N15 Cro makes fewer direct contacts and hydrogen bonds to bases, relying mostly on water-mediated and Van der Waals contacts to recognize the sequence. The recognition helices of N15 Cro and λ Cro make mostly nonhomologous and nonanalogous contacts. Interface alignment scores show that half-site binding geometries of N15 Cro and λ Cro are less similar to each other than to distantly related CI repressors. Despite this divergence, the Cro family shows several code-like protein–DNA sequence covariations. In some cases, orthologous genes can achieve a similar biological function using very different specific molecular interactions.
Collapse
Affiliation(s)
- Branwen M Hall
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA
| | - Sue A Roberts
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
5
|
Zamora-Carreras H, Maestro B, Sanz JM, Jiménez MA. Turncoat Polypeptides: We Adapt to Our Environment. Chembiochem 2019; 21:432-441. [PMID: 31456307 DOI: 10.1002/cbic.201900446] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Indexed: 01/25/2023]
Abstract
A common interpretation of Anfinsen's hypothesis states that one amino acid sequence should fold into a single, native, ordered state, or a highly similar set thereof, coinciding with the global minimum in the folding-energy landscape, which, in turn, is responsible for the function of the protein. However, this classical view is challenged by many proteins and peptide sequences, which can adopt exchangeable, significantly dissimilar conformations that even fulfill different biological roles. The similarities and differences of concepts related to these proteins, mainly chameleon sequences, metamorphic proteins, and switch peptides, which are all denoted herein "turncoat" polypeptides, are reviewed. As well as adding a twist to the conventional view of protein folding, the lack of structural definition adds clear versatility to the activity of proteins and can be used as a tool for protein design and further application in biotechnology and biomedicine.
Collapse
Affiliation(s)
- Héctor Zamora-Carreras
- Instituto de Química-Física Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Serrano 119, 28006, Madrid, Spain
| | - Beatriz Maestro
- Centro de Investigaciones Biológicas (CIB), Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain
| | - Jesús M Sanz
- Centro de Investigaciones Biológicas (CIB), Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), Av. Monforte de Lemos, 3-5. Pabellón, 28029, Madrid, Spain
| | - M Angeles Jiménez
- Instituto de Química-Física Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Serrano 119, 28006, Madrid, Spain
| |
Collapse
|
6
|
Reversible fold-switching controls the functional cycle of the antitermination factor RfaH. Nat Commun 2019; 10:702. [PMID: 30742024 PMCID: PMC6370827 DOI: 10.1038/s41467-019-08567-6] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 01/17/2019] [Indexed: 01/25/2023] Open
Abstract
RfaH, member of the NusG/Spt5 family, activates virulence genes in Gram-negative pathogens. RfaH exists in two states, with its C-terminal domain (CTD) folded either as α-helical hairpin or β-barrel. In free RfaH, the α-helical CTD interacts with, and masks the RNA polymerase binding site on, the N-terminal domain, autoinhibiting RfaH and restricting its recruitment to opsDNA sequences. Upon activation, the domains separate and the CTD refolds into the β-barrel, which recruits a ribosome, activating translation. Using NMR spectroscopy, we show that only a complete ops-paused transcription elongation complex activates RfaH, probably via a transient encounter complex, allowing the refolded CTD to bind ribosomal protein S10. We also demonstrate that upon release from the elongation complex, the CTD transforms back into the autoinhibitory α-state, resetting the cycle. Transformation-coupled autoinhibition allows RfaH to achieve high specificity and potent activation of gene expression.
Collapse
|
7
|
Kumirov VK, Dykstra EM, Hall BM, Anderson WJ, Szyszka TN, Cordes MHJ. Multistep mutational transformation of a protein fold through structural intermediates. Protein Sci 2018; 27:1767-1779. [PMID: 30051937 DOI: 10.1002/pro.3488] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2018] [Revised: 07/24/2018] [Accepted: 07/25/2018] [Indexed: 12/24/2022]
Abstract
New protein folds may evolve from existing folds through metamorphic evolution involving a dramatic switch in structure. To mimic pathways by which amino acid sequence changes could induce a change in fold, we designed two folded hybrids of Xfaso 1 and Pfl 6, a pair of homologous Cro protein sequences with ~40% identity but different folds (all-α vs. α + β, respectively). Each hybrid, XPH1 or XPH2, is 85% identical in sequence to its parent, Xfaso 1 or Pfl 6, respectively; 55% identical to its noncognate parent; and ~70% identical to the other hybrid. XPH1 and XPH2 also feature a designed hybrid chameleon sequence corresponding to the C-terminal region, which switched from α-helical to β-sheet structure during Cro evolution. We report solution nuclear magnetic resonance (NMR) structures of XPH1 and XPH2 at 0.3 Å and 0.5 Å backbone root mean square deviation (RMSD), respectively. XPH1 retains a global fold generally similar to Xfaso 1, and XPH2 retains a fold similar to Pfl 6, as measured by TM-align scores (~0.7), DALI Z-scores (7-9), and backbone RMSD (2-3 Å RMSD for the most ordered regions). However, these scores also indicate significant deviations in structure. Most notably, XPH1 and XPH2 have different, and intermediate, secondary structure content relative to Xfaso 1 and Pfl 6. The multistep progression in sequence, from Xfaso 1 to XPH1 to XPH2 to Pfl 6, thus involves both abrupt and gradual changes in folding pattern. The plasticity of some protein folds may allow for "polymetamorphic" evolution through intermediate structures.
Collapse
Affiliation(s)
- Vlad K Kumirov
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Emily M Dykstra
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Branwen M Hall
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Taylor N Szyszka
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| |
Collapse
|
8
|
Kulkarni P, Solomon TL, He Y, Chen Y, Bryan PN, Orban J. Structural metamorphism and polymorphism in proteins on the brink of thermodynamic stability. Protein Sci 2018; 27:1557-1567. [PMID: 30144197 PMCID: PMC6194243 DOI: 10.1002/pro.3458] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 06/11/2018] [Indexed: 12/13/2022]
Abstract
The classical view of the structure-function paradigm advanced by Anfinsen in the 1960s is that a protein's function is inextricably linked to its three-dimensional structure and is encrypted in its amino acid sequence. However, it is now known that a significant fraction of the proteome consists of intrinsically disordered proteins (IDPs). These proteins populate a polymorphic ensemble of conformations rather than a unique structure but are still capable of performing biological functions. At the boundary, between well-ordered and inherently disordered states are proteins that are on the brink of stability, either weakly stable ordered systems or disordered but on the verge of being stable. In such marginal states, even relatively minor changes can significantly alter the energy landscape, leading to large-scale conformational remodeling. Some proteins on the edge of stability are metamorphic, with the capacity to switch from one fold topology to another in response to an environmental trigger (e.g., pH, temperature/salt, redox). Many IDPs, on the other hand, are marginally unstable such that small perturbations (e.g., phosphorylation, ligands) tip the balance over to a range of ordered, partially ordered, or even more disordered states. In general, the structural transitions described by metamorphic fold switches and polymorphic IDPs possess a number of common features including low or diminished stability, large-scale conformational changes, critical disordered regions, latent or attenuated binding sites, and expansion of function. We suggest that these transitions are, therefore, conceptually and mechanistically analogous, representing adjacent regions in the continuum of order/disorder transitions.
Collapse
Affiliation(s)
- Prakash Kulkarni
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Tsega L. Solomon
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Yanan He
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Yihong Chen
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - Philip N. Bryan
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
| | - John Orban
- W. M. Keck Laboratory for Structural BiologyUniversity of Maryland Institute for Bioscience and Biotechnology ResearchRockvilleMaryland20850
- Department of Chemistry and BiochemistryUniversity of MarylandCollege ParkMaryland20742
| |
Collapse
|
9
|
Abstract
![]()
Since
the proposal of Anfinsen’s thermodynamic hypothesis
in 1963, our understanding of protein folding and dynamics has gained
significant appreciation of its nuance and complexity. Intrinsically
disordered proteins, chameleonic sequences, morpheeins, and metamorphic
proteins have broadened the protein folding paradigm. Here, we discuss
noncanonical protein folding patterns, with an emphasis on metamorphic
proteins, and we review known metamorphic proteins that occur naturally
and that have been engineered in the laboratory. Finally, we discuss
research areas surrounding metamorphic proteins that are primed for
future exploration, including evolution, drug discovery, and the quest
for previously unrecognized metamorphs. As we enter an age where we
are capable of complex bioinformatic searches and de novo protein design, we are primed to search for previously unrecognized
metamorphic proteins and to design our own metamorphs to act as targeted,
switchable drugs; biosensors; and more.
Collapse
Affiliation(s)
- Acacia F. Dishman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
| | - Brian F. Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
| |
Collapse
|
10
|
Hou J, Zeng W, Zong Y, Chen Z, Miao C, Wang B, Lou C. Engineering the Ultrasensitive Transcription Factors by Fusing a Modular Oligomerization Domain. ACS Synth Biol 2018; 7:1188-1194. [PMID: 29733626 DOI: 10.1021/acssynbio.7b00414] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The dimerization and high-order oligomerization of transcription factors has endowed them with cooperative regulatory capabilities that play important roles in many cellular functions. However, such advanced regulatory capabilities have not been fully exploited in synthetic biology and genetic engineering. Here, we engineered a C-terminally fused oligomerization domain to improve the cooperativity of transcription factors. First, we found that two of three designed oligomerization domains significantly increased the cooperativity and ultrasensitivity of a transcription factor for the regulated promoter. Then, seven additional transcription factors were used to assess the modularity of the oligomerization domains, and their ultrasensitivity was generally improved, as assessed by their Hill coefficients. Moreover, we also demonstrated that the allosteric capability of the ligand-responsive domain remained intact when fusing with the designed oligomerization domain. As an example application, we showed that the engineered ultrasensitive transcription factor could be used to significantly improve the performance of a "stripe-forming" gene circuit. We envision that the oligomerization modules engineered in this study could act as a powerful tool to rapidly tune the underlying response profiles of synthetic gene circuits and metabolic pathway controllers.
Collapse
Affiliation(s)
- Junran Hou
- CAS Key Laboratory of Microbial Physiological and Metabolic Engineering and Institute of Microbiology, State Key Laboratory of Microbial Resources, Institute of Microbiology Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Science, University of Chinese Academy of Science, Beijing, 100149, China
| | - Weiqian Zeng
- Institute of Molecular Precision Medicine, The Xiangya Hospital, Central South University, Changsha, Hunan 410008, P. R. China
| | - Yeqing Zong
- CAS Key Laboratory of Microbial Physiological and Metabolic Engineering and Institute of Microbiology, State Key Laboratory of Microbial Resources, Institute of Microbiology Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Science, University of Chinese Academy of Science, Beijing, 100149, China
| | - Zehua Chen
- CAS Key Laboratory of Microbial Physiological and Metabolic Engineering and Institute of Microbiology, State Key Laboratory of Microbial Resources, Institute of Microbiology Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Science, University of Chinese Academy of Science, Beijing, 100149, China
| | - Chensi Miao
- CAS Key Laboratory of Microbial Physiological and Metabolic Engineering and Institute of Microbiology, State Key Laboratory of Microbial Resources, Institute of Microbiology Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Science and Technology of China, Hefei, 230027, China
| | - Baojun Wang
- School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3FF, U.K
- Centre for Synthetic and System Biology, University of Edinburgh, Edinburgh, EH9 3FF, U.K
| | - Chunbo Lou
- CAS Key Laboratory of Microbial Physiological and Metabolic Engineering and Institute of Microbiology, State Key Laboratory of Microbial Resources, Institute of Microbiology Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Science, University of Chinese Academy of Science, Beijing, 100149, China
- College of Life Sciences, University of Science and Technology of China, Hefei, 230027, China
| |
Collapse
|
11
|
Lella M, Mahalakshmi R. Metamorphic Proteins: Emergence of Dual Protein Folds from One Primary Sequence. Biochemistry 2017; 56:2971-2984. [DOI: 10.1021/acs.biochem.7b00375] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Muralikrishna Lella
- Molecular Biophysics Laboratory,
Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal 462066, India
| | - Radhakrishnan Mahalakshmi
- Molecular Biophysics Laboratory,
Department of Biological Sciences, Indian Institute of Science Education and Research, Bhopal 462066, India
| |
Collapse
|
12
|
Dybas JM, Fiser A. Development of a motif-based topology-independent structure comparison method to identify evolutionarily related folds. Proteins 2016; 84:1859-1874. [PMID: 27671894 PMCID: PMC5118133 DOI: 10.1002/prot.25169] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Revised: 08/17/2016] [Accepted: 08/25/2016] [Indexed: 11/09/2022]
Abstract
Structure conservation, functional similarities, and homologous relationships that exist across diverse protein topologies suggest that some regions of the protein fold universe are continuous. However, the current structure classification systems are based on hierarchical organizations, which cannot accommodate structural relationships that span fold definitions. Here, we describe a novel, super-secondary-structure motif-based, topology-independent structure comparison method (SmotifCOMP) that is able to quantitatively identify structural relationships between disparate topologies. The basis of SmotifCOMP is a systematically defined super-secondary-structure motif library whose representative geometries are shown to be saturated in the Protein Data Bank and exhibit a unique distribution within the known folds. SmotifCOMP offers a robust and quantitative technique to compare domains that adopt different topologies since the method does not rely on a global superposition. SmotifCOMP is used to perform an exhaustive comparison of the known folds and the identified relationships are used to produce a nonhierarchical representation of the fold space that reflects the notion of a continuous and connected fold universe. The current work offers insight into previously hypothesized evolutionary relationships between disparate folds and provides a resource for exploring novel ones. Proteins 2016; 84:1859-1874. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Joseph M. Dybas
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
| | - Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
| |
Collapse
|
13
|
Au L, Green DF. Direct Calculation of Protein Fitness Landscapes through Computational Protein Design. Biophys J 2016; 110:75-84. [PMID: 26745411 DOI: 10.1016/j.bpj.2015.11.029] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Revised: 11/03/2015] [Accepted: 11/16/2015] [Indexed: 11/24/2022] Open
Abstract
Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A(∗) search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones.
Collapse
Affiliation(s)
- Loretta Au
- Department of Statistics, The University of Chicago, Chicago, Illinois.
| | - David F Green
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York
| |
Collapse
|
14
|
A λ Cro-Like Repressor Is Essential for the Induction of Conjugative Transfer of SXT/R391 Elements in Response to DNA Damage. J Bacteriol 2015; 197:3822-33. [PMID: 26438816 DOI: 10.1128/jb.00638-15] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 09/26/2015] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED Integrative and conjugative elements (ICEs) of the SXT/R391 family are the main contributors to acquired multidrug resistance in the seventh pandemic lineage of Vibrio cholerae, the etiological agent of the diarrheal disease cholera. Conjugative transfer of SXT/R391 ICEs is triggered by antibiotics and agents promoting DNA damage through RecA-dependent autoproteolysis of SetR, an ICE-encoded λ CI-like repressor. Here, we describe the role of CroS, a distant λ Cro homolog, as a key component contributing to the regulation of expression of the activator SetCD that orchestrates the expression of the conjugative transfer genes. We show that deletion of croS abolishes the SOS response-dependent induction of SXT despite the presence of a functional setR gene. Using quantitative reverse transcription-PCR and lacZ reporter assays, we also show that CroS represses setR and setCD expression by binding to operator sites shared with SetR. Furthermore, we provide evidence of an additional operator site bound by SetR and CroS. Finally, we show that SetCD expression generates a positive feedback loop due to SXT excision and replication in a fraction of the cell population. Together, these results refine our understanding of the genetic regulation governing the propagation of major vectors of multidrug resistance. IMPORTANCE Healthcare systems worldwide are challenged by an alarming drug resistance crisis caused by the massive and rapid propagation of antibiotic resistance genes and the associated emergence of multidrug-resistant pathogenic bacteria. SXT/R391 ICEs contribute to this phenomenon not only in clinical and environmental vibrios but also in several members of the family Enterobacteriaceae. We have identified and characterized here the regulator CroS as a key factor in the stimulation of conjugative transfer of these ICEs in response to DNA-damaging agents. We have also untangled conflicting evidence regarding autoactivation of transfer by the master activator of SXT/R391 ICEs, SetCD. Discovery of CroS provides a clearer and more complete understanding of the regulatory network that governs the dissemination of SXT/R391 ICEs in bacterial populations.
Collapse
|
15
|
Eaton KV, Anderson WJ, Dubrava MS, Kumirov VK, Dykstra EM, Cordes MHJ. Studying protein fold evolution with hybrids of differently folded homologs. Protein Eng Des Sel 2015; 28:241-50. [PMID: 25991865 DOI: 10.1093/protein/gzv027] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Accepted: 04/20/2015] [Indexed: 11/13/2022] Open
Abstract
To study the sequence determinants governing protein fold evolution, we generated hybrid sequences from two homologous proteins with 40% identity but different folds: Pfl 6 Cro, which has a mixed α + β structure, and Xfaso 1 Cro, which has an all α-helical structure. First, we first examined eight chimeric hybrids in which the more structurally conserved N-terminal half of one protein was fused to the more structurally divergent C-terminal half of the other. None of these chimeras folded, as judged by circular dichroism spectra and thermal melts, suggesting that both halves have strong intrinsic preferences for the native global fold pattern, and/or that the interfaces between the halves are not readily interchangeable. Second, we examined 10 hybrids in which blocks of the structurally divergent C-terminal region were exchanged. These hybrids showed varying levels of thermal stability and suggested that the key residues in the Xfaso 1 C terminus specifying the all-α fold were concentrated near the end of helix 4 in Xfaso 1, which aligns to the end of strand 2 in Pfl 6. Finally, we generated hybrid substitutions for each individual residue in this critical region and measured thermal stabilities. The results suggested that R47 and V48 were the strongest factors that excluded formation of the α + β fold in the C-terminal region of Xfaso 1. In support of this idea, we found that the folding stability of one of the original eight chimeras could be rescued by back-substituting these two residues. Overall, the results show not only that the key factors for Cro fold specificity and evolution are global and multifarious, but also that some all-α Cro proteins have a C-terminal subdomain sequence within a few substitutions of switching to the α + β fold.
Collapse
Affiliation(s)
- Karen V Eaton
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew S Dubrava
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Vlad K Kumirov
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Emily M Dykstra
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| |
Collapse
|
16
|
Stewart KL, Nelson MR, Eaton KV, Anderson WJ, Cordes MHJ. A role for indels in the evolution of Cro protein folds. Proteins 2013; 81:1988-96. [PMID: 23843258 DOI: 10.1002/prot.24358] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 05/30/2013] [Accepted: 06/10/2013] [Indexed: 11/06/2022]
Abstract
Insertions and deletions in protein sequences, or indels, can disrupt structure and may result in changes in protein folds during evolution or in association with alternative splicing. Pfl 6 and Xfaso 1 are two proteins in the Cro family that share a common ancestor but have different folds. Sequence alignments of the two proteins show two gaps, one at the N terminus, where the sequence of Xfaso 1 is two residues shorter, and one near the center of the sequence, where the sequence of Pfl 6 is five residues shorter. To test the potential importance of indels in Cro protein evolution, we generated hybrid variants of Pfl 6 and Xfaso 1 with indels in one or both regions, chosen according to several plausible sequence alignments. All but one deletion variant completely unfolded both proteins, showing that a longer N-terminal sequence was critical for Pfl 6 folding and a longer central region sequence was critical for Xfaso 1 folding. By contrast, Xfaso 1 tolerated a longer N-terminal sequence with little destabilization, and Pfl 6 tolerated central region insertions, albeit with substantial effects on thermal stability and some perturbation of the surrounding structure. None of the mutations appeared to convert one stable fold into the other. On the basis of this two-protein comparison, short insertion and deletion mutations probably played a role in evolutionary fold change in the Cro family, but were also not the only factors.
Collapse
Affiliation(s)
- Katie L Stewart
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, 85721-0088
| | | | | | | | | |
Collapse
|
17
|
Dey F, Cliff Zhang Q, Petrey D, Honig B. Toward a "structural BLAST": using structural relationships to infer function. Protein Sci 2013; 22:359-66. [PMID: 23349097 DOI: 10.1002/pro.2225] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2012] [Revised: 01/17/2013] [Accepted: 01/17/2013] [Indexed: 02/05/2023]
Abstract
We outline a set of strategies to infer protein function from structure. The overall approach depends on extensive use of homology modeling, the exploitation of a wide range of global and local geometric relationships between protein structures and the use of machine learning techniques. The combination of modeling with broad searches of protein structure space defines a "structural BLAST" approach to infer function with high genomic coverage. Applications are described to the prediction of protein-protein and protein-ligand interactions. In the context of protein-protein interactions, our structure-based prediction algorithm, PrePPI, has comparable accuracy to high-throughput experiments. An essential feature of PrePPI involves the use of Bayesian methods to combine structure-derived information with non-structural evidence (e.g. co-expression) to assign a likelihood for each predicted interaction. This, combined with a structural BLAST approach significantly expands the range of applications of protein structure in the annotation of protein function, including systems level biological applications where it has previously played little role.
Collapse
Affiliation(s)
- Fabian Dey
- Department of Biochemistry and Molecular Biophysics, Howard Hughes Medical Institute, Center for Computational Biology and Bioinformatics and Initiative in Systems Biology, Columbia University, New York, New York 10032, USA
| | | | | | | |
Collapse
|
18
|
Häuser R, Blasche S, Dokland T, Haggård-Ljungquist E, von Brunn A, Salas M, Casjens S, Molineux I, Uetz P. Bacteriophage protein-protein interactions. Adv Virus Res 2012; 83:219-98. [PMID: 22748812 PMCID: PMC3461333 DOI: 10.1016/b978-0-12-394438-2.00006-2] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Bacteriophages T7, λ, P22, and P2/P4 (from Escherichia coli), as well as ϕ29 (from Bacillus subtilis), are among the best-studied bacterial viruses. This chapter summarizes published protein interaction data of intraviral protein interactions, as well as known phage-host protein interactions of these phages retrieved from the literature. We also review the published results of comprehensive protein interaction analyses of Pneumococcus phages Dp-1 and Cp-1, as well as coliphages λ and T7. For example, the ≈55 proteins encoded by the T7 genome are connected by ≈43 interactions with another ≈15 between the phage and its host. The chapter compiles published interactions for the well-studied phages λ (33 intra-phage/22 phage-host), P22 (38/9), P2/P4 (14/3), and ϕ29 (20/2). We discuss whether different interaction patterns reflect different phage lifestyles or whether they may be artifacts of sampling. Phages that infect the same host can interact with different host target proteins, as exemplified by E. coli phage λ and T7. Despite decades of intensive investigation, only a fraction of these phage interactomes are known. Technical limitations and a lack of depth in many studies explain the gaps in our knowledge. Strategies to complete current interactome maps are described. Although limited space precludes detailed overviews of phage molecular biology, this compilation will allow future studies to put interaction data into the context of phage biology.
Collapse
Affiliation(s)
- Roman Häuser
- Institute of Toxicology and Genetics, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Deutsches Krebsforschungszentrum, Heidelberg, Germany
| | - Sonja Blasche
- Deutsches Krebsforschungszentrum, Heidelberg, Germany
| | - Terje Dokland
- Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | | | - Albrecht von Brunn
- Max-von-Pettenkofer-Institut, Lehrstuhl Virologie, Ludwig-Maximilians-Universität, München, Germany
| | - Margarita Salas
- Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Cantoblanco, Madrid, Spain
| | - Sherwood Casjens
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah
| | - Ian Molineux
- Molecular Genetics and Microbiology, Institute for Cell and Molecular Biology, University of Texas–Austin, Austin, Texas, USA
| | - Peter Uetz
- Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, USA
| |
Collapse
|
19
|
Ahlstrom LS, Miyashita O. Molecular simulation uncovers the conformational space of the λ Cro dimer in solution. Biophys J 2011; 101:2516-24. [PMID: 22098751 DOI: 10.1016/j.bpj.2011.10.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Revised: 09/29/2011] [Accepted: 10/11/2011] [Indexed: 01/25/2023] Open
Abstract
The significant variation among solved structures of the λ Cro dimer suggests its flexibility. However, contacts in the crystal lattice could have stabilized a conformation which is unrepresentative of its dominant solution form. Here we report on the conformational space of the Cro dimer in solution using replica exchange molecular dynamics in explicit solvent. The simulated ensemble shows remarkable correlation with available x-ray structures. Network analysis and a free energy surface reveal the predominance of closed and semi-open dimers, with a modest barrier separating these two states. The fully open conformation lies higher in free energy, indicating that it requires stabilization by DNA or crystal contacts. Most NMR models are found to be unstable conformations in solution. Intersubunit salt bridging between Arg(4) and Glu(53) during simulation stabilizes closed conformations. Because a semi-open state is among the low-energy conformations sampled in simulation, we propose that Cro-DNA binding may not entail a large conformational change relative to the dominant dimer forms in solution.
Collapse
Affiliation(s)
- Logan S Ahlstrom
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, USA
| | | |
Collapse
|
20
|
Hall BM, Vaughn EE, Begaye AR, Cordes MHJ. Reengineering Cro protein functional specificity with an evolutionary code. J Mol Biol 2011; 413:914-28. [PMID: 21945527 DOI: 10.1016/j.jmb.2011.08.056] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Revised: 08/13/2011] [Accepted: 08/29/2011] [Indexed: 11/17/2022]
Abstract
Cro proteins from different lambdoid bacteriophages are extremely variable in their target consensus DNA sequences and constitute an excellent model for evolution of transcription factor specificity. We experimentally tested a bioinformatically derived evolutionary code relating switches between pairs of amino acids at three recognition helix sites in Cro proteins to switches between pairs of nucleotide bases in the cognate consensus DNA half-sites. We generated all eight possible code variants of bacteriophage λ Cro and used electrophoretic mobility shift assays to compare binding of each variant to its own putative cognate site and to the wild-type cognate site; we also tested the wild-type protein against all eight DNA sites. Each code variant showed stronger binding to its putative cognate site than to the wild-type site, except some variants containing proline at position 27; each also bound its cognate site better than wild-type Cro bound the same site. Most code variants, however, displayed poorer affinity and specificity than wild-type λ Cro. Fluorescence anisotropy assays on λ Cro and the triple code variant (PSQ) against the two cognate sites confirmed the switch in specificity and showed larger apparent effects on binding affinity and specificity. Bacterial one-hybrid assays of λ Cro and PSQ against libraries of sequences with a single randomized half-site showed the expected switches in specificity at two of three coded positions and no clear switches in specificity at noncoded positions. With a few caveats, these results confirm that the proposed Cro evolutionary code can be used to reengineer Cro specificity.
Collapse
Affiliation(s)
- Branwen M Hall
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA
| | | | | | | |
Collapse
|
21
|
Anderson WJ, Van Dorn LO, Ingram WM, Cordes MHJ. Evolutionary bridges to new protein folds: design of C-terminal Cro protein chameleon sequences. Protein Eng Des Sel 2011; 24:765-71. [PMID: 21676898 DOI: 10.1093/protein/gzr027] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Regions of amino-acid sequence that are compatible with multiple folds may facilitate evolutionary transitions in protein structure. In a previous study, we described a heuristically designed chameleon sequence (SASF1, structurally ambivalent sequence fragment 1) that could adopt either of two naturally occurring conformations (α-helical or β-sheet) when incorporated as part of the C-terminal dimerization subdomain of two structurally divergent transcription factors, P22 Cro and λ Cro. Here we describe longer chameleon designs (SASF2 and SASF3) that in the case of SASF3 correspond to the full C-terminal half of the ordered region of a P22 Cro/λ Cro sequence alignment (residues 34-57). P22-SASF2 and λ(WDD)-SASF2 show moderate thermal stability in denaturation curves monitored by circular dichroism (T(m) values of 46 and 55°C, respectively), while P22-SASF3 and λ(WDD)-SASF3 have somewhat reduced stability (T(m) values of 33 and 49°C, respectively). (13)C and (1)H NMR secondary chemical shift analysis confirms two C-terminal α-helices for P22-SASF2 (residues 36-45 and 54-57) and two C-terminal β-strands for λ(WDD)-SASF2 (residues 40-45 and 50-52), corresponding to secondary structure locations in the two parent sequences. Backbone relaxation data show that both chameleon sequences have a relatively well-ordered structure. Comparisons of (15)N-(1)H correlation spectra for SASF2 and SASF3-containing proteins strongly suggest that SASF3 retains the chameleonism of SASF2. Both Cro C-terminal conformations can be encoded in a single sequence, showing the plausibility of linking different Cro folds by smooth evolutionary transitions. The N-terminal subdomain, though largely conserved in structure, also exerts an important contextual influence on the structure of the C-terminal region.
Collapse
Affiliation(s)
- William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | | | | | | |
Collapse
|
22
|
Vaidya S, Hardy JA. Caspase-6 latent state stability relies on helical propensity. Biochemistry 2011; 50:3282-7. [PMID: 21381717 DOI: 10.1021/bi2001664] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Caspase-6 is an apoptotic protease that also plays important roles in neurodegenerative disorders, including Huntington's and Alzheimer's diseases. Caspase-6 is the only caspase known to form a latent state in which two extended helices block access to the active site. These helices must convert to strands for binding substrate. We probed the interconverting region and found that the absence of helix-breaking residues is more critical than a helix-bridging, hydrogen-bond network for formation of the extended conformation. In addition, our results suggest that caspase-6 must undergo a transition through a low-stability intermediate to bind the active-site ligand. Mature caspase-6 is capable of adopting a latent state not observed in any other caspase. The absence of any helix-breaking residues allows caspase-6 to adopt the extended helical conformation. When we introduced helix-breaking residues similar to those seen in caspase-3 or -7, the structure and stability of the latent state were compromised.
Collapse
Affiliation(s)
- Sravanti Vaidya
- Department of Chemistry, 710 North Pleasant Street, University of Massachusetts, Amherst, Massachusetts 01003, United States
| | | |
Collapse
|
23
|
Phages have adapted the same protein fold to fulfill multiple functions in virion assembly. Proc Natl Acad Sci U S A 2010; 107:14384-9. [PMID: 20660769 DOI: 10.1073/pnas.1005822107] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Evolutionary relationships may exist among very diverse groups of proteins even though they perform different functions and display little sequence similarity. The tailed bacteriophages present a uniquely amenable system for identifying such groups because of their huge diversity yet conserved genome structures. In this work, we used structural, functional, and genomic context comparisons to conclude that the head-tail connector protein and tail tube protein of bacteriophage lambda diverged from a common ancestral protein. Further comparisons of tertiary and quaternary structures indicate that the baseplate hub and tail terminator proteins of bacteriophage may also be part of this same family. We propose that all of these proteins evolved from a single ancestral tail tube protein fold, and that gene duplication followed by differentiation led to the specialized roles of these proteins seen in bacteriophages today. Although this type of evolutionary mechanism has been proposed for other systems, our work provides an evolutionary mechanism for a group of proteins with different functions that bear no sequence similarity. Our data also indicate that the addition of a structural element at the N terminus of the lambda head-tail connector protein endows it with a distinctive protein interaction capability compared with many of its putative homologues.
Collapse
|
24
|
Cao B, Elber R. Computational exploration of the network of sequence flow between protein structures. Proteins 2010; 78:985-1003. [PMID: 19899165 DOI: 10.1002/prot.22622] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
We investigate small sequence adjustments (of one or a few amino acids) that induce large conformational transitions between distinct and stable folds of proteins. Such transitions are intriguing from evolutionary and protein-design perspectives. They make it possible to search for ancient protein structures or to design protein switches that flip between folds and functions. A network of sequence flow between protein folds is computed for representative structures of the Protein Data Bank. The computed network is dense, on an average each structure is connected to tens of other folds. Proteins that attract sequences from a higher than expected number of neighboring folds are more likely to be enzymes and alpha/beta fold. The large number of connections between folds may reflect the need of enzymes to adjust their structures for alternative substrates. The network of the Cro family is discussed, and we speculate that capacity is an important factor (but not the only one) that determines protein evolution. The experimentally observed flip from all alpha to alpha + beta fold is examined by the network tools. A kinetic model for the transition of sequences between the folds (with only protein stability in mind) is proposed. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Baoqiang Cao
- Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas 78712, USA
| | | |
Collapse
|
25
|
Cardarelli L, Lam R, Tuite A, Baker LA, Sadowski PD, Radford DR, Rubinstein JL, Battaile KP, Chirgadze N, Maxwell KL, Davidson AR. The crystal structure of bacteriophage HK97 gp6: defining a large family of head-tail connector proteins. J Mol Biol 2009; 395:754-68. [PMID: 19895817 DOI: 10.1016/j.jmb.2009.10.067] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2009] [Revised: 10/27/2009] [Accepted: 10/27/2009] [Indexed: 10/20/2022]
Abstract
The final step in the morphogenesis of long-tailed double-stranded DNA bacteriophages is the joining of the DNA-filled head to the tail. The connector is a specialized structure of the head that serves as the interface for tail attachment and the point of egress for DNA from the head during infection. Here, we report the determination of a 2.1 A crystal structure of gp6 of bacteriophage HK97. Through structural comparisons, functional studies, and bioinformatic analysis, gp6 has been determined to be a component of the connector of phage HK97 that is evolutionarily related to gp15, a well-characterized connector component of bacteriophage SPP1. Whereas the structure of gp15 was solved in a monomeric form, gp6 crystallized as an oligomeric ring with the dimensions expected for a connector protein. Although this ring is composed of 13 subunits, which does not match the symmetry of the connector within the phage, sequence conservation and modeling of this structure into the cryo-electron microscopy density of the SPP1 connector indicate that this oligomeric structure represents the arrangement of gp6 subunits within the mature phage particle. Through sequence searches and genomic position analysis, we determined that gp6 is a member of a large family of connector proteins that are present in long-tailed phages. We have also identified gp7 of HK97 as a homologue of gp16 of phage SPP1, which is the second component of the connector of this phage. These proteins are members of another large protein family involved in connector assembly.
Collapse
Affiliation(s)
- Lia Cardarelli
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Structural relationships among proteins with different global topologies and their implications for function annotation strategies. Proc Natl Acad Sci U S A 2009; 106:17377-82. [PMID: 19805138 DOI: 10.1073/pnas.0907971106] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It has become increasingly apparent that geometric relationships often exist between regions of two proteins that have quite different global topologies or folds. In this article, we examine whether such relationships can be used to infer a functional connection between the two proteins in question. We find, by considering a number of examples involving metal and cation binding, sugar binding, and aromatic group binding, that geometrically similar protein fragments can share related functions, even if they have been classified as belonging to different folds and topologies. Thus, the use of classifications inevitably limits the number of functional inferences that can be obtained from the comparative analysis of protein structures. In contrast, the development of interactive computational tools that recognize the "continuous" nature of protein structure/function space, by increasing the number of potentially meaningful relationships that are considered, may offer a dramatic enhancement in the ability to extract information from protein structure databases. We introduce the MarkUs server, that embodies this strategy and that is designed for a user interested in developing and validating specific functional hypotheses.
Collapse
|
27
|
Pell LG, Liu A, Edmonds L, Donaldson LW, Howell PL, Davidson AR. The X-ray crystal structure of the phage lambda tail terminator protein reveals the biologically relevant hexameric ring structure and demonstrates a conserved mechanism of tail termination among diverse long-tailed phages. J Mol Biol 2009; 389:938-51. [PMID: 19426744 DOI: 10.1016/j.jmb.2009.04.072] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Revised: 04/23/2009] [Accepted: 04/28/2009] [Indexed: 01/28/2023]
Abstract
The tail terminator protein (TrP) plays an essential role in phage tail assembly by capping the rapidly polymerizing tail once it has reached its requisite length and serving as the interaction surface for phage heads. Here, we present the 2.7-A crystal structure of a hexameric ring of gpU, the TrP of phage lambda. Using sequence alignment analysis and site-directed mutagenesis, we have shown that this multimeric structure is biologically relevant and we have delineated its functional surfaces. Comparison of the hexameric crystal structure with the solution structure of gpU that we previously solved using NMR spectroscopy shows large structural changes occurring upon multimerization and suggests a mechanism that allows gpU to remain monomeric at high concentrations on its own, yet polymerize readily upon contact with an assembled tail tube. The gpU hexamer displays several flexible loops that play key roles in head and tail binding, implying a role for disorder-to-order transitions in controlling assembly as has been observed with other lambda morphogenetic proteins. Finally, we have found that the hexameric structure of gpU is very similar to the structure of a putative TrP from a contractile phage tail even though it displays no detectable sequence similarity. This finding coupled with further bioinformatic investigations has led us to conclude that the TrPs of non-contractile-tailed phages, such as lambda, are evolutionarily related to those of contractile-tailed phages, such as P2 and Mu, and that all long-tailed phages may utilize a conserved mechanism for tail termination.
Collapse
Affiliation(s)
- Lisa G Pell
- Department of Biochemistry, University of Toronto, ON, Canada
| | | | | | | | | | | |
Collapse
|
28
|
Das M, Ganguly T, Bandhu A, Mondal R, Chanda PK, Jana B, Sau S. Moderately thermostable phage Φ11 Cro repressor has novel DNA-binding capacity and physicochemical properties. BMB Rep 2009; 42:160-5. [DOI: 10.5483/bmbrep.2009.42.3.160] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
29
|
Dubrava MS, Ingram WM, Roberts SA, Weichsel A, Montfort WR, Cordes MHJ. N15 Cro and lambda Cro: orthologous DNA-binding domains with completely different but equally effective homodimer interfaces. Protein Sci 2008; 17:803-12. [PMID: 18369196 DOI: 10.1110/ps.073330808] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Bacteriophage Cro proteins bind to target DNA as dimers but do not all dimerize with equal strength, and differ in fold in the region of the dimer interface. We report the structure of the Cro protein from Enterobacteria phage N15 at 1.05 A resolution. The subunit fold contains five alpha-helices and is closely similar to the structure of P22 Cro (1.3 A backbone room mean square difference over 52 residues), but quite different from that of lambda Cro, a structurally diverged member of this family with a mixed alpha-helix/beta-sheet fold. N15 Cro crystallizes as a biological dimer with an extensive interface (1303 A(2) change in accessible surface area per dimer) and also dimerizes in solution with a K(d) of 5.1 +/- 1.5 microM. Its dimerization is much stronger than that of its structural homolog P22 Cro, which does not self-associate detectably in solution. Instead, the level of self-association and interfacial area for N15 Cro is similar to that of lambda Cro, even though these two orthologs do not share the same fold and have dimer interfaces that are qualitatively different in structure. The common Cro ancestor is thought to be an all-helical monomer similar to P22 Cro. We propose that two Cro descendants independently developed stronger dimerization by entirely different mechanisms.
Collapse
Affiliation(s)
- Matthew S Dubrava
- Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, Arizona 85721, USA
| | | | | | | | | | | |
Collapse
|
30
|
|
31
|
Transitive homology-guided structural studies lead to discovery of Cro proteins with 40% sequence identity but different folds. Proc Natl Acad Sci U S A 2008; 105:2343-8. [PMID: 18227506 DOI: 10.1073/pnas.0711589105] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a "stepping-stone" method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and lambda. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and lambda. The domains show 40% sequence identity but differ by switching of alpha-helix to beta-sheet in a C-terminal region spanning approximately 25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization.
Collapse
|
32
|
Lou C, Yang X, Liu X, He B, Ouyang Q. A quantitative study of lambda-phage SWITCH and its components. Biophys J 2007; 92:2685-93. [PMID: 17259278 PMCID: PMC1831702 DOI: 10.1529/biophysj.106.097089] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We propose what we believe is a new model to quantitatively describe the lambda-phage SWITCH system. The model incorporates facilitated transfer mechanism of transcription factor, which can be simplified into a two-step reaction. We first sequentially obtain two indispensable parameters by fitting our model to experimental data of two simple systems, and then apply them to study the natural lambda-SWITCH system. By incorporating the facilitated transfer mechanism, we find that in RecA(-) host Escherichia coli, the wild-type lambda-lysogenic state is in a monostable regime rather than in a bistable regime. Furthermore, the model explains the weak role of Cro protein and probably sheds light on the evolution of lambda-Cro protein, which is known to be structurally distinct from the other Cros in lambdoid family members.
Collapse
Affiliation(s)
- Chunbo Lou
- Center for Theoretical Biology and School of Physics, Peking University, Beijing, 100871, China
| | | | | | | | | |
Collapse
|
33
|
Casjens SR. Comparative genomics and evolution of the tailed-bacteriophages. Curr Opin Microbiol 2005; 8:451-8. [PMID: 16019256 DOI: 10.1016/j.mib.2005.06.014] [Citation(s) in RCA: 164] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2005] [Accepted: 06/16/2005] [Indexed: 11/29/2022]
Abstract
The number of completely sequenced tailed-bacteriophage genomes that have been published increased to more than 125 last year. The comparison of these genomes has brought their highly mosaic nature into much sharper focus. Furthermore, reports of the complete sequences of about 150 bacterial genomes have shown that the many prophage and parts thereof that reside in these bacterial genomes must comprise a significant fraction of Earth's phage gene pool. These phage and prophage genomes are fertile ground for attempts to deduce the nature of viral evolutionary processes, and such analyses have made it clear that these phage have enjoyed a significant level of horizontal exchange of genetic information throughout their long histories. The strength of these evolutionary deductions rests largely on the extensive knowledge that has accumulated during intensive study into the molecular nature of the life cycles of a few 'model system' phages over the past half century. Recent molecular studies of phages other than these model system phages have made it clear that much remains to be learnt about the variety of lifestyle strategies utilized by the tailed-phage.
Collapse
Affiliation(s)
- Sherwood R Casjens
- Division of Cell Biology and Immunology, Department of Pathology, University of Utah Medical School, Salt Lake City, UT 84132, USA.
| |
Collapse
|
34
|
Hall BM, Lefevre KR, Cordes MHJ. Sequence correlations between Cro recognition helices and cognate O(R) consensus half-sites suggest conserved rules of protein-DNA recognition. J Mol Biol 2005; 350:667-81. [PMID: 15967464 DOI: 10.1016/j.jmb.2005.05.025] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2005] [Revised: 04/09/2005] [Accepted: 05/12/2005] [Indexed: 11/27/2022]
Abstract
The O(R) regions from several lambdoid bacteriophages contain the three regulatory sites O(R)1, O(R)2 and O(R)3, to which the Cro and CI proteins can bind. These sites show imperfect dyad symmetry, have similar sequences, and generally lie on the same face of the DNA double helix. We have developed a computational method, which analyzes the O(R) regions of additional phages and predicts the location of these three sites. After tuning the method to predict known O(R) sites accurately, we used it to predict unknown sites, and ultimately compiled a database of 32 known and predicted O(R) binding site sets. We then identified sequences of the recognition helices (RH) for the cognate Cro proteins through manual inspection of multiple sequence alignments. Comparison of Cro RH and consensus O(R) half-site sequences revealed strong one-to-one correlations between two amino acids at each of three RH positions and two bases at each of three half-site positions (H1-->2, H3-->5 and H6-->6). In each of these three cases, one of the two amino acid/base-pairings corresponds to a contact observed in the crystal structure of a lambda Cro/consensus operator complex. The alternate amino acid/base combinations were rationalized using structural models. We suggest that the pairs of amino acid residues act as binary switches that efficiently modulate specificity for different consensus half-site variants during evolution. The observation of structurally reasonable amino acid-to-base correlations suggests that Cro proteins share some common rules of recognition despite their functional and structural diversity.
Collapse
Affiliation(s)
- Branwen M Hall
- Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, AZ 85721, USA
| | | | | |
Collapse
|
35
|
Levy Y, Cho SS, Onuchic JN, Wolynes PG. A Survey of Flexible Protein Binding Mechanisms and their Transition States Using Native Topology Based Energy Landscapes. J Mol Biol 2005; 346:1121-45. [PMID: 15701522 DOI: 10.1016/j.jmb.2004.12.021] [Citation(s) in RCA: 170] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2004] [Revised: 12/03/2004] [Accepted: 12/09/2004] [Indexed: 11/16/2022]
Abstract
Many cellular functions rely on interactions between protein pairs and higher oligomers. We have recently shown that binding mechanisms are robust and owing to the minimal frustration principle, just as for protein folding, are governed primarily by the protein's native topology, which is characterized by the network of non-covalent residue-residue interactions. The detailed binding mechanisms of nine dimers, a trimer, and a tetramer, each involving different degrees of flexibility and plasticity during assembly, are surveyed here using a model that is based solely on the protein topology, having a perfectly funneled energy landscape. The importance of flexibility in binding reactions is manifested by the fly-casting effect, which is diminished in magnitude when protein flexibility is removed. Many of the grosser and finer structural aspects of the various binding mechanisms (including binding of pre-folded monomers, binding of intrinsically unfolded monomers, and binding by domain-swapping) predicted by the native topology based landscape model are consistent with the mechanisms found in the laboratory. An asymmetric binding mechanism is often observed for the formation of the symmetric homodimers where one monomer is more structured at the binding transition state and serves as a template for the folding of the other monomer. Phi values were calculated to show how the structure of the binding transition state ensemble would be manifested in protein engineering studies. For most systems, the simulated Phi values are reasonably correlated with the available experimental values. This agreement suggests that the overall binding mechanism and the nature of the binding transition state ensemble can be understood from the network of interactions that stabilize the native fold. The Phi values for the formation of an antibody-antigen complex indicate a possible role for solvation of the interface in biomolecular association of large rigid proteins.
Collapse
Affiliation(s)
- Yaakov Levy
- Center for Theoretical Biological Physics, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA.
| | | | | | | |
Collapse
|