1
|
Porter LL, Artsimovitch I, Ramírez-Sarmiento CA. Metamorphic proteins and how to find them. Curr Opin Struct Biol 2024; 86:102807. [PMID: 38537533 PMCID: PMC11102287 DOI: 10.1016/j.sbi.2024.102807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 04/04/2024]
Abstract
In the last two decades, our existing notion that most foldable proteins have a unique native state has been challenged by the discovery of metamorphic proteins, which reversibly interconvert between multiple, sometimes highly dissimilar, native states. As the number of known metamorphic proteins increases, several computational and experimental strategies have emerged for gaining insights about their refolding processes and identifying unknown metamorphic proteins amongst the known proteome. In this review, we describe the current advances in biophysically and functionally ascertaining the structural interconversions of metamorphic proteins and how coevolution can be harnessed to identify novel metamorphic proteins from sequence information. We also discuss the challenges and ongoing efforts in using artificial intelligence-based protein structure prediction methods to discover metamorphic proteins and predict their corresponding three-dimensional structures.
Collapse
Affiliation(s)
- Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | - Irina Artsimovitch
- Department of Microbiology and Center for RNA Biology, The Ohio State University, Columbus, OH 43210, USA.
| | - César A Ramírez-Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago 7820436, Chile; ANID, Millennium Science Initiative Program, Millennium Institute for Integrative Biology (iBio), Santiago 833150, Chile.
| |
Collapse
|
2
|
Zuber PK, Said N, Hilal T, Wang B, Loll B, González-Higueras J, Ramírez-Sarmiento CA, Belogurov GA, Artsimovitch I, Wahl MC, Knauer SH. Concerted transformation of a hyper-paused transcription complex and its reinforcing protein. Nat Commun 2024; 15:3040. [PMID: 38589445 PMCID: PMC11001881 DOI: 10.1038/s41467-024-47368-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 03/28/2024] [Indexed: 04/10/2024] Open
Abstract
RfaH, a paralog of the universally conserved NusG, binds to RNA polymerases (RNAP) and ribosomes to activate expression of virulence genes. In free, autoinhibited RfaH, an α-helical KOW domain sequesters the RNAP-binding site. Upon recruitment to RNAP paused at an ops site, KOW is released and refolds into a β-barrel, which binds the ribosome. Here, we report structures of ops-paused transcription elongation complexes alone and bound to the autoinhibited and activated RfaH, which reveal swiveled, pre-translocated pause states stabilized by an ops hairpin in the non-template DNA. Autoinhibited RfaH binds and twists the ops hairpin, expanding the RNA:DNA hybrid to 11 base pairs and triggering the KOW release. Once activated, RfaH hyper-stabilizes the pause, which thus requires anti-backtracking factors for escape. Our results suggest that the entire RfaH cycle is solely determined by the ops and RfaH sequences and provide insights into mechanisms of recruitment and metamorphosis of NusG homologs across all life.
Collapse
Affiliation(s)
- Philipp K Zuber
- Biochemistry IV-Biophysical Chemistry, Universität Bayreuth, Bayreuth, Germany
- MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Cambridge, UK
| | - Nelly Said
- Institute of Chemistry and Biochemistry, Laboratory of Structural Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Tarek Hilal
- Institute of Chemistry and Biochemistry, Laboratory of Structural Biochemistry, Freie Universität Berlin, Berlin, Germany
- Research Center of Electron Microscopy and Core Facility BioSupraMol, Freie Universität Berlin, Berlin, Germany
| | - Bing Wang
- Department of Microbiology and Center for RNA Biology, The Ohio State University, Columbus, OH, USA
| | - Bernhard Loll
- Institute of Chemistry and Biochemistry, Laboratory of Structural Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Jorge González-Higueras
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
- ANID, Millennium Science Initiative Program, Millennium Institute for Integrative Biology, Santiago, Chile
| | - César A Ramírez-Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
- ANID, Millennium Science Initiative Program, Millennium Institute for Integrative Biology, Santiago, Chile
| | | | - Irina Artsimovitch
- Department of Microbiology and Center for RNA Biology, The Ohio State University, Columbus, OH, USA.
| | - Markus C Wahl
- Institute of Chemistry and Biochemistry, Laboratory of Structural Biochemistry, Freie Universität Berlin, Berlin, Germany.
- Macromolecular Crystallography, Helmholtz-Zentrum Berlin für Materialien und Energie, Berlin, Germany.
| | - Stefan H Knauer
- Biochemistry IV-Biophysical Chemistry, Universität Bayreuth, Bayreuth, Germany.
- Bristol-Myers Squibb GmbH & Co. KGaA, Munich, Germany.
| |
Collapse
|
3
|
Parui S, Brini E, Dill KA. Computing Free Energies of Fold-Switching Proteins Using MELD x MD. J Chem Theory Comput 2023; 19:6839-6847. [PMID: 37725050 DOI: 10.1021/acs.jctc.3c00679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2023]
Abstract
Some proteins are conformational switches, able to transition between relatively different conformations. To understand what drives them requires computing the free-energy difference ΔGAB between their stable states, A and B. Molecular dynamics (MD) simulations alone are often slow because they require a reaction coordinate and must sample many transitions in between. Here, we show that modeling employing limited data (MELD) x MD on known endstates A and B is accurate and efficient because it does not require passing over barriers or knowing reaction coordinates. We validate this method on two problems: (1) it gives correct relative populations of α and β conformers for small designed chameleon sequences of protein G; and (2) it correctly predicts the conformations of the C-terminal domain (CTD) of RfaH. Free-energy methods like MELD x MD can often resolve structures that confuse machine-learning (ML) methods.
Collapse
Affiliation(s)
- Sridip Parui
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Emiliano Brini
- School of Chemistry and Materials Science, 85 Lomb Memorial Drive, Rochester, New York 14623, United States
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
4
|
Bazmi S, Seifi B, Wallin S. Simulations of a protein fold switch reveal crowding-induced population shifts driven by disordered regions. Commun Chem 2023; 6:191. [PMID: 37689829 PMCID: PMC10492864 DOI: 10.1038/s42004-023-00995-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 08/24/2023] [Indexed: 09/11/2023] Open
Abstract
Macromolecular crowding effects on globular proteins, which usually adopt a single stable fold, have been widely studied. However, little is known about crowding effects on fold-switching proteins, which reversibly switch between distinct folds. Here we study the mutationally driven switch between the folds of GA and GB, the two 56-amino acid binding domains of protein G, using a structure-based dual-basin model. We show that, in the absence of crowders, the fold populations PA and PB can be controlled by the strengths of contacts in the two folds, κA and κB. A population balance, PA ≈ PB, is obtained for κB/κA = 0.92. The resulting model protein is subject to crowding at different packing fractions, ϕc. We find that crowding increases the GB population and reduces the GA population, reaching PB/PA ≈ 4 at ϕc = 0.44. We analyze the ϕc-dependence of the crowding-induced GA-to-GB switch using scaled particle theory, which provides a qualitative, but not quantitative, fit of our data, suggesting effects beyond a spherical description of the folds. We show that the terminal regions of the protein chain, which are intrinsically disordered only in GA, play a dominant role in the response of the fold switch to crowding effects.
Collapse
Affiliation(s)
- Saman Bazmi
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John's, NL, A1B 3X7, Canada
| | - Bahman Seifi
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John's, NL, A1B 3X7, Canada
| | - Stefan Wallin
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John's, NL, A1B 3X7, Canada.
| |
Collapse
|
5
|
Retamal-Farfán I, González-Higueras J, Galaz-Davison P, Rivera M, Ramírez-Sarmiento CA. Exploring the structural acrobatics of fold-switching proteins using simplified structure-based models. Biophys Rev 2023; 15:787-799. [PMID: 37681096 PMCID: PMC10480104 DOI: 10.1007/s12551-023-01087-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/22/2023] [Indexed: 09/09/2023] Open
Abstract
Metamorphic proteins are a paradigm of the protein folding process, by encoding two or more native states, highly dissimilar in terms of their secondary, tertiary, and even quaternary structure, on a single amino acid sequence. Moreover, these proteins structurally interconvert between these native states in a reversible manner at biologically relevant timescales as a result of different environmental cues. The large-scale rearrangements experienced by these proteins, and their sometimes high mass interacting partners that trigger their metamorphosis, makes the computational and experimental study of their structural interconversion challenging. Here, we present our efforts in studying the refolding landscapes of two quintessential metamorphic proteins, RfaH and KaiB, using simplified dual-basin structure-based models (SBMs), rigorously footed on the energy landscape theory of protein folding and the principle of minimal frustration. By using coarse-grained models in which the native contacts and bonded interactions extracted from the available experimental structures of the two native states of RfaH and KaiB are merged into a single Hamiltonian, dual-basin SBM models can be generated and savvily calibrated to explore their fold-switch in a reversible manner in molecular dynamics simulations. We also describe how some of the insights offered by these simulations have driven the design of experiments and the validation of the conformational ensembles and refolding routes observed using this simple and computationally efficient models.
Collapse
Affiliation(s)
- Ignacio Retamal-Farfán
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, 7820436 Santiago, Chile
- ANID — Millennium Science Initiative Program — Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Jorge González-Higueras
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, 7820436 Santiago, Chile
- ANID — Millennium Science Initiative Program — Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Pablo Galaz-Davison
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, 7820436 Santiago, Chile
- ANID — Millennium Science Initiative Program — Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Maira Rivera
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, 7820436 Santiago, Chile
- Department of Chemistry, Faculty of Science, McGill University, Montreal, Quebec H3A 0B8 Canada
| | - César A. Ramírez-Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, 7820436 Santiago, Chile
- ANID — Millennium Science Initiative Program — Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| |
Collapse
|
6
|
Artsimovitch I, Ramírez-Sarmiento CA. Metamorphic proteins under a computational microscope: Lessons from a fold-switching RfaH protein. Comput Struct Biotechnol J 2022; 20:5824-5837. [PMID: 36382197 PMCID: PMC9630627 DOI: 10.1016/j.csbj.2022.10.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/18/2022] [Accepted: 10/18/2022] [Indexed: 11/28/2022] Open
Abstract
Metamorphic proteins constitute unexpected paradigms of the protein folding problem, as their sequences encode two alternative folds, which reversibly interconvert within biologically relevant timescales to trigger different cellular responses. Once considered a rare aberration, metamorphism may be common among proteins that must respond to rapidly changing environments, exemplified by NusG-like proteins, the only transcription factors present in every domain of life. RfaH, a specialized paralog of bacterial NusG, undergoes an all-α to all-β domain switch to activate expression of virulence and conjugation genes in many animal and plant pathogens and is the quintessential example of a metamorphic protein. The dramatic nature of RfaH structural transformation and the richness of its evolutionary history makes for an excellent model for studying how metamorphic proteins switch folds. Here, we summarize the structural and functional evidence that sparked the discovery of RfaH as a metamorphic protein, the experimental and computational approaches that enabled the description of the molecular mechanism and refolding pathways of its structural interconversion, and the ongoing efforts to find signatures and general properties to ultimately describe the protein metamorphome.
Collapse
Affiliation(s)
- Irina Artsimovitch
- Department of Microbiology and The Center for RNA Biology, The Ohio State University, Columbus, OH, USA
| | - César A. Ramírez-Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
- ANID, Millennium Science Initiative Program, Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| |
Collapse
|
7
|
Molina JA, Galaz-Davison P, Komives EA, Artsimovitch I, Ramírez-Sarmiento CA. Allosteric couplings upon binding of RfaH to transcription elongation complexes. Nucleic Acids Res 2022; 50:6384-6397. [PMID: 35670666 PMCID: PMC9226497 DOI: 10.1093/nar/gkac453] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 05/07/2022] [Accepted: 05/18/2022] [Indexed: 12/03/2022] Open
Abstract
In every domain of life, NusG-like proteins bind to the elongating RNA polymerase (RNAP) to support processive RNA synthesis and to couple transcription to ongoing cellular processes. Structures of factor-bound transcription elongation complexes (TECs) reveal similar contacts to RNAP, consistent with a shared mechanism of action. However, NusG homologs differ in their regulatory roles, modes of recruitment, and effects on RNA synthesis. Some of these differences could be due to conformational changes in RNAP and NusG-like proteins, which cannot be captured in static structures. Here, we employed hydrogen-deuterium exchange mass spectrometry to investigate changes in local and non-local structural dynamics of Escherichia coli NusG and its paralog RfaH, which have opposite effects on expression of xenogenes, upon binding to TEC. We found that NusG and RfaH regions that bind RNAP became solvent-protected in factor-bound TECs, whereas RNAP regions that interact with both factors showed opposite deuterium uptake changes when bound to NusG or RfaH. Additional changes far from the factor-binding site were observed only with RfaH. Our results provide insights into differences in structural dynamics exerted by NusG and RfaH during binding to TEC, which may explain their different functional outcomes and allosteric regulation of transcriptional pausing by RfaH.
Collapse
Affiliation(s)
- José Alejandro Molina
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile.,ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Pablo Galaz-Davison
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile.,ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| | - Elizabeth A Komives
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093, USA
| | - Irina Artsimovitch
- Department of Microbiology and The Center for RNA Biology, The Ohio State University, Columbus, OH 43210, USA
| | - César A Ramírez-Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile.,ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Santiago, Chile
| |
Collapse
|
8
|
Galaz‐Davison P, Ferreiro DU, Ramírez‐Sarmiento CA. Coevolution-derived native and non-native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors. Protein Sci 2022; 31:e4337. [PMID: 35634768 PMCID: PMC9123645 DOI: 10.1002/pro.4337] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 04/18/2022] [Accepted: 05/03/2022] [Indexed: 11/07/2022]
Abstract
The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub-family in its evolutionary history, is known for displaying distinct features than those in NusG proteins, which allows them to regulate the expression of virulence factors in enterobacteria in a DNA sequence-dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three-dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within RfaH sequence to encode a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary-derived non-native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.
Collapse
Affiliation(s)
- Pablo Galaz‐Davison
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological SciencesPontificia Universidad Católica de ChileSantiagoChile
- ANID—Millennium Science Initiative Program—Millennium Institute for Integrative Biology (iBio)SantiagoChile
| | - Diego U. Ferreiro
- Protein Physiology Lab, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales (IQUIBICEN‐CONICET)Universidad de Buenos AiresBuenos AiresArgentina
| | - César A. Ramírez‐Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological SciencesPontificia Universidad Católica de ChileSantiagoChile
- ANID—Millennium Science Initiative Program—Millennium Institute for Integrative Biology (iBio)SantiagoChile
| |
Collapse
|