1
|
Nguyen A, Zhao H, Myagmarsuren D, Srinivasan S, Wu D, Chen J, Piszczek G, Schuck P. Modulation of biophysical properties of nucleocapsid protein in the mutant spectrum of SARS-CoV-2. eLife 2024; 13:RP94836. [PMID: 38941236 DOI: 10.7554/elife.94836] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024] Open
Abstract
Genetic diversity is a hallmark of RNA viruses and the basis for their evolutionary success. Taking advantage of the uniquely large genomic database of SARS-CoV-2, we examine the impact of mutations across the spectrum of viable amino acid sequences on the biophysical phenotypes of the highly expressed and multifunctional nucleocapsid protein. We find variation in the physicochemical parameters of its extended intrinsically disordered regions (IDRs) sufficient to allow local plasticity, but also observe functional constraints that similarly occur in related coronaviruses. In biophysical experiments with several N-protein species carrying mutations associated with major variants, we find that point mutations in the IDRs can have nonlocal impact and modulate thermodynamic stability, secondary structure, protein oligomeric state, particle formation, and liquid-liquid phase separation. In the Omicron variant, distant mutations in different IDRs have compensatory effects in shifting a delicate balance of interactions controlling protein assembly properties, and include the creation of a new protein-protein interaction interface in the N-terminal IDR through the defining P13L mutation. A picture emerges where genetic diversity is accompanied by significant variation in biophysical characteristics of functional N-protein species, in particular in the IDRs.
Collapse
Affiliation(s)
- Ai Nguyen
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Huaying Zhao
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Dulguun Myagmarsuren
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Sanjana Srinivasan
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Di Wu
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, United States
| | - Jiji Chen
- Advanced Imaging and Microscopy Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Grzegorz Piszczek
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, United States
| | - Peter Schuck
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| |
Collapse
|
2
|
Bradley D, Garand C, Belda H, Gagnon-Arsenault I, Treeck M, Elowe S, Landry CR. The substrate quality of CK2 target sites has a determinant role on their function and evolution. Cell Syst 2024; 15:544-562.e8. [PMID: 38861992 DOI: 10.1016/j.cels.2024.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 02/29/2024] [Accepted: 05/20/2024] [Indexed: 06/13/2024]
Abstract
Most biological processes are regulated by signaling modules that bind to short linear motifs. For protein kinases, substrates may have full or only partial matches to the kinase recognition motif, a property known as "substrate quality." However, it is not clear whether differences in substrate quality represent neutral variation or if they have functional consequences. We examine this question for the kinase CK2, which has many fundamental functions. We show that optimal CK2 sites are phosphorylated at maximal stoichiometries and found in many conditions, whereas minimal substrates are more weakly phosphorylated and have regulatory functions. Optimal CK2 sites tend to be more conserved, and substrate quality is often tuned by selection. For intermediate sites, increases or decreases in substrate quality may be deleterious, as we demonstrate for a CK2 substrate at the kinetochore. The results together suggest a strong role for substrate quality in phosphosite function and evolution. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- David Bradley
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, Québec City, QC G1V 0A6, Canada; Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec City, QC G1V 0A6, Canada; Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada.
| | - Chantal Garand
- PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, Québec City, QC G1V 0A6, Canada; Axe de Reproduction, Santé de la mère et de l'enfant, CHU de Québec, Université Laval, Québec City, QC, Canada
| | - Hugo Belda
- Signalling in Host-Pathogen Interaction Laboratory, The Francis Crick Institute, London NW11AT, UK
| | - Isabelle Gagnon-Arsenault
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, Québec City, QC G1V 0A6, Canada; Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec City, QC G1V 0A6, Canada; Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada
| | - Moritz Treeck
- Signalling in Host-Pathogen Interaction Laboratory, The Francis Crick Institute, London NW11AT, UK; Cell Biology of Host-Pathogen Interaction Laboratory, The Gulbenkian Institute of Science, Oeiras 2780-156, Portugal
| | - Sabine Elowe
- PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, Québec City, QC G1V 0A6, Canada; Axe de Reproduction, Santé de la mère et de l'enfant, CHU de Québec, Université Laval, Québec City, QC, Canada; Department of Pediatrics, Faculty of Medicine, Université Laval, Québec City, QC, Canada; Centre de Recherche sur le Cancer, CHU de Québec, Université Laval, Québec City, QC, Canada
| | - Christian R Landry
- Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, Québec City, QC G1V 0A6, Canada; Centre de Recherche sur les Données Massives (CRDM), Université Laval, Québec City, QC G1V 0A6, Canada; Département de Biologie, Faculté des Sciences et de Génie, Université Laval, Québec City, QC G1V 0A6, Canada.
| |
Collapse
|
3
|
Ginell GM, Emenecker RJ, Lotthammer JM, Usher ET, Holehouse AS. Direct prediction of intermolecular interactions driven by disordered regions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.03.597104. [PMID: 38895487 PMCID: PMC11185574 DOI: 10.1101/2024.06.03.597104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for a wide variety of cellular functions, many of which involve interactions with partner proteins. Molecular recognition is typically considered through the lens of sequence-specific binding events. However, a growing body of work has shown that IDRs often interact with partners in a manner that does not depend on the precise order of the amino acid order, instead driven by complementary chemical interactions leading to disordered bound-state complexes. Despite this emerging paradigm, we lack tools to describe, quantify, predict, and interpret these types of structurally heterogeneous interactions from the underlying amino acid sequences. Here, we repurpose the chemical physics developed originally for molecular simulations to develop an approach for predicting intermolecular interactions between IDRs and partner proteins. Our approach enables the direct prediction of phase diagrams, the identification of chemically-specific interaction hotspots on IDRs, and a route to develop and test mechanistic hypotheses regarding IDR function in the context of molecular recognition. We use our approach to examine a range of systems and questions to highlight its versatility and applicability.
Collapse
Affiliation(s)
- Garrett M. Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Ryan. J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Jeffrey M. Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Emery T. Usher
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| |
Collapse
|
4
|
Alston JJ, Soranno A, Holehouse AS. Conserved molecular recognition by an intrinsically disordered region in the absence of sequence conservation. RESEARCH SQUARE 2024:rs.3.rs-4477977. [PMID: 38883712 PMCID: PMC11177979 DOI: 10.21203/rs.3.rs-4477977/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for cellular function yet often appear to lack sequence conservation when assessed by multiple sequence alignments. This raises the question of if and how function can be encoded and preserved in these regions despite massive sequence variation. To address this question, we have applied coarse-grained molecular dynamics simulations to investigate non-specific RNA binding of coronavirus nucleocapsid proteins. Coronavirus nucleocapsid proteins consist of multiple interspersed disordered and folded domains that bind RNA. Here, we focus on the first two domains of coronavirus nucleocapsid proteins: the disordered N-terminal domain (NTD) and the folded RNA binding domain (RBD). While the NTD is highly variable across evolution, the RBD is structurally conserved. This combination makes the NTD-RBD a convenient model system for exploring the interplay between an IDR adjacent to a folded domain and how changes in IDR sequence can influence molecular recognition of a partner. Our results reveal a surprising degree of sequence-specificity encoded by both the composition and the precise order of the amino acids in the NTD. The presence of an NTD can - depending on the sequence - either suppress or enhance RNA binding. Despite this sensitivity, large-scale variation in NTD sequences is possible while certain sequence features are retained. Consequently, a conformationally-conserved dynamic and disordered RNA:protein complex is found across nucleocapsid protein orthologs despite large-scale changes in both NTD sequence and RBD surface chemistry. Taken together, these insights shed light on the ability of disordered regions to preserve functional characteristics despite their sequence variability.
Collapse
Affiliation(s)
- Jhullian J. Alston
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Present Address, Program In Cellular and Molecular Medicine (PCMM), Boston Children’s Hospital, Boston, MA, USA
| | - Andrea Soranno
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| |
Collapse
|
5
|
Sood A, Zhang B. Preserving condensate structure and composition by lowering sequence complexity. Biophys J 2024:S0006-3495(24)00373-4. [PMID: 38824391 DOI: 10.1016/j.bpj.2024.05.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 04/25/2024] [Accepted: 05/28/2024] [Indexed: 06/03/2024] Open
Abstract
Biomolecular condensates play a vital role in organizing cellular chemistry. They selectively partition biomolecules, preventing unwanted cross talk and buffering against chemical noise. Intrinsically disordered proteins (IDPs) serve as primary components of these condensates due to their flexibility and ability to engage in multivalent interactions, leading to spontaneous aggregation. Theoretical advancements are critical at connecting IDP sequences with condensate emergent properties to establish the so-called molecular grammar. We proposed an extension to the stickers and spacers model, incorporating heterogeneous, nonspecific pairwise interactions between spacers alongside specific interactions among stickers. Our investigation revealed that although spacer interactions contribute to phase separation and co-condensation, their nonspecific nature leads to disorganized condensates. Specific sticker-sticker interactions drive the formation of condensates with well-defined networked structures and molecular composition. We discussed how evolutionary pressures might emerge to affect these interactions, leading to the prevalence of low-complexity domains in IDP sequences. These domains suppress spurious interactions and facilitate the formation of biologically meaningful condensates.
Collapse
Affiliation(s)
- Amogh Sood
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts.
| |
Collapse
|
6
|
Singleton MD, Eisen MB. Evolutionary analyses of intrinsically disordered regions reveal widespread signals of conservation. PLoS Comput Biol 2024; 20:e1012028. [PMID: 38662765 PMCID: PMC11075841 DOI: 10.1371/journal.pcbi.1012028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 05/07/2024] [Accepted: 03/28/2024] [Indexed: 05/08/2024] Open
Abstract
Intrinsically disordered regions (IDRs) are segments of proteins without stable three-dimensional structures. As this flexibility allows them to interact with diverse binding partners, IDRs play key roles in cell signaling and gene expression. Despite the prevalence and importance of IDRs in eukaryotic proteomes and various biological processes, associating them with specific molecular functions remains a significant challenge due to their high rates of sequence evolution. However, by comparing the observed values of various IDR-associated properties against those generated under a simulated model of evolution, a recent study found most IDRs across the entire yeast proteome contain conserved features. Furthermore, it showed clusters of IDRs with common "evolutionary signatures," i.e. patterns of conserved features, were associated with specific biological functions. To determine if similar patterns of conservation are found in the IDRs of other systems, in this work we applied a series of phylogenetic models to over 7,500 orthologous IDRs identified in the Drosophila genome to dissect the forces driving their evolution. By comparing models of constrained and unconstrained continuous trait evolution using the Brownian motion and Ornstein-Uhlenbeck models, respectively, we identified signals of widespread constraint, indicating conservation of distributed features is mechanism of IDR evolution common to multiple biological systems. In contrast to the previous study in yeast, however, we observed limited evidence of IDR clusters with specific biological functions, which suggests a more complex relationship between evolutionary constraints and function in the IDRs of multicellular organisms.
Collapse
Affiliation(s)
- Marc D. Singleton
- Howard Hughes Medical Institute, UC Berkeley, Berkeley, California, United States of America
| | - Michael B. Eisen
- Howard Hughes Medical Institute, UC Berkeley, Berkeley, California, United States of America
- Department of Molecular and Cell Biology, UC Berkeley, Berkeley, California, United States of America
| |
Collapse
|
7
|
Nguyen A, Zhao H, Myagmarsuren D, Srinivasan S, Wu D, Chen J, Piszczek G, Schuck P. Modulation of Biophysical Properties of Nucleocapsid Protein in the Mutant Spectrum of SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.21.568093. [PMID: 38045241 PMCID: PMC10690151 DOI: 10.1101/2023.11.21.568093] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Genetic diversity is a hallmark of RNA viruses and the basis for their evolutionary success. Taking advantage of the uniquely large genomic database of SARS-CoV-2, we examine the impact of mutations across the spectrum of viable amino acid sequences on the biophysical phenotypes of the highly expressed and multifunctional nucleocapsid protein. We find variation in the physicochemical parameters of its extended intrinsically disordered regions (IDRs) sufficient to allow local plasticity, but also exhibiting functional constraints that similarly occur in related coronaviruses. In biophysical experiments with several N-protein species carrying mutations associated with major variants, we find that point mutations in the IDRs can have nonlocal impact and modulate thermodynamic stability, secondary structure, protein oligomeric state, particle formation, and liquid-liquid phase separation. In the Omicron variant, distant mutations in different IDRs have compensatory effects in shifting a delicate balance of interactions controlling protein assembly properties, and include the creation of a new protein-protein interaction interface in the N-terminal IDR through the defining P13L mutation. A picture emerges where genetic diversity is accompanied by significant variation in biophysical characteristics of functional N-protein species, in particular in the IDRs.
Collapse
Affiliation(s)
- Ai Nguyen
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Huaying Zhao
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Dulguun Myagmarsuren
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Sanjana Srinivasan
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Di Wu
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Jiji Chen
- Advanced Imaging and Microscopy Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Grzegorz Piszczek
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Peter Schuck
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
8
|
Lotthammer JM, Ginell GM, Griffith D, Emenecker RJ, Holehouse AS. Direct prediction of intrinsically disordered protein conformational properties from sequence. Nat Methods 2024; 21:465-476. [PMID: 38297184 PMCID: PMC10927563 DOI: 10.1038/s41592-023-02159-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 12/20/2023] [Indexed: 02/02/2024]
Abstract
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well described by a stable three-dimensional structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means that IDRs are largely absent from the Protein Data Bank, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations and deep learning to develop ALBATROSS, a deep-learning model for predicting ensemble dimensions of IDRs, including the radius of gyration, end-to-end distance, polymer-scaling exponent and ensemble asphericity, directly from sequences at a proteome-wide scale. ALBATROSS is lightweight, easy to use and accessible as both a locally installable software package and a point-and-click-style interface via Google Colab notebooks. We first demonstrate the applicability of our predictors by examining the generalizability of sequence-ensemble relationships in IDRs. Then, we leverage the high-throughput nature of ALBATROSS to characterize the sequence-specific biophysical behavior of IDRs within and between proteomes.
Collapse
Affiliation(s)
- Jeffrey M Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Garrett M Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Ryan J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
9
|
Holehouse AS, Kragelund BB. The molecular basis for cellular function of intrinsically disordered protein regions. Nat Rev Mol Cell Biol 2024; 25:187-211. [PMID: 37957331 DOI: 10.1038/s41580-023-00673-0] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Intrinsically disordered protein regions exist in a collection of dynamic interconverting conformations that lack a stable 3D structure. These regions are structurally heterogeneous, ubiquitous and found across all kingdoms of life. Despite the absence of a defined 3D structure, disordered regions are essential for cellular processes ranging from transcriptional control and cell signalling to subcellular organization. Through their conformational malleability and adaptability, disordered regions extend the repertoire of macromolecular interactions and are readily tunable by their structural and chemical context, making them ideal responders to regulatory cues. Recent work has led to major advances in understanding the link between protein sequence and conformational behaviour in disordered regions, yet the link between sequence and molecular function is less well defined. Here we consider the biochemical and biophysical foundations that underlie how and why disordered regions can engage in productive cellular functions, provide examples of emerging concepts and discuss how protein disorder contributes to intracellular information processing and regulation of cellular function.
Collapse
Affiliation(s)
- Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St Louis, St Louis, MO, USA.
| | - Birthe B Kragelund
- REPIN, Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
10
|
Mihalič F, Arcila D, Pettersson ME, Farkhondehkish P, Andersson E, Andersson L, Betancur-R R, Jemth P. Conservation of Affinity Rather Than Sequence Underlies a Dynamic Evolution of the Motif-Mediated p53/MDM2 Interaction in Ray-Finned Fishes. Mol Biol Evol 2024; 41:msae018. [PMID: 38301272 PMCID: PMC10901556 DOI: 10.1093/molbev/msae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 12/12/2023] [Accepted: 01/22/2024] [Indexed: 02/03/2024] Open
Abstract
The transcription factor and cell cycle regulator p53 is marked for degradation by the ubiquitin ligase MDM2. The interaction between these 2 proteins is mediated by a conserved binding motif in the disordered p53 transactivation domain (p53TAD) and the folded SWIB domain in MDM2. The conserved motif in p53TAD from zebrafish displays a 20-fold weaker interaction with MDM2, compared to the interaction in human and chicken. To investigate this apparent difference, we tracked the molecular evolution of the p53TAD/MDM2 interaction among ray-finned fishes (Actinopterygii), the largest vertebrate clade. Intriguingly, phylogenetic analyses, ancestral sequence reconstructions, and binding experiments showed that different loss-of-affinity changes in the canonical binding motif within p53TAD have occurred repeatedly and convergently in different fish lineages, resulting in relatively low extant affinities (KD = 0.5 to 5 μM). However, for 11 different fish p53TAD/MDM2 interactions, nonconserved regions flanking the canonical motif increased the affinity 4- to 73-fold to be on par with the human interaction. Our findings suggest that compensating changes at conserved and nonconserved positions within the motif, as well as in flanking regions of low conservation, underlie a stabilizing selection of "functional affinity" in the p53TAD/MDM2 interaction. Such interplay complicates bioinformatic prediction of binding and calls for experimental validation. Motif-mediated protein-protein interactions involving short binding motifs and folded interaction domains are very common across multicellular life. It is likely that the evolution of affinity in motif-mediated interactions often involves an interplay between specific interactions made by conserved motif residues and nonspecific interactions by nonconserved disordered regions.
Collapse
Affiliation(s)
- Filip Mihalič
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Dahiana Arcila
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Mats E Pettersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Pouria Farkhondehkish
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77483, USA
| | - Ricardo Betancur-R
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| |
Collapse
|
11
|
Sood A, Zhang B. Preserving condensate structure and composition by lowering sequence complexity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.29.569249. [PMID: 38076908 PMCID: PMC10705451 DOI: 10.1101/2023.11.29.569249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Biological condensates play a vital role in organizing cellular chemistry. They selectively partition biomolecules, preventing unwanted cross-talk and buffering against chemical noise. Intrinsically disordered proteins (IDPs) serve as primary components of these condensates due to their flexibility and ability to engage in multivalent, non-specific interactions, leading to spontaneous aggregation. Theoretical advancements are critical at connecting IDP sequences with condensate emergent properties to establish the so-called molecular grammar. We proposed an extension to the stickers and spacers model, incorporating non-specific pairwise interactions between spacers alongside specific interactions among stickers. Our investigation revealed that while spacer interactions contribute to phase separation and co-condensation, their non-specific nature leads to disorganized condensates. Specific sticker-sticker interactions drive the formation of condensates with well-defined structures and molecular composition. We discussed how evolutionary pressures might emerge to affect these interactions, leading to the prevalence of low complexity domains in IDP sequences. These domains suppress spurious interactions and facilitate the formation of biologically meaningful condensates.
Collapse
Affiliation(s)
- Amogh Sood
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
12
|
Alston JJ, Soranno A, Holehouse AS. Conserved molecular recognition by an intrinsically disordered region in the absence of sequence conservation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.06.552128. [PMID: 37609146 PMCID: PMC10441348 DOI: 10.1101/2023.08.06.552128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Intrinsically disordered regions (IDRs) are critical for cellular function, yet often appear to lack sequence conservation when assessed by multiple sequence alignments. This raises the question of if and how function can be encoded and preserved in these regions despite massive sequence variation. To address this question, we have applied coarse-grained molecular dynamics simulations to investigate non-specific RNA binding of coronavirus nucleocapsid proteins. Coronavirus nucleocapsid proteins consist of multiple interspersed disordered and folded domains that bind RNA. We focussed here on the first two domains of coronavirus nucleocapsid proteins, the disordered N-terminal domain (NTD) followed by the folded RNA binding domain (RBD). While the NTD is highly variable across evolution, the RBD is structurally conserved. This combination makes the NTD-RBD a convenient model system to explore the interplay between an IDR adjacent to a folded domain, and how changes in IDR sequence can influence molecular recognition of a partner. Our results reveal a surprising degree of sequence-specificity encoded by both the composition and the precise order of the amino acids in the NTD. The presence of an NTD can - depending on the sequence - either suppress or enhance RNA binding. Despite this sensitivity, large-scale variation in NTD sequences is possible while certain sequence features are retained. Consequently, a conformationally-conserved fuzzy RNA:protein complex is found across nucleocapsid protein orthologs, despite large-scale changes in both NTD sequence and RBD surface chemistry. Taken together, these insights shed light on the ability of disordered regions to preserve functional characteristics despite their sequence variability.
Collapse
|
13
|
Flynn AJ, Miller K, Codjoe JM, King MR, Haswell ES. Mechanosensitive ion channels MSL8, MSL9, and MSL10 have environmentally sensitive intrinsically disordered regions with distinct biophysical characteristics in vitro. PLANT DIRECT 2023; 7:e515. [PMID: 37547488 PMCID: PMC10400277 DOI: 10.1002/pld3.515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 06/26/2023] [Accepted: 06/27/2023] [Indexed: 08/08/2023]
Abstract
Intrinsically disordered protein regions (IDRs) are highly dynamic sequences that rapidly sample a collection of conformations over time. In the past several decades, IDRs have emerged as a major component of many proteomes, comprising ~30% of all eukaryotic protein sequences. Proteins with IDRs function in a wide range of biological pathways and are notably enriched in signaling cascades that respond to environmental stresses. Here, we identify and characterize intrinsic disorder in the soluble cytoplasmic N-terminal domains of MSL8, MSL9, and MSL10, three members of the MscS-like (MSL) family of mechanosensitive ion channels. In plants, MSL channels are proposed to mediate cell and organelle osmotic homeostasis. Bioinformatic tools unanimously predicted that the cytosolic N-termini of MSL channels are intrinsically disordered. We examined the N-terminus of MSL10 (MSL10N) as an exemplar of these IDRs and circular dichroism spectroscopy confirms its disorder. MSL10N adopted a predominately helical structure when exposed to the helix-inducing compound trifluoroethanol (TFE). Furthermore, in the presence of molecular crowding agents, MSL10N underwent structural changes and exhibited alterations to its homotypic interaction favorability. Lastly, interrogations of collective behavior via in vitro imaging of condensates indicated that MSL8N, MSL9N, and MSL10N have sharply differing propensities for self-assembly into condensates, both inherently and in response to salt, temperature, and molecular crowding. Taken together, these data establish the N-termini of MSL channels as intrinsically disordered regions with distinct biophysical properties and the potential to respond uniquely to changes in their physiochemical environment.
Collapse
Affiliation(s)
- Aidan J. Flynn
- Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
- NSF Center for Engineering Mechanobiology, Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
- Department of Biochemistry and BiophysicsWashington University in St. LouisSt. LouisMissouriUSA
| | - Kari Miller
- Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
- NSF Center for Engineering Mechanobiology, Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
| | - Jennette M. Codjoe
- Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
- NSF Center for Engineering Mechanobiology, Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
| | - Matthew R. King
- Department of Biomedical EngineeringWashington University in St. LouisSt. LouisMissouriUSA
| | - Elizabeth S. Haswell
- Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
- NSF Center for Engineering Mechanobiology, Department of BiologyWashington University in St. LouisSt. LouisMissouriUSA
| |
Collapse
|
14
|
Bardwell L, Thorner J. Mitogen-activated protein kinase (MAPK) cascades-A yeast perspective. Enzymes 2023; 54:137-170. [PMID: 37945169 DOI: 10.1016/bs.enz.2023.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
Discovery of the class of protein kinase now dubbed a mitogen (or messenger)-activated protein kinase (MAPK) is an illustrative example of how disparate lines of investigation can converge and reveal an enzyme family universally conserved among eukaryotes, from single-celled microbes to humans. Moreover, elucidation of the circuitry controlling MAPK function defined a now overarching principle in enzyme regulation-the concept of an activation cascade mediated by sequential phosphorylation events. Particularly ground-breaking for this field of exploration were the contributions of genetic approaches conducted using several model organisms, but especially the budding yeast Saccharomyces cerevisiae. Notably, examination of how haploid yeast cells respond to their secreted peptide mating pheromones was crucial in pinpointing genes encoding MAPKs and their upstream activators. Fully contemporaneous biochemical analysis of the activities elicited upon stimulation of mammalian cells by insulin and other growth- and differentiation-inducing factors lead eventually to the demonstration that components homologous to those in yeast were involved. Continued studies of these pathways in yeast were integral to other foundational discoveries in MAPK signaling, including the roles of tethering, scaffolding and docking interactions.
Collapse
Affiliation(s)
- Lee Bardwell
- Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, CA, United States
| | - Jeremy Thorner
- Division of Biochemistry, Biophysics and Structural Biology, Department of Molecular and Cell Biology, College of Letters and Science, University of California, Berkeley, Berkeley, CA, United States.
| |
Collapse
|
15
|
Jonas F, Carmi M, Krupkin B, Steinberger J, Brodsky S, Jana T, Barkai N. The molecular grammar of protein disorder guiding genome-binding locations. Nucleic Acids Res 2023; 51:4831-4844. [PMID: 36938874 PMCID: PMC10250222 DOI: 10.1093/nar/gkad184] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 01/25/2023] [Accepted: 03/15/2023] [Indexed: 03/21/2023] Open
Abstract
Intrinsically disordered regions (IDRs) direct transcription factors (TFs) towards selected genomic occurrences of their binding motif, as exemplified by budding yeast's Msn2. However, the sequence basis of IDR-directed TF binding selectivity remains unknown. To reveal this sequence grammar, we analyze the genomic localizations of >100 designed IDR mutants, each carrying up to 122 mutations within this 567-AA region. Our data points at multivalent interactions, carried by hydrophobic-mostly aliphatic-residues dispersed within a disordered environment and independent of linear sequence motifs, as the key determinants of Msn2 genomic localization. The implications of our results for the mechanistic basis of IDR-based TF binding preferences are discussed.
Collapse
Affiliation(s)
- Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Miri Carmi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Beniamin Krupkin
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Joseph Steinberger
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sagie Brodsky
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Tamar Jana
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
16
|
Son S, Kim B, Yang J, Kim VN. Role of the proline-rich disordered domain of DROSHA in intronic microRNA processing. Genes Dev 2023; 37:383-397. [PMID: 37236670 PMCID: PMC10270192 DOI: 10.1101/gad.350275.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 04/24/2023] [Indexed: 05/28/2023]
Abstract
DROSHA serves as a gatekeeper of the microRNA (miRNA) pathway by processing primary transcripts (pri-miRNAs). While the functions of structured domains of DROSHA have been well documented, the contribution of N-terminal proline-rich disordered domain (PRD) remains elusive. Here we show that the PRD promotes the processing of miRNA hairpins located within introns. We identified a DROSHA isoform (p140) lacking the PRD, which is produced by proteolytic cleavage. Small RNA sequencing revealed that p140 is significantly impaired in the maturation of intronic miRNAs. Consistently, our minigene constructs demonstrated that PRD enhances the processing of intronic hairpins, but not those in exons. Splice site mutations did not affect the PRD's enhancing effect on intronic constructs, suggesting that the PRD acts independently of splicing reaction by interacting with sequences residing within introns. The N-terminal regions from zebrafish and Xenopus DROSHA can replace the human counterpart, indicating functional conservation despite poor sequence alignment. Moreover, we found that rapidly evolving intronic miRNAs are generally more dependent on PRD than conserved ones, suggesting a role of PRD in miRNA evolution. Our study reveals a new layer of miRNA regulation mediated by a low-complexity disordered domain that senses the genomic contexts of miRNA loci.
Collapse
Affiliation(s)
- Soomin Son
- Center for RNA Research, Institute for Basic Science, Seoul 08826, Korea
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Baekgyu Kim
- Center for RNA Research, Institute for Basic Science, Seoul 08826, Korea
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Jihye Yang
- Center for RNA Research, Institute for Basic Science, Seoul 08826, Korea
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - V Narry Kim
- Center for RNA Research, Institute for Basic Science, Seoul 08826, Korea;
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| |
Collapse
|
17
|
Shin SM, Lauzadis J, Itson-Zoske B, Cai Y, Fan F, Natarajan GK, Kwok WM, Puopolo M, Hogan QH, Yu H. Targeting intrinsically disordered regions facilitates discovery of calcium channels 3.2 inhibitory peptides for adeno-associated virus-mediated peripheral analgesia. Pain 2022; 163:2466-2484. [PMID: 35420557 PMCID: PMC9562599 DOI: 10.1097/j.pain.0000000000002650] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 03/19/2022] [Accepted: 03/23/2022] [Indexed: 11/27/2022]
Abstract
ABSTRACT Ample data support a prominent role of peripheral T-type calcium channels 3.2 (Ca V 3.2) in generating pain states. Development of primary sensory neuron-specific inhibitors of Ca V 3.2 channels is an opportunity for achieving effective analgesic therapeutics, but success has been elusive. Small peptides, especially those derived from natural proteins as inhibitory peptide aptamers (iPAs), can produce highly effective and selective blockade of specific nociceptive molecular pathways to reduce pain with minimal off-target effects. In this study, we report the engineering of the potent and selective iPAs of Ca V 3.2 from the intrinsically disordered regions (IDRs) of Ca V 3.2 intracellular segments. Using established prediction algorithms, we localized the IDRs in Ca V 3.2 protein and identified several Ca V 3.2iPA candidates that significantly reduced Ca V 3.2 current in HEK293 cells stably expressing human wide-type Ca V 3.2. Two prototype Ca V 3.2iPAs (iPA1 and iPA2) derived from the IDRs of Ca V 3.2 intracellular loops 2 and 3, respectively, were expressed selectively in the primary sensory neurons of dorsal root ganglia in vivo using recombinant adeno-associated virus (AAV), which produced sustained inhibition of calcium current conducted by Ca V 3.2/T-type channels and significantly attenuated both evoked and spontaneous pain behavior in rats with neuropathic pain after tibial nerve injury. Recordings from dissociated sensory neurons showed that AAV-mediated Ca V 3.2iPA expression suppressed neuronal excitability, suggesting that Ca V 3.2iPA treatment attenuated pain by reversal of injury-induced neuronal hypersensitivity. Collectively, our results indicate that Ca V 3.2iPAs are promising analgesic leads that, combined with AAV-mediated delivery in anatomically targeted sensory ganglia, have the potential to be a selective peripheral Ca V 3.2-targeting strategy for clinical treatment of pain.
Collapse
Affiliation(s)
- Seung Min Shin
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Justas Lauzadis
- Department of Anesthesiology, Stony Brook University, Stony Brook, NY, United States
| | - Brandon Itson-Zoske
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Yongsong Cai
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
- Xi'an Jiaotong University Health Science Center, Xi'an, Shaanxi, People's Republic of China
| | - Fan Fan
- Department of Pharmacology and Toxicology, The University of Mississippi Medical Center, Jackson, MS, United States
| | - Gayathri K. Natarajan
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Wai-Meng Kwok
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
- Department of Pharmacology and Toxicology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Michelino Puopolo
- Department of Anesthesiology, Stony Brook University, Stony Brook, NY, United States
| | - Quinn H. Hogan
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Hongwei Yu
- Department of Anesthesiology, Medical College of Wisconsin, Milwaukee, WI, United States
| |
Collapse
|
18
|
Sangster AG, Zarin T, Moses AM. Evolution of short linear motifs and disordered proteins Topic: yeast as model system to study evolution. Curr Opin Genet Dev 2022; 76:101964. [PMID: 35939968 DOI: 10.1016/j.gde.2022.101964] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 06/29/2022] [Accepted: 07/08/2022] [Indexed: 11/26/2022]
Abstract
Evolutionary preservation of protein structure had a major influence on the field of molecular evolution: changes in individual amino acids that did not disrupt protein folding would either have no effect or subtly change the 'lock' so that it could fit a new 'key'. Homology of individual amino acids could be confidently assigned through sequence alignments, and models of evolution could be tested. This view of molecular evolution excluded large regions of proteins that could not be confidently aligned, such as intrinsically disordered regions (IDRs) that do not fold into stable structures. In the last decade, major progress has been made in understanding the evolution of IDRs, much of it facilitated by new experimental and computational approaches in yeast. Here, we review this progress as well as several still outstanding questions.
Collapse
Affiliation(s)
- Ami G Sangster
- Cell & Systems Biology, University of Toronto, 25 Harbord St., Toronto, ON M5S 3G5, Canada
| | - Taraneh Zarin
- Cell & Systems Biology, University of Toronto, 25 Harbord St., Toronto, ON M5S 3G5, Canada. https://twitter.com/@taraneh_z
| | - Alan M Moses
- Cell & Systems Biology, University of Toronto, 25 Harbord St., Toronto, ON M5S 3G5, Canada.
| |
Collapse
|
19
|
Discovering molecular features of intrinsically disordered regions by using evolution for contrastive learning. PLoS Comput Biol 2022; 18:e1010238. [PMID: 35767567 PMCID: PMC9275697 DOI: 10.1371/journal.pcbi.1010238] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 07/12/2022] [Accepted: 05/23/2022] [Indexed: 02/07/2023] Open
Abstract
A major challenge to the characterization of intrinsically disordered regions (IDRs), which are widespread in the proteome, but relatively poorly understood, is the identification of molecular features that mediate functions of these regions, such as short motifs, amino acid repeats and physicochemical properties. Here, we introduce a proteome-scale feature discovery approach for IDRs. Our approach, which we call “reverse homology”, exploits the principle that important functional features are conserved over evolution. We use this as a contrastive learning signal for deep learning: given a set of homologous IDRs, the neural network has to correctly choose a held-out homolog from another set of IDRs sampled randomly from the proteome. We pair reverse homology with a simple architecture and standard interpretation techniques, and show that the network learns conserved features of IDRs that can be interpreted as motifs, repeats, or bulk features like charge or amino acid propensities. We also show that our model can be used to produce visualizations of what residues and regions are most important to IDR function, generating hypotheses for uncharacterized IDRs. Our results suggest that feature discovery using unsupervised neural networks is a promising avenue to gain systematic insight into poorly understood protein sequences. Intrinsically disordered regions (IDRs) are widespread in proteins but are poorly understood on a systematic level because they evolve too rapidly for classic bioinformatics methods to be effective. We designed a neural network that learns what features (for example, electrostatic charge, or the presence of certain motifs) might be important to the function of IDRs, even when we don’t have prior knowledge of function. Our neural network learns by exploiting principles of evolution. Important features tend to be conserved over species, so guessing what sequences evolved from the same common ancestor helps the neural network identify these features. Importantly, training a neural network this way can be defined as a fully automatic operation, so no manual effort is required. After our neural network is trained, we can apply interpretation techniques to understand what kinds of features are important to IDRs globally in the proteome, and to form hypotheses about specific IDRs. We show that many of the features our neural network learns are consistent with features we already know to be important to IDRs. We hope that our neural network can be applied to help biologists form hypotheses about poorly characterized IDRs.
Collapse
|
20
|
Ghosh K, Huihui J, Phillips M, Haider A. Rules of Physical Mathematics Govern Intrinsically Disordered Proteins. Annu Rev Biophys 2022; 51:355-376. [PMID: 35119946 PMCID: PMC9190209 DOI: 10.1146/annurev-biophys-120221-095357] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
In stark contrast to foldable proteins with a unique folded state, intrinsically disordered proteins and regions (IDPs) persist in perpetually disordered ensembles. Yet an IDP ensemble has conformational features-even when averaged-that are specific to its sequence. In fact, subtle changes in an IDP sequence can modulate its conformational features and its function. Recent advances in theoretical physics reveal a set of elegant mathematical expressions that describe the intricate relationships among IDP sequences, their ensemble conformations, and the regulation of their biological functions. These equations also describe the molecular properties of IDP sequences that predict similarities and dissimilarities in their functions and facilitate classification of sequences by function, an unmet challenge to traditional bioinformatics. These physical sequence-patterning metrics offer a promising new avenue for advancing synthetic biology at a time when multiple novel functional modes mediated by IDPs are emerging.
Collapse
Affiliation(s)
- Kingshuk Ghosh
- Department of Physics and Astronomy, University of Denver, Denver, Colorado, USA,Molecular and Cellular Biophysics Program, University of Denver, Denver, Colorado, USA
| | - Jonathan Huihui
- Department of Physics and Astronomy, University of Denver, Denver, Colorado, USA
| | - Michael Phillips
- Department of Physics and Astronomy, University of Denver, Denver, Colorado, USA
| | - Austin Haider
- Molecular and Cellular Biophysics Program, University of Denver, Denver, Colorado, USA
| |
Collapse
|
21
|
Balasubramanian S, Andreani M, Andrade JG, Saha T, Sundaravinayagam D, Garzón J, Zhang W, Popp O, Hiraga SI, Rahjouei A, Rosen DB, Mertins P, Chait BT, Donaldson AD, Di Virgilio M. Protection of nascent DNA at stalled replication forks is mediated by phosphorylation of RIF1 intrinsically disordered region. eLife 2022; 11:e75047. [PMID: 35416772 PMCID: PMC9007588 DOI: 10.7554/elife.75047] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 03/30/2022] [Indexed: 12/22/2022] Open
Abstract
RIF1 is a multifunctional protein that plays key roles in the regulation of DNA processing. During repair of DNA double-strand breaks (DSBs), RIF1 functions in the 53BP1-Shieldin pathway that inhibits resection of DNA ends to modulate the cellular decision on which repair pathway to engage. Under conditions of replication stress, RIF1 protects nascent DNA at stalled replication forks from degradation by the DNA2 nuclease. How these RIF1 activities are regulated at the post-translational level has not yet been elucidated. Here, we identified a cluster of conserved ATM/ATR consensus SQ motifs within the intrinsically disordered region (IDR) of mouse RIF1 that are phosphorylated in proliferating B lymphocytes. We found that phosphorylation of the conserved IDR SQ cluster is dispensable for the inhibition of DSB resection by RIF1, but is essential to counteract DNA2-dependent degradation of nascent DNA at stalled replication forks. Therefore, our study identifies a key molecular feature that enables the genome-protective function of RIF1 during DNA replication stress.
Collapse
Affiliation(s)
- Sandhya Balasubramanian
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
- Freie Universität BerlinBerlinGermany
| | - Matteo Andreani
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
- Freie Universität BerlinBerlinGermany
| | - Júlia Goncalves Andrade
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
| | - Tannishtha Saha
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
- Freie Universität BerlinBerlinGermany
| | - Devakumar Sundaravinayagam
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
| | - Javier Garzón
- Institute of Medical Sciences, University of Aberdeen, ForesterhillAberdeenUnited Kingdom
| | - Wenzhu Zhang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller UniversityNew YorkUnited States
| | - Oliver Popp
- Proteomics Platform, Max Delbrück Center for Molecular Medicine in the Helmholtz Association and Berlin Institute of HealthBerlinGermany
| | - Shin-ichiro Hiraga
- Institute of Medical Sciences, University of Aberdeen, ForesterhillAberdeenUnited Kingdom
| | - Ali Rahjouei
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
| | - Daniel B Rosen
- Laboratory of Molecular Immunology, The Rockefeller UniversityNew YorkUnited States
| | - Philipp Mertins
- Proteomics Platform, Max Delbrück Center for Molecular Medicine in the Helmholtz Association and Berlin Institute of HealthBerlinGermany
| | - Brian T Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller UniversityNew YorkUnited States
| | - Anne D Donaldson
- Institute of Medical Sciences, University of Aberdeen, ForesterhillAberdeenUnited Kingdom
| | - Michela Di Virgilio
- Laboratory of Genome Diversification & Integrity, Max Delbrück Center for Molecular Medicine in the Helmholtz AssociationBerlinGermany
- Charité-Universitätsmedizin BerlinBerlinGermany
| |
Collapse
|
22
|
Huang X, Xiao N, Zou Y, Xie Y, Tang L, Zhang Y, Yu Y, Li Y, Xu C. Heterotypic transcriptional condensates formed by prion-like paralogous proteins canalize flowering transition in tomato. Genome Biol 2022; 23:78. [PMID: 35287709 PMCID: PMC8919559 DOI: 10.1186/s13059-022-02646-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 03/07/2022] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Paralogs that arise from gene duplications during genome evolution enable genetic redundancy and phenotypic robustness. Variation in the coding or regulatory sequence of paralogous transcriptional regulators diversifies their functions and relationships, which provides developmental robustness against genetic or environmental perturbation. The fate transition of plant shoot stem cells for flowering and reproductive success requires a robust transcriptional control. However, how paralogs function and interact to achieve such robustness is unknown. RESULTS Here, we explore the genetic relationship and protein behavior of ALOG family transcriptional factors with diverse transcriptional abundance in shoot meristems. A mutant spectrum covers single and higher-order mutant combinations of five ALOG paralogs and creates a continuum of flowering transition defects, showing gradually enhanced precocious flowering, along with inflorescence simplification from wild-type-like to progressively fewer flowers until solitary flower with sterile floral organs. Therefore, these paralogs play unequal roles and act together to achieve a robust genetic canalization. All five proteins contain prion-like intrinsically disordered regions (IDRs) and undergo phase separation. Accumulated mutations following gene duplications lead to IDR variations among ALOG paralogs, resulting in divergent phase separation and transcriptional regulation capabilities. Remarkably, they retain the ancestral abilities to assemble into a heterotypic condensate that prevents precocious activation of the floral identity gene ANANTHA. CONCLUSIONS Our study reveals a novel genetic canalization mechanism enabled by heterotypic transcriptional condensates formed by paralogous protein interactions and phase separation, uncovering the molecular link between gene duplication caused IDR variation and robust transcriptional control of stem cell fate transition.
Collapse
Affiliation(s)
- Xiaozhen Huang
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Nan Xiao
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yupan Zou
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Yue Xie
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lingli Tang
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yueqin Zhang
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,College of Coastal Agricultural Sciences Guangdong Ocean University, Zhanjiang, China
| | - Yuan Yu
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yiting Li
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China.,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Cao Xu
- State Key Laboratory of Plant Genomics, National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China. .,CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
23
|
Cohan MC, Shinn MK, Lalmansingh JM, Pappu RV. Uncovering Non-random Binary Patterns Within Sequences of Intrinsically Disordered Proteins. J Mol Biol 2022; 434:167373. [PMID: 34863777 PMCID: PMC10178624 DOI: 10.1016/j.jmb.2021.167373] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 10/24/2021] [Accepted: 11/16/2021] [Indexed: 01/21/2023]
Abstract
Sequence-ensemble relationships of intrinsically disordered proteins (IDPs) are governed by binary patterns such as the linear clustering or mixing of specific residues or residue types with respect to one another. To enable the discovery of potentially important, shared patterns across sequence families, we describe a computational method referred to as NARDINI for Non-random Arrangement of Residues in Disordered Regions Inferred using Numerical Intermixing. This work was partially motivated by the observation that parameters that are currently in use for describing different binary patterns are not interoperable across IDPs of different amino acid compositions and lengths. In NARDINI, we generate an ensemble of scrambled sequences to set up a composition-specific null model for the patterning parameters of interest. We then compute a series of pattern-specific z-scores to quantify how each pattern deviates from a null model for the IDP of interest. The z-scores help in identifying putative non-random linear sequence patterns within an IDP. We demonstrate the use of NARDINI derived z-scores by identifying sequence patterns in three well-studied IDP systems. We also demonstrate how NARDINI can be deployed to study archetypal IDPs across homologs and orthologs. Overall, NARDINI is likely to aid in designing novel IDPs with a view toward engineering new sequence-function relationships or uncovering cryptic ones. We further propose that the z-scores introduced here are likely to be useful for theoretical and computational descriptions of sequence-ensemble relationships across IDPs of different compositions and lengths.
Collapse
Affiliation(s)
- Megan C Cohan
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA
| | - Min Kyung Shinn
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA
| | | | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA.
| |
Collapse
|
24
|
Chin AF, Zheng Y, Hilser VJ. Phylogenetic convergence of phase separation and mitotic function in the disordered protein BuGZ. Protein Sci 2022; 31:822-834. [PMID: 34984754 DOI: 10.1002/pro.4270] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 12/29/2021] [Accepted: 12/30/2021] [Indexed: 11/05/2022]
Abstract
Intrinsically disordered proteins (IDPs) effect biological function despite their sequence-encoded lack of preference for stable three-dimensional structure. Among their many functions, IDPs form membraneless cellular compartments through liquid-liquid phase separation (LLPS), also termed biomolecular condensation. The extent to which LLPS has been evolutionarily selected remains largely unknown, as the complexities of IDP evolution hamper progress. Unlike structured proteins, rapid sequence divergence typical of IDPs confounds inference of their biophysical or biological functions from comparative sequence analyses. Here, we leverage mitosis as a universal eukaryotic feature to interrogate condensate evolutionary history. We observe that evolution has conserved the ability for six homologs of the mitotic IDP BuGZ to undergo LLPS and to serve the same mitotic function, despite low sequence conservation. We also observe that cellular context may tune LLPS. The phylogenetic correlation of LLPS and mitotic function in one protein raises the possibility of an ancient evolutionary interplay between LLPS and biological function, dating back at least 1.6 billion years to the last common ancestor of plants and animals. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Alexander F Chin
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, USA
| | - Yixian Zheng
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, USA.,Department of Embryology, Carnegie Institution for Science, 3520 San Martin Drive, Baltimore, Maryland, USA
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, USA.,T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland, USA
| |
Collapse
|
25
|
Dasmeh P, Doronin R, Wagner A. The length scale of multivalent interactions is evolutionarily conserved in fungal and vertebrate phase-separating proteins. Genetics 2022; 220:iyab184. [PMID: 34791214 PMCID: PMC8733453 DOI: 10.1093/genetics/iyab184] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/06/2021] [Indexed: 11/14/2022] Open
Abstract
One key feature of proteins that form liquid droplets by phase separation inside a cell is multivalency-the presence of multiple sites that mediate interactions with other proteins. We know little about the variation of multivalency on evolutionary time scales. Here, we investigated the long-term evolution (∼600 million years) of multivalency in fungal mRNA decapping subunit 2 protein (Dcp2), and in the FET (FUS, EWS and TAF15) protein family. We found that multivalency varies substantially among the orthologs of these proteins. However, evolution has maintained the length scale at which sequence motifs that enable protein-protein interactions occur. That is, the total number of such motifs per hundred amino acids is higher and less variable than expected by neutral evolution. To help explain this evolutionary conservation, we developed a conformation classifier using machine-learning algorithms. This classifier demonstrates that disordered segments in Dcp2 and FET proteins tend to adopt compact conformations, which is necessary for phase separation. Thus, the evolutionary conservation we detected may help proteins preserve the ability to undergo phase separation. Altogether, our study reveals that the length scale of multivalent interactions is an evolutionarily conserved feature of two classes of phase-separating proteins in fungi and vertebrates.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich 8057, Switzerland
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02139, USA
- Swiss Institute of Bioinformatics (SIB), Lausanne 1015, Switzerland
| | - Roman Doronin
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne 1015, Switzerland
| | - Andreas Wagner
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich 8057, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne 1015, Switzerland
- The Santa Fe Institute, Santa Fe, NM 87501, USA
- Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa
| |
Collapse
|
26
|
Ruff KM, Pappu RV. AlphaFold and Implications for Intrinsically Disordered Proteins. J Mol Biol 2021; 433:167208. [PMID: 34418423 DOI: 10.1016/j.jmb.2021.167208] [Citation(s) in RCA: 222] [Impact Index Per Article: 74.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 08/11/2021] [Accepted: 08/12/2021] [Indexed: 10/20/2022]
Abstract
Accurate predictions of the three-dimensional structures of proteins from their amino acid sequences have come of age. AlphaFold, a deep learning-based approach to protein structure prediction, shows remarkable success in independent assessments of prediction accuracy. A significant epoch in structural bioinformatics was the structural annotation of over 98% of protein sequences in the human proteome. Interestingly, many predictions feature regions of very low confidence, and these regions largely overlap with intrinsically disordered regions (IDRs). That over 30% of regions within the proteome are disordered is congruent with estimates that have been made over the past two decades, as intense efforts have been undertaken to generalize the structure-function paradigm to include the importance of conformational heterogeneity and dynamics. With structural annotations from AlphaFold in hand, there is the temptation to draw inferences regarding the "structures" of IDRs and their interactomes. Here, we offer a cautionary note regarding the misinterpretations that might ensue and highlight efforts that provide concrete understanding of sequence-ensemble-function relationships of IDRs. This perspective is intended to emphasize the importance of IDRs in sequence-function relationships (SERs) and to highlight how one might go about extracting quantitative SERs to make sense of how IDRs function.
Collapse
Affiliation(s)
- Kiersten M Ruff
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, Campus Box 1097, St. Louis, MO 63130, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, Campus Box 1097, St. Louis, MO 63130, USA.
| |
Collapse
|
27
|
Hsu IS, Strome B, Lash E, Robbins N, Cowen LE, Moses AM. A functionally divergent intrinsically disordered region underlying the conservation of stochastic signaling. PLoS Genet 2021; 17:e1009629. [PMID: 34506483 PMCID: PMC8457507 DOI: 10.1371/journal.pgen.1009629] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 09/22/2021] [Accepted: 08/06/2021] [Indexed: 12/18/2022] Open
Abstract
Stochastic signaling dynamics expand living cells' information processing capabilities. An increasing number of studies report that regulators encode information in their pulsatile dynamics. The evolutionary mechanisms that lead to complex signaling dynamics remain uncharacterized, perhaps because key interactions of signaling proteins are encoded in intrinsically disordered regions (IDRs), whose evolution is difficult to analyze. Here we focused on the IDR that controls the stochastic pulsing dynamics of Crz1, a transcription factor in fungi downstream of the widely conserved calcium signaling pathway. We find that Crz1 IDRs from anciently diverged fungi can all respond transiently to calcium stress; however, only Crz1 IDRs from the Saccharomyces clade support pulsatility, encode extra information, and rescue fitness in competition assays, while the Crz1 IDRs from distantly related fungi do none of the three. On the other hand, we find that Crz1 pulsing is conserved in the distantly related fungi, consistent with the evolutionary model of stabilizing selection on the signaling phenotype. Further, we show that a calcineurin docking site in a specific part of the IDRs appears to be sufficient for pulsing and show evidence for a beneficial increase in the relative calcineurin affinity of this docking site. We propose that evolutionary flexibility of functionally divergent IDRs underlies the conservation of stochastic signaling by stabilizing selection.
Collapse
Affiliation(s)
- Ian S. Hsu
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | - Bob Strome
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
| | - Emma Lash
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
| | - Nicole Robbins
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
| | - Leah E. Cowen
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
| | - Alan M. Moses
- Department of Cell & Systems Biology, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
- * E-mail:
| |
Collapse
|
28
|
Peyro M, Soheilypour M, Nibber VS, Dickson AM, Mofrad MRK. FG nucleoporins feature unique patterns that distinguish them from other IDPs. Biophys J 2021; 120:3382-3391. [PMID: 34237287 PMCID: PMC8391076 DOI: 10.1016/j.bpj.2021.06.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 05/14/2021] [Accepted: 06/16/2021] [Indexed: 02/06/2023] Open
Abstract
FG nucleoporins (FG Nups) are intrinsically disordered proteins and are the putative regulators of nucleocytoplasmic transport. They allow fast, yet selective, transport of molecules through the nuclear pore complex, but the underlying mechanism of nucleocytoplasmic transport is not yet fully discovered. As a result, FG Nups have been the subject of extensive research in the past two decades. Although most studies have been focused on analyzing the conformation and function of FG Nups from a biophysical standpoint, some recent studies have investigated the sequence-function relationship of FG Nups, with a few investigating amino acid sequences of a large number of FG Nups to understand common characteristics that might enable their function. Previously, we identified an evolutionarily conserved feature in FG Nup sequences, which are extended subsequences with low charge density, containing only positive charges, and located toward the N-terminus of FG Nups. We named these patterns longest positive like charge regions (lpLCRs). These patterns are specific to positively charged residues, and negatively charged residues do not demonstrate such a pattern. In this study, we compare FG Nups with other disordered proteins obtained from the DisProt and UniProt database in terms of presence of lpLCRs. Our results show that the lpLCRs are virtually exclusive to FG Nups and are not observed in other disordered proteins. Also, lpLCRs are what differentiate FG Nups from DisProt proteins in terms of charge distribution, meaning that excluding lpLCRs from the sequences of FG Nups make them similar to DisProt proteins in terms of charge distribution. We also previously showed the biophysical effect of lpLCRs in conformation of FG Nups. The results of this study are in line with our previous findings and imply that lpLCRs are virtually exclusive and functionally significant characteristics of FG Nups and nucleocytoplasmic transport.
Collapse
Affiliation(s)
- Mohaddeseh Peyro
- Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California Berkeley
| | - Mohammad Soheilypour
- Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California Berkeley
| | - Vikrum S Nibber
- Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California Berkeley
| | - Andrew M Dickson
- Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California Berkeley
| | - Mohammad R K Mofrad
- Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California Berkeley; Molecular Biophysics and Integrative Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California.
| |
Collapse
|
29
|
Whence Blobs? Phylogenetics of functional protein condensates. Biochem Soc Trans 2021; 48:2151-2158. [PMID: 32985656 DOI: 10.1042/bst20200355] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 09/04/2020] [Accepted: 09/07/2020] [Indexed: 12/15/2022]
Abstract
What do we know about the molecular evolution of functional protein condensation? The capacity of proteins to form biomolecular condensates (compact, protein-rich states, not bound by membranes, but still separated from the rest of the contents of the cell) appears in many cases to be bestowed by weak, transient interactions within one or between proteins. Natural selection is expected to remove or fix amino acid changes, insertions or deletions that preserve and change this condensation capacity when doing so is beneficial to the cell. A few recent studies have begun to explore this frontier of phylogenetics at the intersection of biophysics and cell biology.
Collapse
|
30
|
Jamecna D, Antonny B. Intrinsically disordered protein regions at membrane contact sites. Biochim Biophys Acta Mol Cell Biol Lipids 2021; 1866:159020. [PMID: 34352388 DOI: 10.1016/j.bbalip.2021.159020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/30/2021] [Accepted: 07/01/2021] [Indexed: 12/14/2022]
Abstract
Membrane contact sites (MCS) are regions of close apposition between membrane-bound organelles. Proteins that occupy MCS display various domain organisation. Among them, lipid transfer proteins (LTPs) frequently contain both structured domains as well as regions of intrinsic disorder. In this review, we discuss the various roles of intrinsically disordered protein regions (IDPRs) in LTPs as well as in other proteins that are associated with organelle contact sites. We distinguish the following functions: (i) to act as flexible tethers between two membranes; (ii) to act as entropic barriers to prevent protein crowding and regulate membrane tethering geometry; (iii) to define the action range of catalytic domains. These functions are added to other functions of IDPRs in membrane environments, such as mediating protein-protein and protein-membrane interactions. We suggest that the overall efficiency and fidelity of contact sites might require fine coordination between all these IDPR activities.
Collapse
Affiliation(s)
- Denisa Jamecna
- Université Côte d'Azur et CNRS, Institut de Pharmacologie Moléculaire et Cellulaire, 660 route des lucioles, 06560 Valbonne, France; Biochemistry Center (BZH), Heidelberg, Germany
| | - Bruno Antonny
- Université Côte d'Azur et CNRS, Institut de Pharmacologie Moléculaire et Cellulaire, 660 route des lucioles, 06560 Valbonne, France.
| |
Collapse
|
31
|
Dasmeh P, Wagner A. Natural Selection on the Phase-Separation Properties of FUS during 160 My of Mammalian Evolution. Mol Biol Evol 2021; 38:940-951. [PMID: 33022038 PMCID: PMC7947763 DOI: 10.1093/molbev/msaa258] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Protein phase separation can help explain the formation of many nonmembranous organelles. However, we know little about its ability to change in evolution. Here we studied the evolution of the mammalian RNA-binding protein Fused in Sarcoma (FUS), a protein whose prion-like domain (PLD) contributes to the formation of stress granules through liquid–liquid phase separation. Although the PLD evolves three times as rapidly as the remainder of FUS, it harbors absolutely conserved tyrosine residues that are crucial for phase separation. Ancestral reconstruction shows that the phosphorylation sites within the PLD are subject to stabilizing selection. They toggle among a small number of amino acid states. One exception to this pattern is primates, where the number of such phosphosites has increased through positive selection. In addition, we find frequent glutamine to proline changes that help maintain the unstructured state of FUS that is necessary for phase separation. Our work provides evidence that natural selection has stabilized the liquid forming potential of FUS and minimized the propensity of cytotoxic liquid-to-solid phase transitions during 160 My of mammalian evolution.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.,Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA.,Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Andreas Wagner
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| |
Collapse
|
32
|
Huihui J, Ghosh K. Intrachain interaction topology can identify functionally similar intrinsically disordered proteins. Biophys J 2021; 120:1860-1868. [PMID: 33865811 DOI: 10.1016/j.bpj.2020.11.2282] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 10/17/2020] [Accepted: 11/19/2020] [Indexed: 01/06/2023] Open
Abstract
Functionally similar IDPs (intrinsically disordered proteins) often have little sequence similarity. This is in stark contrast to folded proteins and poses a challenge for the inverse problem, functional classification of IDPs using sequence alignment. The problem is further compounded because of the lack of structure in IDPs, preventing structural alignment as an alternate tool for classification. Recent advances in heteropolymer theory unveiled a powerful set of sequence-patterning metrics bridging molecular interaction with chain conformation. Focusing only on charge patterning, these set of metrics yield a sequence charge decoration matrix (SCDM). SCDMs can potentially identify functionally similar IDPs not apparent from sequence alignment alone. Here, we illustrate how these information-rich "molecular blueprints" encoded in SCDMs can be used for functional classification of IDPs with specific application in three protein families-Ste50, PSC, and RAM-in which electrostatics is known to be important. For both the Ste50 and PSC protein family, the set of metrics appropriately classifies proteins in functional and nonfunctional groups in agreement with experiment. Furthermore, our algorithm groups synthetic variants of the disordered RAM region of the Notch receptor protein-important in gene expression-in reasonable accordance with classification based on experimentally measured binding constants of RAM and transcription factor. Taken together, the novel classification scheme reveals the critical role of a high-dimensional set of metrics-manifest in self-interaction maps and topology-in functional annotation of IDPs even when there is low sequence homology, providing the much-needed alternate to a traditional sequence alignment tool.
Collapse
Affiliation(s)
- Jonathan Huihui
- Department of Physics and Astronomy, University of Denver, Denver, Colorado
| | - Kingshuk Ghosh
- Department of Physics and Astronomy, University of Denver, Denver, Colorado.
| |
Collapse
|
33
|
Zarin T, Strome B, Peng G, Pritišanac I, Forman-Kay JD, Moses AM. Identifying molecular features that are associated with biological function of intrinsically disordered protein regions. eLife 2021; 10:e60220. [PMID: 33616531 PMCID: PMC7932695 DOI: 10.7554/elife.60220] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 02/22/2021] [Indexed: 12/17/2022] Open
Abstract
In previous work, we showed that intrinsically disordered regions (IDRs) of proteins contain sequence-distributed molecular features that are conserved over evolution, despite little sequence similarity that can be detected in alignments (Zarin et al., 2019). Here, we aim to use these molecular features to predict specific biological functions for individual IDRs and identify the molecular features within them that are associated with these functions. We find that the predictable functions are diverse. Examining the associated molecular features, we note some that are consistent with previous reports and identify others that were previously unknown. We experimentally confirm that elevated isoelectric point and hydrophobicity, features that are positively associated with mitochondrial localization, are necessary for mitochondrial targeting function. Remarkably, increasing isoelectric point in a synthetic IDR restores weak mitochondrial targeting. We believe feature analysis represents a new systematic approach to understand how biological functions of IDRs are specified by their protein sequences.
Collapse
Affiliation(s)
- Taraneh Zarin
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
| | - Bob Strome
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
| | - Gang Peng
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
| | - Iva Pritišanac
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
- Program in Molecular Medicine, Hospital for Sick ChildrenTorontoCanada
| | - Julie D Forman-Kay
- Program in Molecular Medicine, Hospital for Sick ChildrenTorontoCanada
- Department of Biochemistry, University of TorontoTorontoCanada
| | - Alan M Moses
- Department of Cell and Systems Biology, University of TorontoTorontoCanada
| |
Collapse
|
34
|
Yang X, Yan J, Zhang Z, Lin T, Xin T, Wang B, Wang S, Zhao J, Zhang Z, Lucas WJ, Li G, Huang S. Regulation of plant architecture by a new histone acetyltransferase targeting gene bodies. NATURE PLANTS 2020; 6:809-822. [PMID: 32665652 DOI: 10.1038/s41477-020-0715-2] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 05/29/2020] [Indexed: 05/10/2023]
Abstract
Axillary meristem development determines both plant architecture and crop yield; this critical process is regulated by the PROLIFERATING CELL FACTORS (TCP) family of transcription factors. Although TCP proteins bind primarily to promoter regions, some also target gene bodies for expression activation. However, the underlying regulatory mechanism remains unknown. Here we show that TEN, a TCP from cucumber (Cucumis sativus L.), controls the identity and mobility of tendrils. Through its C terminus, TEN binds at intragenic enhancers of target genes; its N-terminal domain functions as a non-canonical histone acetyltransferase (HAT) to preferentially act on lysine 56 and 122 of the histone H3 globular domain. This HAT activity is responsible for chromatin loosening and host-gene activation. The N termini of all tested CYCLOIDEA and TEOSINTE BRANCHED 1-like TCP proteins contain an intrinsically disordered region; despite their sequence divergence, they have conserved HAT activity. This study identifies a non-canonical class of HATs and provides a mechanism by which modification at the H3 globular domain is integrated with the transcription process.
Collapse
Affiliation(s)
- Xueyong Yang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jianbin Yan
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhen Zhang
- College of Horticulture, Northwest A&F University, Yangling, China
- State Key Laboratory of Crop Stress Adaptation and Improvement, School of Life Sciences, Henan University, Kaifeng, China
| | - Tao Lin
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Tongxu Xin
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Bowen Wang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Shenhao Wang
- College of Horticulture, Northwest A&F University, Yangling, China
| | - Jicheng Zhao
- University of Chinese Academy of Sciences, National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Zhonghua Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - William J Lucas
- Department of Plant Biology, College of Biological Sciences, University of California, Davis, CA, USA
| | - Guohong Li
- University of Chinese Academy of Sciences, National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Sanwen Huang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| |
Collapse
|
35
|
Iserman C, Desroches Altamirano C, Jegers C, Friedrich U, Zarin T, Fritsch AW, Mittasch M, Domingues A, Hersemann L, Jahnel M, Richter D, Guenther UP, Hentze MW, Moses AM, Hyman AA, Kramer G, Kreysing M, Franzmann TM, Alberti S. Condensation of Ded1p Promotes a Translational Switch from Housekeeping to Stress Protein Production. Cell 2020; 181:818-831.e19. [PMID: 32359423 PMCID: PMC7237889 DOI: 10.1016/j.cell.2020.04.009] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Revised: 11/16/2019] [Accepted: 04/06/2020] [Indexed: 11/24/2022]
Abstract
Cells sense elevated temperatures and mount an adaptive heat shock response that involves changes in gene expression, but the underlying mechanisms, particularly on the level of translation, remain unknown. Here we report that, in budding yeast, the essential translation initiation factor Ded1p undergoes heat-induced phase separation into gel-like condensates. Using ribosome profiling and an in vitro translation assay, we reveal that condensate formation inactivates Ded1p and represses translation of housekeeping mRNAs while promoting translation of stress mRNAs. Testing a variant of Ded1p with altered phase behavior as well as Ded1p homologs from diverse species, we demonstrate that Ded1p condensation is adaptive and fine-tuned to the maximum growth temperature of the respective organism. We conclude that Ded1p condensation is an integral part of an extended heat shock response that selectively represses translation of housekeeping mRNAs to promote survival under conditions of severe heat stress.
Collapse
Affiliation(s)
- Christiane Iserman
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany
| | - Christine Desroches Altamirano
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; BIOTEC and CMCB, Technische Universität Dresden, Tatzberg 47/48, 01307 Dresden, Germany
| | - Ceciel Jegers
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany
| | - Ulrike Friedrich
- Center for Molecular Biology of the University of Heidelberg, German Cancer Research Center, DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany
| | - Taraneh Zarin
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S 3G5, Canada
| | - Anatol W Fritsch
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Matthäus Mittasch
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Antonio Domingues
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Lena Hersemann
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Marcus Jahnel
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; BIOTEC and CMCB, Technische Universität Dresden, Tatzberg 47/48, 01307 Dresden, Germany
| | - Doris Richter
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; BIOTEC and CMCB, Technische Universität Dresden, Tatzberg 47/48, 01307 Dresden, Germany
| | - Ulf-Peter Guenther
- DKMS Life Science Lab GmbH, St. Petersburger Str. 2, 01069 Dresden, Germany
| | - Matthias W Hentze
- EMBL Heidelberg, Director's Research Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Alan M Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S 3G5, Canada; Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada
| | - Anthony A Hyman
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany
| | - Günter Kramer
- Center for Molecular Biology of the University of Heidelberg, German Cancer Research Center, DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany
| | - Moritz Kreysing
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Titus M Franzmann
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; BIOTEC and CMCB, Technische Universität Dresden, Tatzberg 47/48, 01307 Dresden, Germany
| | - Simon Alberti
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307 Dresden, Germany; BIOTEC and CMCB, Technische Universität Dresden, Tatzberg 47/48, 01307 Dresden, Germany.
| |
Collapse
|
36
|
Ginell GM, Holehouse AS. Analyzing the Sequences of Intrinsically Disordered Regions with CIDER and localCIDER. Methods Mol Biol 2020; 2141:103-126. [PMID: 32696354 DOI: 10.1007/978-1-0716-0524-0_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Intrinsically disordered proteins and protein regions are ubiquitous across eukaryotic proteomes where they play a range of functional roles. Unlike folded proteins, IDRs lack a well-defined native state but exist in heterogeneous ensembles of conformations. In the absence of a defined native state, structure-guided mutations to test specific mechanistic hypotheses are generally not possible. Despite this, the use of mutations to alter sequence properties has become a relatively common approach for teasing out the relationship between sequence, ensemble, and function. A key step in designing informative mutants is the ability to identify specific sequence features that may reveal an interpretable response if perturbed. Here, we provide guidance on using the CIDER and localCIDER tools for amino acid sequence analysis, with a focus on building intuition with respect to the most commonly described features.
Collapse
Affiliation(s)
- Garrett M Ginell
- Graduate Program in Biochemistry, Biophysics, and Structural Biology, Division of Biological and Biomedical Sciences, Washington University in St. Louis, St. Louis, MO, USA.,Center for the Science and Engineering of Living Systems, Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, USA
| | - Alex S Holehouse
- Center for the Science and Engineering of Living Systems, Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, USA. .,Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
37
|
In-Silico Evaluation of a New Gene From Wheat Reveals the Divergent Evolution of the CAP160 Homologous Genes Into Monocots. J Mol Evol 2019; 88:151-163. [PMID: 31820048 DOI: 10.1007/s00239-019-09920-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 11/19/2019] [Indexed: 10/25/2022]
Abstract
This study reports the evolutionary history and in-silico functional characterization of a novel water-deficit and ABA-responsive gene in wheat. This gene has remote sequence similarity to known abiotic stress-related genes in different plants, including CAP160 in Spinacia oleracea, RD29B in Arabidopsis thaliana, and CDeT11-24 in Craterostigma plantagineum. The study investigated if these genes form a close homologous relationship or if they are a result of convergent evolutionary processes. The results indicated a closely shared homologous relationship between these genes. Bayesian phylogenetic analysis of the protein sequences of the remotely related CAP160 proteins from various plant species indicated the presence of three distinct clades. Further analyses indicated that CAP160 homologous genes have predominantly evolved through neutral processes, with multiple regions experiencing signatures of purifying selection, while others were indicated to be the result of episodic diversifying selection events. Functional predictions revealed that these genes might share at least two functions related to abiotic stress conditions: one similar to the cryoprotective function of LEA protein, and the other a signalling molecule with phosphatidic acid binding specificity. Studies focused on the identification of cold-responsive genes are essential for the development of cold-tolerant crop plants, if we are to increase agricultural productivity throughout temperate regions.
Collapse
|
38
|
Evolutionary Forces and Codon Bias in Different Flavors of Intrinsic Disorder in the Human Proteome. J Mol Evol 2019; 88:164-178. [DOI: 10.1007/s00239-019-09921-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 11/26/2019] [Indexed: 12/22/2022]
|
39
|
Powers SK, Holehouse AS, Korasick DA, Schreiber KH, Clark NM, Jing H, Emenecker R, Han S, Tycksen E, Hwang I, Sozzani R, Jez JM, Pappu RV, Strader LC. Nucleo-cytoplasmic Partitioning of ARF Proteins Controls Auxin Responses in Arabidopsis thaliana. Mol Cell 2019; 76:177-190.e5. [PMID: 31421981 DOI: 10.1016/j.molcel.2019.06.044] [Citation(s) in RCA: 132] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 06/06/2019] [Accepted: 06/26/2019] [Indexed: 12/14/2022]
Abstract
The phytohormone auxin plays crucial roles in nearly every aspect of plant growth and development. The auxin response factor (ARF) transcription factor family regulates auxin-responsive gene expression and exhibits nuclear localization in regions of high auxin responsiveness. Here we show that the ARF7 and ARF19 proteins accumulate in micron-sized assemblies within the cytoplasm of tissues with attenuated auxin responsiveness. We found that the intrinsically disordered middle region and the folded PB1 interaction domain of ARFs drive protein assembly formation. Mutation of a single lysine within the PB1 domain abrogates cytoplasmic assemblies, promotes ARF nuclear localization, and results in an altered transcriptome and morphological defects. Our data suggest a model in which ARF nucleo-cytoplasmic partitioning regulates auxin responsiveness, providing a mechanism for cellular competence for auxin signaling.
Collapse
Affiliation(s)
- Samantha K Powers
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Alex S Holehouse
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - David A Korasick
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Katherine H Schreiber
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA; Center for Engineering MechanoBiology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Natalie M Clark
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Hongwei Jing
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA; Center for Engineering MechanoBiology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Ryan Emenecker
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Soeun Han
- Department of Life Sciences, Pohang University of Science and Technology, Pohang 37673, Korea
| | - Eric Tycksen
- Genome Technology Access Center, Department of Genetics, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Ildoo Hwang
- Department of Life Sciences, Pohang University of Science and Technology, Pohang 37673, Korea
| | - Rosangela Sozzani
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Joseph M Jez
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA; Center for Science and Engineering of Living Systems (CSELS), Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Lucia C Strader
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA; Center for Engineering MechanoBiology, Washington University in St. Louis, St. Louis, MO 63130, USA; Center for Science and Engineering of Living Systems (CSELS), Washington University in St. Louis, St. Louis, MO 63130, USA.
| |
Collapse
|
40
|
Entropy and Information within Intrinsically Disordered Protein Regions. ENTROPY 2019; 21:e21070662. [PMID: 33267376 PMCID: PMC7515160 DOI: 10.3390/e21070662] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 06/27/2019] [Accepted: 07/01/2019] [Indexed: 02/06/2023]
Abstract
Bioinformatics and biophysical studies of intrinsically disordered proteins and regions (IDRs) note the high entropy at individual sequence positions and in conformations sampled in solution. This prevents application of the canonical sequence-structure-function paradigm to IDRs and motivates the development of new methods to extract information from IDR sequences. We argue that the information in IDR sequences cannot be fully revealed through positional conservation, which largely measures stable structural contacts and interaction motifs. Instead, considerations of evolutionary conservation of molecular features can reveal the full extent of information in IDRs. Experimental quantification of the large conformational entropy of IDRs is challenging but can be approximated through the extent of conformational sampling measured by a combination of NMR spectroscopy and lower-resolution structural biology techniques, which can be further interpreted with simulations. Conformational entropy and other biophysical features can be modulated by post-translational modifications that provide functional advantages to IDRs by tuning their energy landscapes and enabling a variety of functional interactions and modes of regulation. The diverse mosaic of functional states of IDRs and their conformational features within complexes demands novel metrics of information, which will reflect the complicated sequence-conformational ensemble-function relationship of IDRs.
Collapse
|
41
|
Zarin T, Strome B, Nguyen Ba AN, Alberti S, Forman-Kay JD, Moses AM. Proteome-wide signatures of function in highly diverged intrinsically disordered regions. eLife 2019; 8:46883. [PMID: 31264965 PMCID: PMC6634968 DOI: 10.7554/elife.46883] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Accepted: 07/01/2019] [Indexed: 12/24/2022] Open
Abstract
Intrinsically disordered regions make up a large part of the proteome, but the sequence-to-function relationship in these regions is poorly understood, in part because the primary amino acid sequences of these regions are poorly conserved in alignments. Here we use an evolutionary approach to detect molecular features that are preserved in the amino acid sequences of orthologous intrinsically disordered regions. We find that most disordered regions contain multiple molecular features that are preserved, and we define these as ‘evolutionary signatures’ of disordered regions. We demonstrate that intrinsically disordered regions with similar evolutionary signatures can rescue function in vivo, and that groups of intrinsically disordered regions with similar evolutionary signatures are strongly enriched for functional annotations and phenotypes. We propose that evolutionary signatures can be used to predict function for many disordered regions from their amino acid sequences.
Collapse
Affiliation(s)
- Taraneh Zarin
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
| | - Bob Strome
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
| | - Alex N Nguyen Ba
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
| | - Simon Alberti
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.,Center for Molecular and Cellular Bioengineering, Biotechnology Center, Technische Universität Dresden, Dresden, Germany
| | - Julie D Forman-Kay
- Program in Molecular Medicine, Hospital for Sick Children, Toronto, Canada.,Department of Biochemistry, University of Toronto, Toronto, Canada
| | - Alan M Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada.,Department of Computer Science, University of Toronto, Toronto, Canada
| |
Collapse
|
42
|
A Noisy Analog-to-Digital Converter Connects Cytosolic Calcium Bursts to Transcription Factor Nuclear Localization Pulses in Yeast. G3-GENES GENOMES GENETICS 2019; 9:561-570. [PMID: 30573469 PMCID: PMC6385971 DOI: 10.1534/g3.118.200841] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Several examples of transcription factors that show stochastic, unsynchronized pulses of nuclear localization have been described. Here we show that under constant calcium stress, nuclear localization pulses of the transcription factor Crz1 follow stochastic variations in cytosolic calcium concentration. We find that the size of the stochastic calcium bursts is positively correlated with the number of subsequent Crz1 pulses. Based on our observations, we propose a simple stochastic model of how the signaling pathway converts a constant external calcium concentration into a digital number of Crz1 pulses in the nucleus, due to the time delay from nuclear transport and the stochastic decoherence of individual Crz1 molecule dynamics. We find support for several additional predictions of the model and suggest that stochastic input to nuclear transport may produce noisy digital responses to analog signals in other signaling systems.
Collapse
|
43
|
Das S, Amin AN, Lin YH, Chan HS. Coarse-grained residue-based models of disordered protein condensates: utility and limitations of simple charge pattern parameters. Phys Chem Chem Phys 2018; 20:28558-28574. [PMID: 30397688 DOI: 10.1039/c8cp05095c] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Biomolecular condensates undergirded by phase separations of proteins and nucleic acids serve crucial biological functions. To gain physical insights into their genetic basis, we study how liquid-liquid phase separation (LLPS) of intrinsically disordered proteins (IDPs) depends on their sequence charge patterns using a continuum Langevin chain model wherein each amino acid residue is represented by a single bead. Charge patterns are characterized by the "blockiness" measure κ and the "sequence charge decoration" (SCD) parameter. Consistent with random phase approximation (RPA) theory and lattice simulations, LLPS propensity as characterized by critical temperature Tcr* increases with increasingly negative SCD for a set of sequences showing a positive correlation between κ and -SCD. Relative to RPA, the simulated sequence-dependent variation in Tcr* is often-though not always-smaller, whereas the simulated critical volume fractions are higher. However, for a set of sequences exhibiting an anti-correlation between κ and -SCD, the simulated Tcr*'s are quite insensitive to either parameter. Additionally, we find that blocky sequences that allow for strong electrostatic repulsion can lead to coexistence curves with upward concavity as stipulated by RPA, but the LLPS propensity of a strictly alternating charge sequence was likely overestimated by RPA and lattice models because interchain stabilization of this sequence requires spatial alignments that are difficult to achieve in real space. These results help delineate the utility and limitations of the charge pattern parameters and of RPA, pointing to further efforts necessary for rationalizing the newly observed subtleties.
Collapse
Affiliation(s)
- Suman Das
- Department of Biochemistry, University of Toronto, Medical Sciences Building - 5th Fl., 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada.
| | | | | | | |
Collapse
|
44
|
Mittal A, Holehouse AS, Cohan MC, Pappu RV. Sequence-to-Conformation Relationships of Disordered Regions Tethered to Folded Domains of Proteins. J Mol Biol 2018; 430:2403-2421. [DOI: 10.1016/j.jmb.2018.05.012] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2018] [Revised: 04/16/2018] [Accepted: 05/07/2018] [Indexed: 12/20/2022]
|
45
|
|
46
|
Evolution, dynamics and dysregulation of kinase signalling. Curr Opin Struct Biol 2018; 48:133-140. [DOI: 10.1016/j.sbi.2017.12.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Revised: 12/19/2017] [Accepted: 12/21/2017] [Indexed: 12/31/2022]
|