1
Castellane TCL, Fernandes CC, Pinheiro DG, Lemos MVF, Varani AM. Exploratory comparative transcriptomic analysis reveals potential gene targets associated with Cry1A.105 and Cry2Ab2 resistance in fall armyworm (Spodoptera frugiperda). Funct Integr Genomics 2024; 24:129. [PMID: 39039331 DOI: 10.1007/s10142-024-01408-w] [Received: 04/19/2024] [Revised: 07/05/2024] [Accepted: 07/15/2024] [Indexed: 07/24/2024]
Abstract
Genetically modified (GM) crops expressing Bacillus thuringiensis (Bt) insecticidal toxins have substantially transformed agriculture. Despite rapid adoption, their environmental and economic benefits face scrutiny due to unsustainable agricultural practices and the emergence of resistant pests like Spodoptera frugiperda, known as the fall armyworm (FAW). FAW's adaptation to Bt technology in corn and cotton compromises the long-term efficacy of Bt crops. To advance the understanding of the genetic foundations of resistance mechanisms, we conducted an exploratory comparative transcriptomic analysis of two divergent FAW populations. One population exhibited practical resistance to the Bt insecticidal proteins Cry1A.105 and Cry2Ab2, expressed in the genetically engineered MON-89Ø34-3 maize, while the other population remained susceptible to these proteins. Differential expression analysis supported that Cry1A.105 and Cry2Ab2 significantly affect FAW physiology. A total of 247 and 254 differentially expressed genes were identified in the Cry-resistant and susceptible populations, respectively. By integrating our findings with established literature and databases, we underscored 53 gene targets potentially involved in FAW's resistance to Cry1A.105 and Cry2Ab2. In particular, we considered and discussed the potential roles of the differentially expressed genes encoding ABC transporters, G protein-coupled receptors, the P450 enzymatic system, and other Bt-related detoxification genes. Based on these findings, we emphasize the importance of exploratory transcriptomic analyses to uncover potential gene targets involved in resistance to Bt insecticidal proteins, and to support the advantages of GM crops in the face of emerging challenges.
Affiliation(s)
- Tereza Cristina L Castellane
  - Departamento de Biologia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP), Rod. Prof. Paulo Donato Castellane km 5, Jaboticabal, CEP 14884-900, SP, Brasil
- Camila C Fernandes
  - Instituto de Pesquisa em Bioenergia, Laboratório Multiusuário de Sequenciamento em Larga Escala e Expressão Gênica, IPBEN, 14884-900, Jaboticabal, SP, Brasil
- Daniel G Pinheiro
  - Departamento de Biotecnologia Agropecuária e Ambiental, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP), Rod. Prof. Paulo Donato Castellane km 5, Jaboticabal, CEP 14884-900, SP, Brasil
- Manoel Victor Franco Lemos
  - Departamento de Biologia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP), Rod. Prof. Paulo Donato Castellane km 5, Jaboticabal, CEP 14884-900, SP, Brasil
  - Instituto de Pesquisa em Bioenergia, Laboratório Multiusuário de Sequenciamento em Larga Escala e Expressão Gênica, IPBEN, 14884-900, Jaboticabal, SP, Brasil
- Alessandro M Varani
  - Departamento de Biotecnologia Agropecuária e Ambiental, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP), Rod. Prof. Paulo Donato Castellane km 5, Jaboticabal, CEP 14884-900, SP, Brasil
2
Hummel NFC, Markel K, Stefani J, Staller MV, Shih PM. Systematic identification of transcriptional activation domains from non-transcription factor proteins in plants and yeast. Cell Syst 2024; 15:662-672.e4. [PMID: 38866009 DOI: 10.1016/j.cels.2024.05.007] [Received: 11/07/2023] [Revised: 04/26/2024] [Accepted: 05/22/2024] [Indexed: 06/14/2024]
Abstract
Transcription factors can promote gene expression through activation domains. Whole-genome screens have systematically mapped activation domains in transcription factors but not in non-transcription factor proteins (e.g., chromatin regulators and coactivators). To fill this knowledge gap, we employed the activation domain predictor PADDLE to analyze the proteomes of Arabidopsis thaliana and Saccharomyces cerevisiae. We screened 18,000 predicted activation domains from >800 non-transcription factor genes in both species, confirming that 89% of candidate proteins contain active fragments. Our work enables the annotation of hundreds of nuclear proteins as putative coactivators, many of which have never been ascribed any function in plants. Analysis of peptide sequence compositions reveals how the distribution of key amino acids dictates activity. Finally, we validated short, "universal" activation domains with comparable performance to state-of-the-art activation domains used for genome engineering. Our approach enables the genome-wide discovery and annotation of activation domains that can function across diverse eukaryotes.
Affiliation(s)
- Niklas F C Hummel
  - Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Department of Biology, Technische Universität Darmstadt, 64287 Darmstadt, Germany
- Kasey Markel
  - Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Jordan Stefani
  - Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA
- Max V Staller
  - Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, CA 94720, USA; Chan Zuckerberg Biohub-San Francisco, San Francisco, CA 9415, USA
- Patrick M Shih
  - Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, CA 94720, USA
3
Morffy N, Van den Broeck L, Miller C, Emenecker RJ, Bryant JA, Lee TM, Sageman-Furnas K, Wilkinson EG, Pathak S, Kotha SR, Lam A, Mahatma S, Pande V, Waoo A, Wright RC, Holehouse AS, Staller MV, Sozzani R, Strader LC. Identification of plant transcriptional activation domains. Nature 2024:10.1038/s41586-024-07707-3. [PMID: 39020176 DOI: 10.1038/s41586-024-07707-3] [Received: 06/26/2023] [Accepted: 06/12/2024] [Indexed: 07/19/2024]
Abstract
Gene expression in Arabidopsis is regulated by more than 1,900 transcription factors (TFs), which have been identified genome-wide by the presence of well-conserved DNA-binding domains. Activator TFs contain activation domains (ADs) that recruit coactivator complexes; however, for nearly all Arabidopsis TFs, we lack knowledge about the presence, location and transcriptional strength of their ADs. To address this gap, here we use a yeast library approach to experimentally identify Arabidopsis ADs on a proteome-wide scale, and find that more than half of the Arabidopsis TFs contain an AD. We annotate 1,553 ADs, the vast majority of which are, to our knowledge, previously unknown. Using the dataset generated, we develop a neural network to accurately predict ADs and to identify sequence features that are necessary to recruit coactivator complexes. We uncover six distinct combinations of sequence features that result in activation activity, providing a framework to interrogate the subfunctionalization of ADs. Furthermore, we identify ADs in the ancient AUXIN RESPONSE FACTOR family of TFs, revealing that AD positioning is conserved in distinct clades. Our findings provide a deep resource for understanding transcriptional activation, a framework for examining function in intrinsically disordered regions and a predictive model of ADs.
Affiliation(s)
- Lisa Van den Broeck
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
- Caelan Miller
  - Department of Biology, Duke University, Durham, NC, USA
- Ryan J Emenecker
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
  - Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- John A Bryant
  - Biological Systems Engineering, Virginia Tech, Blacksburg, VA, USA
- Tyler M Lee
  - Department of Biology, Duke University, Durham, NC, USA
- Sunita Pathak
  - Department of Biology, Duke University, Durham, NC, USA
- Sanjana R Kotha
  - Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
- Angelica Lam
  - Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
- Saloni Mahatma
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
- Vikram Pande
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
- Aman Waoo
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
- R Clay Wright
  - Biological Systems Engineering, Virginia Tech, Blacksburg, VA, USA
- Alex S Holehouse
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
  - Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Max V Staller
  - Center for Computational Biology, University of California, Berkeley, Berkeley, CA, USA
- Rosangela Sozzani
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
4
Baer MH, Cascarina SM, Paul KR, Ross ED. Rational Tuning of the Concentration-independent Enrichment of Prion-like Domains in Stress Granules. J Mol Biol 2024; 436:168703. [PMID: 39004265 DOI: 10.1016/j.jmb.2024.168703] [Received: 05/06/2024] [Revised: 06/27/2024] [Accepted: 07/09/2024] [Indexed: 07/16/2024]
Abstract
Stress granules (SGs) are large ribonucleoprotein assemblies that form in response to acute stress in eukaryotes. SG formation is thought to be initiated by liquid-liquid phase separation (LLPS) of key proteins and RNA. These molecules serve as a scaffold for recruitment of client molecules. LLPS of scaffold proteins in vitro is highly concentration-dependent, yet biomolecular condensates in vivo contain hundreds of unique proteins, most of which are thought to be clients rather than scaffolds. Many proteins that localize to SGs contain low-complexity, prion-like domains (PrLDs) that have been implicated in LLPS and SG recruitment. The degree of enrichment of proteins in biomolecular condensates such as SGs can vary widely, but the underlying basis for these differences is not fully understood. Here, we develop a toolkit of model PrLDs to examine the factors that govern efficiency of PrLD recruitment to stress granules. Recruitment was highly sensitive to amino acid composition: enrichment in SGs could be tuned through subtle changes in hydrophobicity. By contrast, SG recruitment was largely insensitive to PrLD concentration at both a population level and single-cell level. These observations point to a model wherein PrLDs are enriched in SGs through either simple solvation effects or interactions that are effectively non-saturable even at high expression levels.
Affiliation(s)
- Matthew H Baer
  - Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA
- Sean M Cascarina
  - Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA
- Kacy R Paul
  - Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA
- Eric D Ross
  - Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA
5
Manriquez-Sandoval E, Brewer J, Lule G, Lopez S, Fried SD. FLiPPR: A Processor for Limited Proteolysis (LiP) Mass Spectrometry Data Sets Built on FragPipe. J Proteome Res 2024; 23:2332-2342. [PMID: 38787630 DOI: 10.1021/acs.jproteome.3c00887] [Indexed: 05/26/2024]
Abstract
Here, we present FLiPPR, or FragPipe LiP (limited proteolysis) Processor, a tool that facilitates the analysis of data from limited proteolysis mass spectrometry (LiP-MS) experiments following primary search and quantification in FragPipe. LiP-MS has emerged as a method that can provide proteome-wide information on protein structure and has been applied to a range of biological and biophysical questions. Although LiP-MS can be carried out with standard laboratory reagents and mass spectrometers, analyzing the data can be slow and poses unique challenges compared to typical quantitative proteomics workflows. To address this, we leverage FragPipe and then process its output in FLiPPR. FLiPPR formalizes a specific data imputation heuristic that carefully uses missing data in LiP-MS experiments to report on the most significant structural changes. Moreover, FLiPPR introduces a data merging scheme and a protein-centric multiple hypothesis correction scheme, enabling processed LiP-MS data sets to be more robust and less redundant. These improvements strengthen statistical trends when previously published data are reanalyzed with the FragPipe/FLiPPR workflow. We hope that FLiPPR will lower the barrier for more users to adopt LiP-MS, standardize statistical procedures for LiP-MS data analysis, and systematize output to facilitate eventual larger-scale integration of LiP-MS data.
Affiliation(s)
- Edgar Manriquez-Sandoval
  - Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
  - T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Joy Brewer
  - Department of Chemistry and Biochemistry, Old Dominion University, Norfolk, Virginia 23529, United States
- Gabriela Lule
  - Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Samanta Lopez
  - Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Stephen D Fried
  - Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
  - T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, United States
6
Naderi J, Magalhaes AP, Kibar G, Stik G, Zhang Y, Mackowiak SD, Wieler HM, Rossi F, Buschow R, Christou-Kent M, Alcoverro-Bertran M, Graf T, Vingron M, Hnisz D. An activity-specificity trade-off encoded in human transcription factors. Nat Cell Biol 2024:10.1038/s41556-024-01411-0. [PMID: 38969762 DOI: 10.1038/s41556-024-01411-0] [Received: 01/27/2023] [Accepted: 03/20/2024] [Indexed: 07/07/2024]
Abstract
Transcription factors (TFs) control specificity and activity of gene transcription, but whether a relationship between these two features exists is unclear. Here we provide evidence for an evolutionary trade-off between the activity and specificity in human TFs encoded as submaximal dispersion of aromatic residues in their intrinsically disordered protein regions. We identified approximately 500 human TFs that encode short periodic blocks of aromatic residues in their intrinsically disordered regions, resembling imperfect prion-like sequences. Mutation of periodic aromatic residues reduced transcriptional activity, whereas increasing the aromatic dispersion of multiple human TFs enhanced transcriptional activity and reprogramming efficiency, promoted liquid-liquid phase separation in vitro and more promiscuous DNA binding in cells. Together with recent work on enhancer elements, these results suggest an important evolutionary role of suboptimal features in transcriptional control. We propose that rational engineering of amino acid features that alter phase separation may be a strategy to optimize TF-dependent processes, including cellular reprogramming.
Affiliation(s)
- Julian Naderi
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
  - Institute of Chemistry and Biochemistry, Department of Biology, Chemistry and Pharmacy, Freie Universität Berlin, Berlin, Germany
- Alexandre P Magalhaes
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Gözde Kibar
  - Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Gregoire Stik
  - Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Josep Carreras Leukaemia Research Institute, Badalona, Spain
- Yaotian Zhang
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Sebastian D Mackowiak
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Hannah M Wieler
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Francesca Rossi
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Rene Buschow
  - Microscopy Core Facility, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Marie Christou-Kent
  - Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Marc Alcoverro-Bertran
  - Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
- Thomas Graf
  - Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Martin Vingron
  - Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Denes Hnisz
  - Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
7
D’Alessandro S, Velay F, Lebrun R, Zafirov D, Mehrez M, Romand S, Saadouni R, Forzani C, Citerne S, Montané MH, Robaglia C, Menand B, Meyer C, Field B. Posttranslational regulation of photosynthetic activity via the TOR kinase in plants. Science Advances 2024; 10:eadj3268. [PMID: 38896607 PMCID: PMC11186500 DOI: 10.1126/sciadv.adj3268] [Received: 06/20/2023] [Accepted: 05/13/2024] [Indexed: 06/21/2024]
Abstract
Chloroplasts are the powerhouse of the plant cell, and their activity must be matched to plant growth to avoid photooxidative damage. We have identified a posttranslational mechanism linking the eukaryotic target of rapamycin (TOR) kinase that promotes growth and the guanosine tetraphosphate (ppGpp) signaling pathway of prokaryotic origins that regulates chloroplast activity and photosynthesis in particular. We find that RelA SpoT homolog 3 (RSH3), a nuclear-encoded enzyme responsible for ppGpp biosynthesis, interacts directly with the TOR complex via a plant-specific amino-terminal region which is phosphorylated in a TOR-dependent manner. Down-regulating TOR activity causes a rapid increase in ppGpp synthesis in RSH3 overexpressors and reduces photosynthetic capacity in an RSH-dependent manner in wild-type plants. The TOR-RSH3 signaling axis therefore regulates the equilibrium between chloroplast activity and plant growth, setting a precedent for the regulation of organellar function by TOR.
Affiliation(s)
- Stefano D’Alessandro
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
  - Università di Torino, Dipartimento di Scienze della vita e Biologia dei Sistemi, 10135 Torino, Italy
- Florent Velay
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
- Régine Lebrun
  - Aix Marseille Univ, CNRS, Plate-forme Protéomique, Marseille Protéomique (MaP), IMM FR 3479, 31 Chemin Joseph Aiguier, 13009 Marseille, France
- Delyan Zafirov
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
- Marwa Mehrez
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
  - Faculty of Sciences of Tunis, University of Tunis El Manar, 2092 Tunis, Tunisia
- Shanna Romand
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
- Rim Saadouni
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
  - Aix Marseille Univ, CNRS, Plate-forme Protéomique, Marseille Protéomique (MaP), IMM FR 3479, 31 Chemin Joseph Aiguier, 13009 Marseille, France
- Céline Forzani
  - Institut Jean-Pierre Bourgin, INRAE, AgroParisTech, CNRS, Université Paris-Saclay, 78000 Versailles, France
- Sylvie Citerne
  - Institut Jean-Pierre Bourgin, INRAE, AgroParisTech, CNRS, Université Paris-Saclay, 78000 Versailles, France
- Benoît Menand
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
- Christian Meyer
  - Institut Jean-Pierre Bourgin, INRAE, AgroParisTech, CNRS, Université Paris-Saclay, 78000 Versailles, France
- Ben Field
  - Aix Marseille Univ, CEA, CNRS, BIAM, LGBP Team, 13009 Marseille, France
8
Santos JR, Park J. MATR3's Role beyond the Nuclear Matrix: From Gene Regulation to Its Implications in Amyotrophic Lateral Sclerosis and Other Diseases. Cells 2024; 13:980. [PMID: 38891112 PMCID: PMC11171696 DOI: 10.3390/cells13110980] [Received: 05/13/2024] [Revised: 05/31/2024] [Accepted: 06/02/2024] [Indexed: 06/21/2024] Open
Abstract
Matrin-3 (MATR3) was initially discovered as a component of the nuclear matrix about thirty years ago. Since then, accumulating studies have provided evidence that MATR3 not only plays a structural role in the nucleus, but that it is also an active protein involved in regulating gene expression at multiple levels, including chromatin organization, DNA transcription, RNA metabolism, and protein translation in the nucleus and cytoplasm. Furthermore, MATR3 may play a critical role in various cellular processes, including DNA damage response, cell proliferation, differentiation, and survival. In addition to the revelation of its biological role, recent studies have reported MATR3's involvement in the context of various diseases, including neurodegenerative and neurodevelopmental diseases, as well as cancer. Moreover, sequencing studies of patients revealed a handful of disease-associated mutations in MATR3 linked to amyotrophic lateral sclerosis (ALS), which further elevated the gene's importance as a topic of study. In this review, we synthesize the current knowledge regarding the diverse functions of MATR3 in DNA- and RNA-related processes, as well as its involvement in various diseases, with a particular emphasis on ALS.
Affiliation(s)
- Jhune Rizsan Santos
  - Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada
  - Genetics and Genome Biology Program, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Jeehye Park
  - Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada
  - Genetics and Genome Biology Program, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
9
Chen J, Li Q, Xia S, Arsala D, Sosa D, Wang D, Long M. The Rapid Evolution of De Novo Proteins in Structure and Complex. Genome Biol Evol 2024; 16:evae107. [PMID: 38753069 PMCID: PMC11149777 DOI: 10.1093/gbe/evae107] [Accepted: 05/10/2024] [Indexed: 06/06/2024] Open
Abstract
Recent genome-wide studies in rice have established that de novo genes, evolving from noncoding sequences, enhance protein diversity through a stepwise process. However, the pattern and rate of their evolution in protein structure over time remain unclear. Here, we addressed these issues within a surprisingly short evolutionary timescale (<1 million years for 97% of Oryza de novo genes) with comparative approaches to gene duplicates. We found that de novo genes evolve faster than gene duplicates in the intrinsically disordered regions (such as random coils), secondary structure elements (such as α helix and β strand), hydrophobicity, and molecular recognition features. In de novo proteins, specifically, we observed an 8% to 14% decay in random coils and intrinsically disordered region lengths and a 2.3% to 6.5% increase in structured elements, hydrophobicity, and molecular recognition features, per million years on average. These patterns of structural evolution align with changes in amino acid composition over time as well. We also revealed higher positive charges but smaller molecular weights for de novo proteins than duplicates. Tertiary structure predictions showed that most de novo proteins, though not typically well folded on their own, readily form low-energy and compact complexes with other proteins facilitated by extensive residue contacts and conformational flexibility, suggesting a faster-binding scenario in de novo proteins to promote interaction. These analyses illuminate a rapid evolution of protein structure in de novo genes in rice genomes, originating from noncoding sequences, highlighting their quick transformation into active, protein complex-forming components within a remarkably short evolutionary timeframe.
Affiliation(s)
- Jianhai Chen
  - Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
- Qingrong Li
  - Division of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
  - Department of Cellular & Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Shengqian Xia
  - Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
- Deanna Arsala
  - Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
- Dylan Sosa
  - Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
- Dong Wang
  - Division of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
  - Department of Cellular & Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Manyuan Long
  - Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
10
Ginell GM, Emenecker RJ, Lotthammer JM, Usher ET, Holehouse AS. Direct prediction of intermolecular interactions driven by disordered regions. bioRxiv 2024:2024.06.03.597104. [PMID: 38895487 PMCID: PMC11185574 DOI: 10.1101/2024.06.03.597104] [Indexed: 06/21/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for a wide variety of cellular functions, many of which involve interactions with partner proteins. Molecular recognition is typically considered through the lens of sequence-specific binding events. However, a growing body of work has shown that IDRs often interact with partners in a manner that does not depend on the precise order of the amino acids, instead driven by complementary chemical interactions leading to disordered bound-state complexes. Despite this emerging paradigm, we lack tools to describe, quantify, predict, and interpret these types of structurally heterogeneous interactions from the underlying amino acid sequences. Here, we repurpose the chemical physics developed originally for molecular simulations to develop an approach for predicting intermolecular interactions between IDRs and partner proteins. Our approach enables the direct prediction of phase diagrams, the identification of chemically-specific interaction hotspots on IDRs, and a route to develop and test mechanistic hypotheses regarding IDR function in the context of molecular recognition. We use our approach to examine a range of systems and questions to highlight its versatility and applicability.
Affiliation(s)
- Garrett M. Ginell
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
  - Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
- Ryan J. Emenecker
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
  - Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
- Jeffrey M. Lotthammer
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
  - Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
- Emery T. Usher
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
  - Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
- Alex S. Holehouse
  - Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
  - Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
11
Alston JJ, Soranno A, Holehouse AS. Conserved molecular recognition by an intrinsically disordered region in the absence of sequence conservation. Research Square 2024:rs.3.rs-4477977. [PMID: 38883712 PMCID: PMC11177979 DOI: 10.21203/rs.3.rs-4477977/v1] [Indexed: 06/18/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for cellular function yet often appear to lack sequence conservation when assessed by multiple sequence alignments. This raises the question of if and how function can be encoded and preserved in these regions despite massive sequence variation. To address this question, we have applied coarse-grained molecular dynamics simulations to investigate non-specific RNA binding of coronavirus nucleocapsid proteins. Coronavirus nucleocapsid proteins consist of multiple interspersed disordered and folded domains that bind RNA. Here, we focus on the first two domains of coronavirus nucleocapsid proteins: the disordered N-terminal domain (NTD) and the folded RNA binding domain (RBD). While the NTD is highly variable across evolution, the RBD is structurally conserved. This combination makes the NTD-RBD a convenient model system for exploring the interplay between an IDR adjacent to a folded domain and how changes in IDR sequence can influence molecular recognition of a partner. Our results reveal a surprising degree of sequence-specificity encoded by both the composition and the precise order of the amino acids in the NTD. The presence of an NTD can - depending on the sequence - either suppress or enhance RNA binding. Despite this sensitivity, large-scale variation in NTD sequences is possible while certain sequence features are retained. Consequently, a conformationally-conserved dynamic and disordered RNA:protein complex is found across nucleocapsid protein orthologs despite large-scale changes in both NTD sequence and RBD surface chemistry. Taken together, these insights shed light on the ability of disordered regions to preserve functional characteristics despite their sequence variability.
Affiliation(s)
- Jhullian J. Alston
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Present Address, Program In Cellular and Molecular Medicine (PCMM), Boston Children’s Hospital, Boston, MA, USA
| | - Andrea Soranno
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
|
12
|
Wang L, Wen Z, Liu SW, Zhang L, Finley C, Lee HJ, Fan HJS. Overview of AlphaFold2 and breakthroughs in overcoming its limitations. Comput Biol Med 2024; 176:108620. [PMID: 38761500 DOI: 10.1016/j.compbiomed.2024.108620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/29/2023] [Revised: 05/01/2024] [Accepted: 05/14/2024] [Indexed: 05/20/2024]
Abstract
Predicting three-dimensional (3D) protein structures has been challenging for decades. The emergence of AlphaFold2 (AF2), a deep learning-based machine learning method developed by DeepMind, became a game changer in the protein folding community. AF2 can predict a protein's three-dimensional structure with high confidence based on its amino acid sequence. Accurate prediction of protein structures can dramatically accelerate our understanding of biological mechanisms and provide a solid foundation for reliable drug design. Although AF2 breaks through the barriers in predicting protein structures, much room remains for further study. This review provides a brief historical overview of the development of protein structure prediction, covering template-based, template-free, and machine learning-based methods. In addition to reviewing the potential benefits (Pros) and considerations (Cons) of using AF2, this review summarizes the diverse applications, including protein structure predictions, dynamic changes, point mutation, integration of language model and experimental data, protein complex, and protein-peptide interaction. It underscores recent advancements in efficiency, reliability, and broad application of AF2. This comprehensive review offers valuable insights into the applications of AF2 and AF2-inspired AI methods in structural biology and its potential for clinically significant drug target discovery.
Affiliation(s)
- Lei Wang
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Zehua Wen
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Shi-Wei Liu
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Lihong Zhang
- Digestive Department, Binhai New Area Hospital of TCM Tianjin, Tianjin, 300451, China
| | - Cierra Finley
- Department of Natural Sciences, Southwest Tennessee Community College, Memphis, TN, 38015, USA
| | - Ho-Jin Lee
- Department of Natural Sciences, Southwest Tennessee Community College, Memphis, TN, 38015, USA; Division of Natural & Mathematical Sciences, LeMoyne-Owen College, Memphis, TN, 38126, USA.
| | - Hua-Jun Shawn Fan
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China.
|
13
|
Cubuk J, Greenberg L, Greenberg AE, Emenecker RJ, Stuchell-Brereton MD, Holehouse AS, Soranno A, Greenberg MJ. Structural dynamics of the intrinsically disordered linker region of cardiac troponin T. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.30.596451. [PMID: 38853835 PMCID: PMC11160775 DOI: 10.1101/2024.05.30.596451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2024]
Abstract
The cardiac troponin complex, composed of troponins I, T, and C, plays a central role in regulating the calcium-dependent interactions between myosin and the thin filament. Mutations in troponin can cause cardiomyopathies; however, it is still a major challenge for the field to connect how changes in sequence affect troponin's function. Recent high-resolution structures of the thin filament revealed critical insights into the structure-function relationship of the troponin complex, but there remain large, unresolved segments of troponin, including the troponin-T linker region that is a hotspot for several cardiomyopathy mutations. This unresolved yet functionally-significant linker region has been proposed to be intrinsically disordered, with behaviors that are not well described by traditional structural approaches; however, this proposal has not been experimentally verified. Here, we used a combination of single-molecule Förster resonance energy transfer (FRET), molecular dynamics simulations, and functional reconstitution assays to investigate the troponin-T linker region. We experimentally and computationally show that in the context of both isolated troponin and the fully regulated troponin complex, the linker behaves as a dynamic, intrinsically disordered region. This region undergoes polyampholyte expansion in the presence of high salt and distinct conformational changes during the assembly of the troponin complex. We also examine the ΔE160 hypertrophic cardiomyopathy mutation in the linker, and we demonstrate that this mutation does not affect the conformational dynamics of the linker, rather it allosterically affects interactions with other subunits of the troponin complex, leading to increased molecular contractility. Taken together, our data clearly demonstrate the importance of disorder within the troponin-T linker and provide new insights into the molecular mechanisms controlling the pathogenesis of cardiomyopathies.
Affiliation(s)
- Jasmine Cubuk
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St Louis, 1 Brookings Drive, 63130, Saint Louis, MO, USA
| | - Lina Greenberg
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
| | - Akiva E. Greenberg
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
| | - Ryan J. Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St Louis, 1 Brookings Drive, 63130, Saint Louis, MO, USA
| | - Melissa D. Stuchell-Brereton
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St Louis, 1 Brookings Drive, 63130, Saint Louis, MO, USA
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St Louis, 1 Brookings Drive, 63130, Saint Louis, MO, USA
| | - Andrea Soranno
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St Louis, 1 Brookings Drive, 63130, Saint Louis, MO, USA
| | - Michael J. Greenberg
- Department of Biochemistry and Molecular Biophysics, Washington University in St Louis, 660 Euclid Ave, 63110, Saint Louis, MO, USA
|
14
|
Wu Z, Pope SD, Ahmed NS, Leung DL, Hajjar S, Yue Q, Anand DM, Kopp EB, Okin D, Ma W, Kagan JC, Hargreaves DC, Medzhitov R, Zhou X. Control of Inflammatory Response by Tissue Microenvironment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.10.592432. [PMID: 38798655 PMCID: PMC11118380 DOI: 10.1101/2024.05.10.592432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]
Abstract
Inflammation is an essential defense response but operates at the cost of normal functions. Whether and how the negative impact of inflammation is monitored remains largely unknown. Acidification of the tissue microenvironment is associated with inflammation. Here we investigated whether macrophages sense tissue acidification to adjust inflammatory responses. We found that acidic pH restructured the inflammatory response of macrophages in a gene-specific manner. We identified mammalian BRD4 as a novel intracellular pH sensor. Acidic pH disrupts the transcription condensates containing BRD4 and MED1, via histidine-enriched intrinsically disordered regions. Crucially, decrease in macrophage intracellular pH is necessary and sufficient to regulate transcriptional condensates in vitro and in vivo, acting as negative feedback to regulate the inflammatory response. Collectively, these findings uncovered a pH-dependent switch in transcriptional condensates that enables environmental sensing to directly control inflammation, with a broader implication for calibrating the magnitude and quality of inflammation by the inflammatory cost.
Affiliation(s)
- Zhongyang Wu
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Scott D. Pope
- Department of Immunobiology, Yale University School of Medicine, New Haven, Connecticut 06510, USA
- Howard Hughes Medical Institute, Yale University School of Medicine, New Haven, Connecticut 06510, USA
| | - Nasiha S. Ahmed
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Diana L. Leung
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Stephanie Hajjar
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Qiuyu Yue
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
- Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, School of Life Sciences, Peking University, Beijing 100871, China
| | - Diya M. Anand
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elizabeth B. Kopp
- Department of Immunobiology, Yale University School of Medicine, New Haven, Connecticut 06510, USA
| | - Daniel Okin
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, Massachusetts, 02115
| | - Weiyi Ma
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Jonathan C. Kagan
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Diana C. Hargreaves
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Ruslan Medzhitov
- Department of Immunobiology, Yale University School of Medicine, New Haven, Connecticut 06510, USA
- Howard Hughes Medical Institute, Yale University School of Medicine, New Haven, Connecticut 06510, USA
- Tananbaum Center for Theoretical and Analytical Human Biology, Yale University School of Medicine
| | - Xu Zhou
- Division of Gastroenterology, Hepatology and Nutrition, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
|
15
|
DeHaro-Arbona FJ, Roussos C, Baloul S, Townson J, Gómez Lamarca MJ, Bray S. Dynamic modes of Notch transcription hubs conferring memory and stochastic activation revealed by live imaging the co-activator Mastermind. eLife 2024; 12:RP92083. [PMID: 38727722 PMCID: PMC11087053 DOI: 10.7554/elife.92083] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 05/12/2024] Open
Abstract
Developmental programming involves the accurate conversion of signalling levels and dynamics to transcriptional outputs. The transcriptional relay in the Notch pathway relies on nuclear complexes containing the co-activator Mastermind (Mam). By tracking these complexes in real time, we reveal that they promote the formation of a dynamic transcription hub in Notch ON nuclei which concentrates key factors including the Mediator CDK module. The composition of the hub is labile and persists after Notch withdrawal conferring a memory that enables rapid reformation. Surprisingly, only a third of Notch ON hubs progress to a state with nascent transcription, which correlates with polymerase II and core Mediator recruitment. This probability is increased by a second signal. The discovery that target-gene transcription is probabilistic has far-reaching implications because it implies that stochastic differences in Notch pathway output can arise downstream of receptor activation.
Affiliation(s)
- F Javier DeHaro-Arbona
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Charalambos Roussos
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Sarah Baloul
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Jonathan Townson
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - María J Gómez Lamarca
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Universitario Virgen del Rocío/CSIC/Universidad de Sevilla, Departamento de Biología Celular, Seville, Spain
| | - Sarah Bray
- Department of Physiology Development and Neuroscience, University of Cambridge, Cambridge, United Kingdom
|
16
|
Adiji OA, McConnell BS, Parker MW. The origin recognition complex requires chromatin tethering by a hypervariable intrinsically disordered region that is functionally conserved from sponge to man. Nucleic Acids Res 2024; 52:4344-4360. [PMID: 38381902 PMCID: PMC11077064 DOI: 10.1093/nar/gkae122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2023] [Revised: 01/26/2024] [Accepted: 02/08/2024] [Indexed: 02/23/2024] Open
Abstract
The first step toward eukaryotic genome duplication is loading of the replicative helicase onto chromatin. This 'licensing' step initiates with the recruitment of the origin recognition complex (ORC) to chromatin, which is thought to occur via ORC's ATP-dependent DNA binding and encirclement activity. However, we have previously shown that ATP binding is dispensable for the chromatin recruitment of fly ORC, raising the question of how metazoan ORC binds chromosomes. We show here that the intrinsically disordered region (IDR) of fly Orc1 is both necessary and sufficient for recruitment of ORC to chromosomes in vivo and demonstrate that this is regulated by IDR phosphorylation. Consistently, we find that the IDR confers the ORC holocomplex with ATP-independent DNA binding activity in vitro. Using phylogenetic analysis, we make the surprising observation that metazoan Orc1 IDRs have diverged so markedly that they are unrecognizable as orthologs and yet we find that these compositionally homologous sequences are functionally conserved. Altogether, these data suggest that chromatin is recalcitrant to ORC's ATP-dependent DNA binding activity, necessitating IDR-dependent chromatin tethering, which we propose poises ORC to opportunistically encircle nucleosome-free regions as they become available.
Affiliation(s)
- Olubu A Adiji
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75235, USA
| | - Brendan S McConnell
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75235, USA
| | - Matthew W Parker
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75235, USA
|
17
|
Bohn L, Huang J, Weidig S, Yang Z, Heidersberger C, Genty B, Falter-Braun P, Christmann A, Grill E. The temperature sensor TWA1 is required for thermotolerance in Arabidopsis. Nature 2024; 629:1126-1132. [PMID: 38750356 PMCID: PMC11136664 DOI: 10.1038/s41586-024-07424-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/26/2023] [Accepted: 04/15/2024] [Indexed: 05/31/2024]
Abstract
Plants exposed to incidences of excessive temperatures activate heat-stress responses to cope with the physiological challenge and stimulate long-term acclimation [1,2]. The mechanism that senses cellular temperature for inducing thermotolerance is still unclear [3]. Here we show that TWA1 is a temperature-sensing transcriptional co-regulator that is needed for basal and acquired thermotolerance in Arabidopsis thaliana. At elevated temperatures, TWA1 changes its conformation and allows physical interaction with JASMONATE-ASSOCIATED MYC-LIKE (JAM) transcription factors and TOPLESS (TPL) and TOPLESS-RELATED (TPR) proteins for repressor complex assembly. TWA1 is a predicted intrinsically disordered protein that has a key thermosensory role functioning through an amino-terminal highly variable region. At elevated temperatures, TWA1 accumulates in nuclear subdomains, and physical interactions with JAM2 and TPL appear to be restricted to these nuclear subdomains. The transcriptional upregulation of the heat shock transcription factor A2 (HSFA2) and heat shock proteins depended on TWA1, and TWA1 orthologues provided different temperature thresholds, consistent with the sensor function in early signalling of heat stress. The identification of the plant thermosensors offers a molecular tool for adjusting thermal acclimation responses of crops by breeding and biotechnology, and a sensitive temperature switch for thermogenetics.
Affiliation(s)
- Lisa Bohn
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany
| | - Jin Huang
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany
- Chengdu Newsun Crop Science, Chengdu, China
| | - Susan Weidig
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany
| | - Zhenyu Yang
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany
| | - Christoph Heidersberger
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany
| | - Bernard Genty
- Aix-Marseille University, Commissariat à l'Energie Atomique (CEA), Centre National de la Recherche Scientifique (CNRS), Institut de Biosciences et Biotechnologies Aix-Marseille, Saint-Paul-lez-Durance, France
| | - Pascal Falter-Braun
- Institute of Network Biology (INET), Molecular Targets and Therapeutics Center (MTTC), Helmholtz Center Munich, German Research Center for Environmental Health, Munich, Germany
- Microbe-Host Interactions, Faculty of Biology, Ludwig-Maximilians-Universität (LMU) München, Munich, Germany
| | - Alexander Christmann
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany.
| | - Erwin Grill
- Chair of Botany, TUM School of Life Sciences Weihenstephan, Technische Universität München (TUM), Freising, Germany.
|
18
|
Jankowski MS, Griffith D, Shastry DG, Pelham JF, Ginell GM, Thomas J, Karande P, Holehouse AS, Hurley JM. Disordered clock protein interactions and charge blocks turn an hourglass into a persistent circadian oscillator. Nat Commun 2024; 15:3523. [PMID: 38664421 PMCID: PMC11045787 DOI: 10.1038/s41467-024-47761-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Accepted: 04/11/2024] [Indexed: 04/28/2024] Open
Abstract
Organismal physiology is widely regulated by the molecular circadian clock, a feedback loop composed of protein complexes whose members are enriched in intrinsically disordered regions. These regions can mediate protein-protein interactions via SLiMs, but the contribution of these disordered regions to clock protein interactions had not been elucidated. To determine the functionality of these disordered regions, we applied a synthetic peptide microarray approach to the disordered clock protein FRQ in Neurospora crassa. We identified residues required for FRQ's interaction with its partner protein FRH, the mutation of which demonstrated FRH is necessary for persistent clock oscillations but not repression of transcriptional activity. Additionally, the microarray demonstrated an enrichment of FRH binding to FRQ peptides with a net positive charge. We found that positively charged residues occurred in significant "blocks" within the amino acid sequence of FRQ and that ablation of one of these blocks affected both core clock timing and physiological clock output. Finally, we found positive charge clusters were a commonly shared molecular feature in repressive circadian clock proteins. Overall, our study suggests a mechanistic purpose for positive charge blocks and yielded insights into repressive arm protein roles in clock function.
Affiliation(s)
- Meaghan S Jankowski
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Divya G Shastry
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Jacqueline F Pelham
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Garrett M Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Joshua Thomas
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Pankaj Karande
- Department of Chemical and Biological Engineering, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
- Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Jennifer M Hurley
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
- Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
|
19
|
Mukherjee A, Fallacaro S, Ratchasanmuang P, Zinski J, Boka A, Shankta K, Mir M. A fine kinetic balance of interactions directs transcription factor hubs to genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.16.589811. [PMID: 38659757 PMCID: PMC11042322 DOI: 10.1101/2024.04.16.589811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/26/2024]
Abstract
Eukaryotic gene regulation relies on the binding of sequence-specific transcription factors (TFs). TFs bind chromatin transiently yet occupy their target sites by forming high-local concentration microenvironments (hubs and condensates) that increase the frequency of binding events. Despite their ubiquity, such microenvironments have been difficult to study in endogenous contexts due to technical limitations. Here, we overcome these limitations and investigate how hubs drive TF occupancy at their targets. Using a DNA binding perturbation to a hub-forming TF, Zelda, in Drosophila embryos, we find that hub properties, including the stability and frequencies of associations to targets, are key determinants of TF occupancy. Our data suggest that the targeting of these hubs is driven not just by specific DNA motif recognition, but also by a fine-tuned kinetic balance of interactions between TFs and their co-binding partners.
Affiliation(s)
- Apratim Mukherjee
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Samantha Fallacaro
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Developmental, Stem Cell, and Regenerative Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - Puttachai Ratchasanmuang
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Howard Hughes Medical Institute, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Joseph Zinski
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Alan Boka
- Biochemistry and Molecular Biophysics Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kareena Shankta
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Roy and Diana Vagelos Program in Life Sciences and Management, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Mustafa Mir
- Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Howard Hughes Medical Institute, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104
|
20
|
Sanchez‐Martinez S, Nguyen K, Biswas S, Nicholson V, Romanyuk AV, Ramirez J, Kc S, Akter A, Childs C, Meese EK, Usher ET, Ginell GM, Yu F, Gollub E, Malferrari M, Francia F, Venturoli G, Martin EW, Caporaletti F, Giubertoni G, Woutersen S, Sukenik S, Woolfson DN, Holehouse AS, Boothby TC. Labile assembly of a tardigrade protein induces biostasis. Protein Sci 2024; 33:e4941. [PMID: 38501490 PMCID: PMC10949331 DOI: 10.1002/pro.4941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Revised: 02/01/2024] [Accepted: 02/09/2024] [Indexed: 03/20/2024]
Abstract
Tardigrades are microscopic animals that survive desiccation by inducing biostasis. To survive drying tardigrades rely on intrinsically disordered CAHS proteins, which also function to prevent perturbations induced by drying in vitro and in heterologous systems. CAHS proteins have been shown to form gels both in vitro and in vivo, which has been speculated to be linked to their protective capacity. However, the sequence features and mechanisms underlying gel formation and the necessity of gelation for protection have not been demonstrated. Here we report a mechanism of fibrillization and gelation for CAHS D similar to that of intermediate filament assembly. We show that in vitro, gelation restricts molecular motion, immobilizing and protecting labile material from the harmful effects of drying. In vivo, we observe that CAHS D forms fibrillar networks during osmotic stress. Fibrillar networking of CAHS D improves survival of osmotically shocked cells. We observe two emergent properties associated with fibrillization; (i) prevention of cell volume change and (ii) reduction of metabolic activity during osmotic shock. We find that there is no significant correlation between maintenance of cell volume and survival, while there is a significant correlation between reduced metabolism and survival. Importantly, CAHS D's fibrillar network formation is reversible and metabolic rates return to control levels after CAHS fibers are resolved. This work provides insights into how tardigrades induce reversible biostasis through the self-assembly of labile CAHS gels.
Affiliation(s)
- K. Nguyen
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- S. Biswas
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- V. Nicholson
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- A. V. Romanyuk
- School of Chemistry, University of Bristol, Bristol, UK
- Max Planck‐Bristol Centre for Minimal Biology, University of Bristol, Bristol, UK
- J. Ramirez
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- S. Kc
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- A. Akter
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- C. Childs
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- E. K. Meese
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- E. T. Usher
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, Missouri, USA
- G. M. Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, Missouri, USA
- F. Yu
- Quantitative Systems Biology Program, University of California Merced, Merced, California, USA
- E. Gollub
- Department of Chemistry and Biochemistry, University of California Merced, Merced, California, USA
- M. Malferrari
- Dipartimento di Chimica “Giacomo Ciamician”, Università di Bologna, Bologna, Italy
- F. Francia
- Laboratorio di Biochimica e Biofisica Molecolare, Dipartimento di Farmacia e Biotecnologie, FaBiT, Università di Bologna, Bologna, Italy
- G. Venturoli
- Laboratorio di Biochimica e Biofisica Molecolare, Dipartimento di Farmacia e Biotecnologie, FaBiT, Università di Bologna, Bologna, Italy
- Consorzio Nazionale Interuniversitario per le Scienze Fisiche della Materia (CNISM), c/o Dipartimento di Fisica e Astronomia (DIFA), Università di Bologna, Bologna, Italy
- E. W. Martin
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
- F. Caporaletti
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
- G. Giubertoni
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
- S. Woutersen
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, Amsterdam, The Netherlands
- S. Sukenik
- Quantitative Systems Biology Program, University of California Merced, Merced, California, USA
- Department of Chemistry and Biochemistry, University of California Merced, Merced, California, USA
- D. N. Woolfson
- School of Chemistry, University of Bristol, Bristol, UK
- Max Planck‐Bristol Centre for Minimal Biology, University of Bristol, Bristol, UK
- School of Biochemistry, University of Bristol, Biomedical Sciences Building, Bristol, UK
- A. S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, Missouri, USA
- T. C. Boothby
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
|
21
|
Shepherdson JL, Granas DM, Li J, Shariff Z, Plassmeyer SP, Holehouse AS, White MA, Cohen BA. Mutational scanning of CRX classifies clinical variants and reveals biochemical properties of the transcriptional effector domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.21.585809. [PMID: 38585983 PMCID: PMC10996540 DOI: 10.1101/2024.03.21.585809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/09/2024]
Abstract
Cone-Rod Homeobox, encoded by CRX, is a transcription factor (TF) essential for the terminal differentiation and maintenance of mammalian photoreceptors. Structurally, CRX comprises an ordered DNA-binding homeodomain and an intrinsically disordered transcriptional effector domain. Although a handful of human variants in CRX have been shown to cause several different degenerative retinopathies with varying cone and rod predominance, as with most human disease genes the vast majority of observed CRX genetic variants are uncharacterized variants of uncertain significance (VUS). We performed a deep mutational scan (DMS) of nearly all possible single amino acid substitution variants in CRX, using an engineered cell-based transcriptional reporter assay. We measured the ability of each CRX missense variant to transactivate a synthetic fluorescent reporter construct in a pooled fluorescence-activated cell sorting assay and compared the activation strength of each variant to that of wild-type CRX to compute an activity score, identifying thousands of variants with altered transcriptional activity. We calculated a statistical confidence for each activity score derived from multiple independent measurements of each variant marked by unique sequence barcodes, curating a high-confidence list of nearly 2,000 variants with significantly altered transcriptional activity compared to wild-type CRX. We evaluated the performance of the DMS assay as a clinical variant classification tool using gold-standard classified human variants from ClinVar, and determined that activity scores could be used to identify pathogenic variants with high specificity. That this performance could be achieved using a synthetic reporter assay in a foreign cell type, even for a highly cell type-specific TF like CRX, suggests that this approach shows promise for DMS of other TFs that function in cell types that are not easily accessible. 
Per-position average activity scores closely aligned to a predicted structure of the ordered homeodomain and demonstrated position-specific residue requirements. The intrinsically disordered transcriptional effector domain, by contrast, displayed a qualitatively different pattern of substitution effects, following compositional constraints without specific residue position requirements in the peptide chain. The observed compositional constraints of the effector domain were consistent with the acidic exposure model of transcriptional activation. Together, the results of the CRX DMS identify molecular features of the CRX effector domain and demonstrate clinical utility for variant classification.
Affiliation(s)
- James L. Shepherdson
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- David M. Granas
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Jie Li
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Zara Shariff
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Stephen P. Plassmeyer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Michael A. White
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Barak A. Cohen
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
|
22
|
Pandi B, Brenman S, Black A, Ng DCM, Lau E, Lam MPY. Tissue Usage Preference and Intrinsically Disordered Region Remodeling of Alternative Splicing Derived Proteoforms in the Heart. J Proteome Res 2024. [PMID: 38456420 DOI: 10.1021/acs.jproteome.3c00789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/09/2024]
Abstract
A computational analysis of mass spectrometry data was performed to uncover alternative splicing derived protein variants across chambers of the human heart. Evidence for 216 non-canonical isoforms was apparent in the atrium and the ventricle, including 52 isoforms not documented on SwissProt and recovered using an RNA sequencing derived database. Among non-canonical isoforms, 29 show signs of regulation based on statistically significant preferences in tissue usage, including a ventricular enriched protein isoform of tensin-1 (TNS1) and an atrium-enriched PDZ and LIM Domain 3 (PDLIM3) isoform 2 (PDLIM3-2/ALP-H). Examined variant regions that differ between alternative and canonical isoforms are highly enriched with intrinsically disordered regions. Moreover, over two-thirds of such regions are predicted to function in protein binding and RNA binding. The analysis here lends further credence to the notion that alternative splicing diversifies the proteome by rewiring intrinsically disordered regions, which are increasingly recognized to play important roles in the generation of biological function from protein sequences.
|
23
|
Lotthammer JM, Ginell GM, Griffith D, Emenecker RJ, Holehouse AS. Direct prediction of intrinsically disordered protein conformational properties from sequence. Nat Methods 2024; 21:465-476. [PMID: 38297184 PMCID: PMC10927563 DOI: 10.1038/s41592-023-02159-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Received: 05/28/2023] [Accepted: 12/20/2023] [Indexed: 02/02/2024]
Abstract
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well described by a stable three-dimensional structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means that IDRs are largely absent from the Protein Data Bank, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations and deep learning to develop ALBATROSS, a deep-learning model for predicting ensemble dimensions of IDRs, including the radius of gyration, end-to-end distance, polymer-scaling exponent and ensemble asphericity, directly from sequences at a proteome-wide scale. ALBATROSS is lightweight, easy to use and accessible as both a locally installable software package and a point-and-click-style interface via Google Colab notebooks. We first demonstrate the applicability of our predictors by examining the generalizability of sequence-ensemble relationships in IDRs. Then, we leverage the high-throughput nature of ALBATROSS to characterize the sequence-specific biophysical behavior of IDRs within and between proteomes.
Affiliation(s)
- Jeffrey M Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Garrett M Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Ryan J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA.
|
24
|
KC S, Nguyen K, Nicholson V, Walgren A, Trent T, Gollub E, Romero S, Holehouse AS, Sukenik S, Boothby TC. Disordered proteins interact with the chemical environment to tune their protective function during drying. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.28.582506. [PMID: 38464187 PMCID: PMC10925285 DOI: 10.1101/2024.02.28.582506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/12/2024]
Abstract
The conformational ensemble and function of intrinsically disordered proteins (IDPs) are sensitive to their solution environment. The inherent malleability of disordered proteins combined with the exposure of their residues accounts for this sensitivity. One context in which IDPs play important roles that is concomitant with massive changes to the intracellular environment is during desiccation (extreme drying). The ability of organisms to survive desiccation has long been linked to the accumulation of high levels of cosolutes such as trehalose or sucrose as well as the enrichment of IDPs, such as late embryogenesis abundant (LEA) proteins or cytoplasmic abundant heat soluble (CAHS) proteins. Despite knowing that IDPs play important roles and are co-enriched alongside endogenous, species-specific cosolutes during desiccation, little is known mechanistically about how IDP-cosolute interactions influence desiccation tolerance. Here, we test the notion that the protective function of desiccation-related IDPs is enhanced through conformational changes induced by endogenous cosolutes. We find that desiccation-related IDPs derived from four different organisms spanning two LEA protein families and the CAHS protein family synergize best with endogenous cosolutes during drying to promote desiccation protection. Yet the structural parameters of protective IDPs do not correlate with synergy for either CAHS or LEA proteins. We further demonstrate that for CAHS, but not LEA proteins, synergy is related to self-assembly and the formation of a gel. Our results demonstrate that functional synergy between IDPs and endogenous cosolutes is a convergent desiccation protection strategy seen among different IDP families and organisms, yet the mechanisms underlying this synergy differ between IDP families.
Affiliation(s)
- Shraddha KC
- University of Wyoming, Department of Molecular Biology. Laramie, WY
- Kenny Nguyen
- University of Wyoming, Department of Molecular Biology. Laramie, WY
- Annie Walgren
- University of Wyoming, Department of Molecular Biology. Laramie, WY
- Tony Trent
- University of Wyoming, Department of Molecular Biology. Laramie, WY
- Edith Gollub
- Department of Chemistry and Biochemistry, University of California Merced, Merced, CA, USA
- Sofia Romero
- Department of Chemistry and Biochemistry, University of California Merced, Merced, CA, USA
- Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
- Shahar Sukenik
- Department of Chemistry and Biochemistry, University of California Merced, Merced, CA, USA
|
25
|
Holehouse AS, Kragelund BB. The molecular basis for cellular function of intrinsically disordered protein regions. Nat Rev Mol Cell Biol 2024; 25:187-211. [PMID: 37957331 DOI: 10.1038/s41580-023-00673-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 42.0] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Intrinsically disordered protein regions exist in a collection of dynamic interconverting conformations that lack a stable 3D structure. These regions are structurally heterogeneous, ubiquitous and found across all kingdoms of life. Despite the absence of a defined 3D structure, disordered regions are essential for cellular processes ranging from transcriptional control and cell signalling to subcellular organization. Through their conformational malleability and adaptability, disordered regions extend the repertoire of macromolecular interactions and are readily tunable by their structural and chemical context, making them ideal responders to regulatory cues. Recent work has led to major advances in understanding the link between protein sequence and conformational behaviour in disordered regions, yet the link between sequence and molecular function is less well defined. Here we consider the biochemical and biophysical foundations that underlie how and why disordered regions can engage in productive cellular functions, provide examples of emerging concepts and discuss how protein disorder contributes to intracellular information processing and regulation of cellular function.
Affiliation(s)
- Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St Louis, St Louis, MO, USA.
- Birthe B Kragelund
- REPIN, Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
|
26
|
Gullulu O, Ozcelik E, Tuzlakoglu Ozturk M, Karagoz MS, Tazebay UH. A multi-faceted approach to unravel coding and non-coding gene fusions and target chimeric proteins in ataxia. J Biomol Struct Dyn 2024:1-21. [PMID: 38411012 DOI: 10.1080/07391102.2024.2321510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/12/2023] [Accepted: 02/15/2024] [Indexed: 02/28/2024]
Abstract
Ataxia represents a heterogeneous group of neurodegenerative disorders characterized by a loss of balance and coordination, often resulting from mutations in genes vital for cerebellar function and maintenance. Recent advances in genomics have identified gene fusion events as critical contributors to various cancers and neurodegenerative diseases. However, their role in ataxia pathogenesis remains largely unexplored. Our study delved into this possibility by analyzing RNA sequencing data from 1443 diverse samples, including cell and mouse models, patient samples, and healthy controls. We identified 7067 novel gene fusions, potentially pivotal in disease onset. These fusions, notably in-frame, could produce chimeric proteins, disrupt gene regulation, or introduce new functions. We observed conservation of specific amino acids at fusion breakpoints and identified potential aggregate formations in fusion proteins, known to contribute to ataxia. Through AI-based protein structure prediction, we identified topological changes in three high-confidence fusion proteins (TEN1-ACOX1, PEX14-NMNAT1, and ITPR1-GRID2), which could potentially alter their functions. Subsequent virtual drug screening identified several molecules and peptides with high-affinity binding to fusion sites. Molecular dynamics simulations confirmed the stability of these protein-ligand complexes at fusion breakpoints. Additionally, we explored the role of non-coding RNA fusions as miRNA sponges. One such fusion, RP11-547P4-FLJ33910, showed strong interaction with hsa-miR-504-5p, potentially acting as its sponge. This interaction correlated with the upregulation of hsa-miR-504-5p target genes, some previously linked to ataxia. In conclusion, our study unveils new aspects of gene fusions in ataxia, suggesting their significant role in pathogenesis and opening avenues for targeted therapeutic interventions. Communicated by Ramaswamy H. Sarma.
Affiliation(s)
- Omer Gullulu
- Department of Structural Biology, St Jude Children's Research Hospital, Memphis, TN, USA
- Emrah Ozcelik
- Department of Molecular Biology and Genetics, Gebze Technical University, Gebze, Kocaeli, Turkey
- Central Research Laboratory (GTU-MAR), Gebze Technical University, Gebze, Kocaeli, Turkey
- Merve Tuzlakoglu Ozturk
- Department of Molecular Biology and Genetics, Gebze Technical University, Gebze, Kocaeli, Turkey
- Central Research Laboratory (GTU-MAR), Gebze Technical University, Gebze, Kocaeli, Turkey
- Mustafa Safa Karagoz
- Institut für Mikrobiologie, Technische Universität Braunschweig, Braunschweig, Germany
- Biochemistry and Biophysics Center, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
- Uygar Halis Tazebay
- Department of Molecular Biology and Genetics, Gebze Technical University, Gebze, Kocaeli, Turkey
- Central Research Laboratory (GTU-MAR), Gebze Technical University, Gebze, Kocaeli, Turkey
|
27
|
Zavrtanik U, Medved T, Purič S, Vranken W, Lah J, Hadži S. Leucine Motifs Stabilize Residual Helical Structure in Disordered Proteins. J Mol Biol 2024; 436:168444. [PMID: 38218366 DOI: 10.1016/j.jmb.2024.168444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/03/2023] [Revised: 12/31/2023] [Accepted: 01/09/2024] [Indexed: 01/15/2024]
Abstract
Many examples are known of regions of intrinsically disordered proteins that fold into α-helices upon binding to their targets. These helical binding motifs (HBMs) can be partially helical also in the unbound state, and this so-called residual structure can affect binding affinity and kinetics. To investigate the underlying mechanisms governing the formation of residual helical structure, we assembled a dataset of experimental helix contents of 65 peptides containing HBMs that fold upon binding. The average residual helicity is 17% and increases to 60% upon target binding. The helix contents of residual and target-bound structures do not correlate; however, the relative location of helix elements in both states shows a strong overlap. Compared to the general disordered regions, HBMs are enriched in amino acids with high helix preference and these residues are typically involved in target binding, explaining the overlap in helix positions. In particular, we find that leucine residues and leucine motifs in HBMs are the major contributors to helix stabilization and target binding. For the two model peptides, we show that substitution of leucine motifs with other hydrophobic residues (valine or isoleucine) leads to reduction of residual helicity, supporting the role of leucine as helix stabilizer. Of the three hydrophobic residues, only leucine can efficiently stabilize residual helical structure. We suggest that the high occurrence of leucine motifs and a general preference for leucine at binding interfaces in HBMs can be explained by its unique ability to stabilize helical elements.
Affiliation(s)
- Uroš Zavrtanik
- Department of Physical Chemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia
- Tadej Medved
- Department of Physical Chemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia
- Samo Purič
- Graduate Study Program, Faculty of Chemistry and Chemical Technology, University of Ljubljana, SI-1000 Ljubljana, Slovenia
- Wim Vranken
- Artificial Intelligence Laboratory, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, Triomflaan, 1050 Brussels, Belgium; Structural Biology Brussels, Vrije Universiteit Brussel, Brussels 1050, Belgium; VIB Structural Biology Research Centre, Brussels 1050, Belgium
- Jurij Lah
- Department of Physical Chemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia
- San Hadži
- Department of Physical Chemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, 1000 Ljubljana, Slovenia.
|
28
|
Rendón-Luna DF, Arroyo-Mosso IA, De Luna-Valenciano H, Campos F, Segovia L, Saab-Rincón G, Cuevas-Velazquez CL, Reyes JL, Covarrubias AA. Alternative conformations of a group 4 Late Embryogenesis Abundant protein associated to its in vitro protective activity. Sci Rep 2024; 14:2770. [PMID: 38307936 PMCID: PMC10837141 DOI: 10.1038/s41598-024-53295-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/09/2023] [Accepted: 01/30/2024] [Indexed: 02/04/2024] Open
Abstract
Late Embryogenesis Abundant (LEA) proteins are a group of intrinsically disordered proteins implicated in plant responses to water deficit. In vitro studies revealed that LEA proteins protect reporter enzymes from inactivation during low water availability. Group 4 LEA proteins constitute a conserved protein family, displaying in vitro protective capabilities. Under water deficiency or macromolecular crowding, the N-terminal region of these proteins adopts an alpha-helix conformation. This region has been identified as responsible for the protein's in vitro protective activity. This study investigates whether the attainment of alpha-helix conformation and/or particular amino acid residues are required for the in vitro protective activity. The LEA4-5 protein from Arabidopsis thaliana was used to generate mutant proteins. The mutations altered conserved residues, deleted specific conserved regions, or introduced prolines to hinder alpha-helix formation. The results indicate that conserved residues are not essential for LEA4-5 protective function. Interestingly, the C-terminal region was found to contribute to this function. Moreover, alpha-helix conformation is necessary for the protective activity only when the C-terminal region is deleted. Overall, LEA4-5 shows the ability to adopt alternative functional conformations under the tested conditions. These findings shed light on the in vitro mechanisms by which LEA proteins protect against water deficit stress.
Affiliation(s)
- David F Rendón-Luna
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Inti A Arroyo-Mosso
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Haydee De Luna-Valenciano
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Programa de Biología Sintética, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Av. Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Francisco Campos
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Lorenzo Segovia
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Gloria Saab-Rincón
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Cesar L Cuevas-Velazquez
- Departamento de Bioquímica, Facultad de Química, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510, Ciudad de México, México
- José Luis Reyes
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México
- Alejandra A Covarrubias
- Departamento de Biología Molecular de Plantas, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Avenida Universidad 2001, Colonia Chamilpa, 62210, Cuernavaca, Morelos, México.
|
29
|
Biswas S, Gollub E, Yu F, Ginell G, Holehouse A, Sukenik S, Boothby TC. Helicity of a tardigrade disordered protein contributes to its protective function during desiccation. Protein Sci 2024; 33:e4872. [PMID: 38114424 PMCID: PMC10804681 DOI: 10.1002/pro.4872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/13/2023] [Revised: 11/30/2023] [Accepted: 12/12/2023] [Indexed: 12/21/2023]
Abstract
To survive extreme drying (anhydrobiosis), many organisms, spanning every kingdom of life, accumulate intrinsically disordered proteins (IDPs). For decades, the ability of anhydrobiosis-related IDPs to form transient amphipathic helices has been suggested to be important for promoting desiccation tolerance. However, evidence empirically supporting the necessity and/or sufficiency of helicity in mediating anhydrobiosis is lacking. Here, we demonstrate that the linker region of CAHS D, a desiccation-related IDP from the tardigrade Hypsibius exemplaris, that contains significant helical structure, is the protective portion of this protein. Perturbing the sequence composition and grammar of the linker region of CAHS D, through the insertion of helix-breaking prolines, modulating the identity of charged residues, or replacement of hydrophobic amino acids with serine or glycine residues results in variants with different degrees of helical structure. Importantly, correlation of protective capacity and helical content in variants generated through different helix perturbing modalities does not show as strong a trend, suggesting that while helicity is important, it is not the only property that makes a protein protective during desiccation. These results provide direct evidence for the decades-old theory that helicity of desiccation-related IDPs is linked to their anhydrobiotic capacity.
Affiliation(s)
- Sourav Biswas
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
- Edith Gollub
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California, USA
- Quantitative Systems Biology Program, University of California Merced, Merced, California, USA
- Feng Yu
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California, USA
- Quantitative Systems Biology Program, University of California Merced, Merced, California, USA
- Garrett Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, Missouri, USA
- Alex Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, Missouri, USA
- Shahar Sukenik
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, California, USA
- Quantitative Systems Biology Program, University of California Merced, Merced, California, USA
- Thomas C. Boothby
- Department of Molecular Biology, University of Wyoming, Laramie, Wyoming, USA
30
Lin Z, Li D, Zheng J, Yao C, Liu D, Zhang H, Feng H, Chen C, Li P, Zhang Y, Jiang B, Hu Z, Zhao Y, Shi F, Cao D, Rodriguez-Wallberg KA, Li Z, Yeung WSB, Chow LT, Wang H, Liu K. The male pachynema-specific protein MAPS drives phase separation in vitro and regulates sex body formation and chromatin behaviors in vivo. Cell Rep 2024; 43:113651. [PMID: 38175751 DOI: 10.1016/j.celrep.2023.113651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2023] [Revised: 08/12/2023] [Accepted: 12/19/2023] [Indexed: 01/06/2024] Open
Abstract
Dynamic chromosome remodeling and nuclear compartmentalization take place during mammalian meiotic prophase I. We report here that the crucial roles of male pachynema-specific protein (MAPS) in pachynema progression might be mediated by its liquid-liquid phase separation in vitro and in cellulo. MAPS forms distinguishable liquid phases, and deletion or mutations of its N-terminal amino acids (aa) 2-9 disrupt its secondary structure and charge properties, impeding phase separation. Maps-/- pachytene spermatocytes exhibit defects in nucleus compartmentalization, including defects in forming sex bodies, altered nucleosome composition, and disordered chromatin accessibility. MapsΔ2-9/Δ2-9 male mice expressing MAPS protein lacking aa 2-9 phenocopy Maps-/- mice. Moreover, a frameshift mutation in C3orf62, the human counterpart of Maps, is correlated with nonobstructive azoospermia in a patient exhibiting pachynema arrest in spermatocyte development. Hence, the phase separation property of MAPS seems essential for pachynema progression in mouse and human spermatocytes.
Affiliation(s)
- Zexiong Lin
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Dongliang Li
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Jiahuan Zheng
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Chencheng Yao
- Department of Andrology, Center for Men's Health, Department of ART, Institute of Urology, Urologic Medical Center, Shanghai Key Laboratory of Reproductive Medicine, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China
- Dongteng Liu
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Hao Zhang
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
- Haiwei Feng
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Chunxu Chen
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA; Department of Internal Medicine, Division of Hematology, Oncology and Palliative Care, Massey Cancer Institute, Virginia Commonwealth University, Richmond, VA, USA
- Peng Li
- Department of Andrology, Center for Men's Health, Department of ART, Institute of Urology, Urologic Medical Center, Shanghai Key Laboratory of Reproductive Medicine, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China
- Yuxiang Zhang
- Department of Andrology, Center for Men's Health, Department of ART, Institute of Urology, Urologic Medical Center, Shanghai Key Laboratory of Reproductive Medicine, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China
- Binjie Jiang
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Zhe Hu
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Yu Zhao
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Fu Shi
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Dandan Cao
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Zheng Li
- Department of Andrology, Center for Men's Health, Department of ART, Institute of Urology, Urologic Medical Center, Shanghai Key Laboratory of Reproductive Medicine, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China
- William S B Yeung
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China
- Louise T Chow
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA.
- Hengbin Wang
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham, Birmingham, AL, USA; Department of Internal Medicine, Division of Hematology, Oncology and Palliative Care, Massey Cancer Institute, Virginia Commonwealth University, Richmond, VA, USA.
- Kui Liu
- Department of Obstetrics and Gynecology, Li Ka Shing Faculty of Medicine; Shenzhen Key Laboratory of Fertility Regulation, Center of Assisted Reproduction and Embryology, The University of Hong Kong-Shenzhen Hospital, The University of Hong Kong, Hong Kong, China.
31
Brumbaugh-Reed EH, Aoki K, Toettcher JE. Rapid and reversible dissolution of biomolecular condensates using light-controlled recruitment of a solubility tag. bioRxiv 2024:2024.01.16.575860. [PMID: 38293146 PMCID: PMC10827175 DOI: 10.1101/2024.01.16.575860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/01/2024]
Abstract
Biomolecular condensates are broadly implicated in both normal cellular regulation and disease. Consequently, several chemical biology and optogenetic approaches have been developed to induce phase separation of a protein of interest. However, few tools are available to perform the converse function: dissolving a condensate of interest on demand. Such a tool would aid in testing whether the condensate plays specific functional roles, a major question in cell biology and drug development. Here we report an optogenetic approach to selectively dissolve a condensate of interest in a reversible and spatially controlled manner. We show that light-gated recruitment of maltose-binding protein (MBP), a commonly used solubilizing domain in protein purification, results in rapid and controlled dissolution of condensates formed from proteins of interest. Our optogenetic MBP-based dissolution strategy (OptoMBP) is rapid, reversible, and can be spatially controlled with subcellular precision. We also provide a proof-of-principle application of OptoMBP, showing that disrupting condensation of the oncogenic fusion protein FUS-CHOP results in reversion of FUS-CHOP-driven transcriptional changes. We envision that the OptoMBP system could be broadly useful for disrupting constitutive protein condensates to probe their biological functions.
Affiliation(s)
- Ellen H Brumbaugh-Reed
- Department of Molecular Biology, Princeton University, Princeton NJ 08544
- Omenn-Darling Bioengineering Institute, Princeton University, Princeton NJ 08544
- International Research Collaboration Center (IRCC), National Institutes of Natural Sciences, Tokyo 105-0001, Japan
- Kazuhiro Aoki
- International Research Collaboration Center (IRCC), National Institutes of Natural Sciences, Tokyo 105-0001, Japan
- Quantitative Biology Research Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of Natural Sciences, Okazaki, Aichi 444-8787, Japan
- Division of Quantitative Biology, National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Aichi 444-8787, Japan
- Department of Basic Biology, School of Life Science, SOKENDAI (The Graduate University for Advanced Studies), Okazaki, Aichi 444-8787, Japan
- Laboratory of Cell Cycle Regulation, Graduate School of Biostudies, Kyoto University, Kyoto, Kyoto 606-8315, Japan
- Jared E Toettcher
- Department of Molecular Biology, Princeton University, Princeton NJ 08544
- Omenn-Darling Bioengineering Institute, Princeton University, Princeton NJ 08544
32
Gao M, Huang Y. Molecular dynamics simulations revealed topological frustration in the binding-wrapping process of eIF4G with eIF4E. Phys Chem Chem Phys 2024; 26:2073-2081. [PMID: 38131207 DOI: 10.1039/d3cp04899c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/23/2023]
Abstract
Interaction between the cap-binding protein eIF4E and the scaffolding protein eIF4G is essential for the cap-dependent translation initiation in eukaryotes. In the Saccharomyces cerevisiae eIF4G/eIF4E complex, the intrinsically disordered eIF4E-binding domain of eIF4G folds into a bracelet-like structure upon binding to eIF4E. Aiming to unveil the molecular mechanism underlying the binding-wrapping process of eIF4G with eIF4E, we performed extensive coarse-grained molecular dynamics simulations and transition path analysis in this work. The major transition pathway revealed from our simulations showed that docking of the eIF4E-binding motif of eIF4G to the folded core of eIF4E initiates the binding process and then the disordered eIF4G wraps around the N-terminal tail of eIF4E. Additionally, we identified a minor transition pathway which indicates the involvement of topological frustration in the binding process. By manipulating the interaction strength of the wrapping contacts and the latching contacts, we further dissected factors affecting the formation of topological frustration and the binding transition kinetics. Our findings provide new clues for experimental studies on the binding mechanism of eIF4G to eIF4E in the future and exemplify the involvement of topological frustration in the binding process of intrinsically disordered proteins.
Affiliation(s)
- Meng Gao
- Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan 430068, China.
- Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China
- Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China
- Yongqi Huang
- Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan 430068, China.
- Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China
- Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China
33
Fritze JS, Stiehler FF, Wolfrum U. Pathogenic Variants in USH1G/SANS Alter Protein Interaction with Pre-RNA Processing Factors PRPF6 and PRPF31 of the Spliceosome. Int J Mol Sci 2023; 24:17608. [PMID: 38139438 PMCID: PMC10744108 DOI: 10.3390/ijms242417608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Revised: 12/08/2023] [Accepted: 12/12/2023] [Indexed: 12/24/2023] Open
Abstract
Pre-mRNA splicing is an essential process orchestrated by the spliceosome, a dynamic complex assembled stepwise on pre-mRNA. We have previously identified that USH1G protein SANS regulates pre-mRNA splicing by mediating the intranuclear transfer of the spliceosomal U4/U6.U5 tri-snRNP complex. During this process, SANS interacts with the U4/U6 and U5 snRNP-specific proteins PRPF31 and PRPF6 and regulates splicing, which is disturbed by variants of USH1G/SANS causative for human Usher syndrome (USH), the most common form of hereditary deaf-blindness. Here, we aim to gain further insights into the molecular interaction of the splicing molecules PRPF31 and PRPF6 to the CENTn domain of SANS using fluorescence resonance energy transfer assays in cells and in silico deep learning-based protein structure predictions. This demonstrates that SANS directly binds via two distinct conserved regions of its CENTn to the two PRPFs. In addition, we provide evidence that these interactions occur sequentially and a conformational change of an intrinsically disordered region to a short α-helix of SANS CENTn2 is triggered by the binding of PRPF6. Furthermore, we find that pathogenic variants of USH1G/SANS perturb the binding of SANS to both PRPFs, implying a significance for the USH1G pathophysiology.
Affiliation(s)
- Uwe Wolfrum
- Institute of Molecular Physiology, Johannes Gutenberg University Mainz, 55128 Mainz, Germany; (J.S.F.)
34
Manriquez-Sandoval E, Brewer J, Lule G, Lopez S, Fried SD. FLiPPR: A Processor for Limited Proteolysis (LiP) Mass Spectrometry Datasets Built on FragPipe. bioRxiv 2023:2023.12.04.569947. [PMID: 38106106 PMCID: PMC10723326 DOI: 10.1101/2023.12.04.569947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/19/2023]
Abstract
Here, we present FLiPPR, or FragPipe LiP (limited proteolysis) Processor, a tool that facilitates the analysis of data from limited proteolysis mass spectrometry (LiP-MS) experiments following primary search and quantification in FragPipe. LiP-MS has emerged as a method that can provide proteome-wide information on protein structure and has been applied to a range of biological and biophysical questions. Although LiP-MS can be carried out with standard laboratory reagents and mass spectrometers, analyzing the data can be slow and poses unique challenges compared to typical quantitative proteomics workflows. To address this, we leverage the fast, sensitive, and accurate search and label-free quantification algorithms in FragPipe and then process its output in FLiPPR. FLiPPR formalizes a specific data imputation heuristic that carefully uses missing data in LiP-MS experiments to report on the most significant structural changes. Moreover, FLiPPR introduces a new data merging scheme (from ions to cut-sites) and a protein-centric multiple hypothesis correction scheme, collectively enabling processed LiP-MS datasets to be more robust and less redundant. These improvements substantially strengthen statistical trends when previously published data are reanalyzed with the FragPipe/FLiPPR workflow. As a final feature, FLiPPR facilitates the collection of structural metadata to identify correlations between experiments and structural features. We hope that FLiPPR will lower the barrier for more users to adopt LiP-MS, standardize statistical procedures for LiP-MS data analysis, and systematize output to facilitate eventual larger-scale integration of LiP-MS data.
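The idea of a protein-centric multiple hypothesis correction can be illustrated with a minimal sketch. It assumes, as a labeled assumption rather than a statement about FLiPPR's actual code, that a Benjamini-Hochberg adjustment is applied separately within each protein's set of p-values instead of across the whole dataset. The function names and the record layout `(protein_id, site_id, p_value)` are hypothetical.

```python
from collections import defaultdict


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def adjust_per_protein(records):
    """Adjust p-values within each protein (hypothetical record layout).

    records: iterable of (protein_id, site_id, p_value) tuples.
    Returns a dict mapping (protein_id, site_id) to the adjusted p-value.
    """
    groups = defaultdict(list)
    for protein_id, site_id, p_value in records:
        groups[protein_id].append((site_id, p_value))

    result = {}
    for protein_id, items in groups.items():
        adjusted = benjamini_hochberg([p for _, p in items])
        for (site_id, _), q in zip(items, adjusted):
            result[(protein_id, site_id)] = q
    return result
```

Grouping by protein means that the number of tests used in the correction is the number of sites measured on that protein, so a protein with few quantified sites is not penalized for the size of the whole proteome.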
Affiliation(s)
- Edgar Manriquez-Sandoval
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21218, USA
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
- Joy Brewer
- Department of Chemistry and Biochemistry, Old Dominion University, Norfolk, VA, 23529, USA
- Gabriela Lule
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21218, USA
- Samanta Lopez
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21218, USA
- Stephen D. Fried
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21218, USA
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
35
McConnell BS, Parker MW. Protein intrinsically disordered regions have a non-random, modular architecture. Bioinformatics 2023; 39:btad732. [PMID: 38039154 PMCID: PMC10719218 DOI: 10.1093/bioinformatics/btad732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2023] [Revised: 11/03/2023] [Accepted: 11/30/2023] [Indexed: 12/03/2023] Open
Abstract
MOTIVATION: Protein sequences can be broadly categorized into two classes: those which adopt stable secondary structure and fold into a domain (i.e. globular proteins), and those that do not. The sequences belonging to this latter class are conformationally heterogeneous and are described as being intrinsically disordered. Decades of investigation into the structure and function of globular proteins have resulted in a suite of computational tools that enable their sub-classification by domain type, an approach that has revolutionized how we understand and predict protein functionality. Conversely, it is unknown if sequences of disordered protein regions are subject to broadly generalizable organizational principles that would enable their sub-classification. RESULTS: Here, we report the development of a statistical approach that quantifies linear variance in amino acid composition across a sequence. With multiple examples, we provide evidence that intrinsically disordered regions are organized into statistically non-random modules of unique compositional bias. Modularity is observed for both low and high-complexity sequences and, in some cases, we find that modules are organized in repetitive patterns. These data demonstrate that disordered sequences are non-randomly organized into modular architectures and motivate future experiments to comprehensively classify module types and to determine the degree to which modules constitute functionally separable units analogous to the domains of globular proteins. AVAILABILITY AND IMPLEMENTATION: The source code, documentation, and data to reproduce all figures are freely available at https://github.com/MWPlabUTSW/Chi-Score-Analysis.git. The analysis is also available as a Google Colab Notebook (https://colab.research.google.com/github/MWPlabUTSW/Chi-Score-Analysis/blob/main/ChiScore_Analysis.ipynb).
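One way to picture "linear variance in amino acid composition" is to compare the residue counts of two neighbouring stretches of a sequence with a Pearson chi-square statistic. The sketch below is an illustration under stated assumptions only: it is not taken from the authors' repository, the function names and the fixed `window` parameter are hypothetical, and the published method may normalize or segment sequences differently.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_chi2(seq_a: str, seq_b: str) -> float:
    """Pearson chi-square statistic for a 2 x 20 table of residue counts."""
    counts_a, counts_b = Counter(seq_a), Counter(seq_b)
    # Only the 20 standard residues contribute to the table.
    n_a = sum(counts_a[aa] for aa in AMINO_ACIDS)
    n_b = sum(counts_b[aa] for aa in AMINO_ACIDS)
    if n_a == 0 or n_b == 0:
        return 0.0
    total = n_a + n_b
    chi2 = 0.0
    for aa in AMINO_ACIDS:
        column_total = counts_a[aa] + counts_b[aa]
        if column_total == 0:
            continue  # residue absent from both segments
        for observed, row_total in ((counts_a[aa], n_a), (counts_b[aa], n_b)):
            expected = column_total * row_total / total
            chi2 += (observed - expected) ** 2 / expected
    return chi2


def sliding_chi2(seq: str, window: int = 10) -> list[tuple[int, float]]:
    """Compare the windows immediately left and right of each position."""
    return [
        (i, composition_chi2(seq[i - window:i], seq[i:i + window]))
        for i in range(window, len(seq) - window + 1)
    ]
```

Positions where the statistic peaks are candidate boundaries between compositionally distinct segments; identical neighbouring windows give a value of zero.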
Affiliation(s)
- Brendan S McConnell
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75235, United States
- Matthew W Parker
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX 75235, United States
36
Whitehead JD, Decool H, Leyrat C, Carrique L, Fix J, Eléouët JF, Galloux M, Renner M. Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus. Nat Commun 2023; 14:7627. [PMID: 37993464 PMCID: PMC10665349 DOI: 10.1038/s41467-023-43434-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2023] [Accepted: 11/08/2023] [Indexed: 11/24/2023] Open
Abstract
Human metapneumovirus (HMPV) is a major cause of respiratory illness in young children. The HMPV polymerase (L) binds an obligate cofactor, the phosphoprotein (P). During replication and transcription, the L/P complex traverses the viral RNA genome, which is encapsidated within nucleoproteins (N). An essential interaction between N and a C-terminal region of P tethers the L/P polymerase to the template. This N-P interaction is also involved in the formation of cytoplasmic viral factories in infected cells, called inclusion bodies. To define how the polymerase component P recognizes N-encapsidated RNA (N-RNA) we employed cryogenic electron microscopy (cryo-EM) and molecular dynamics simulations, coupled to activity assays and imaging of inclusion bodies in cells. We report a 2.9 Å resolution structure of a triple-complex between multimeric N, bound to both RNA and the C-terminal region of P. Furthermore, we also present cryo-EM structures of assembled N in different oligomeric states, highlighting the plasticity of N. Combined with our functional assays, these structural data delineate in molecular detail how P attaches to N-RNA whilst retaining substantial conformational dynamics. Moreover, the N-RNA-P triple complex structure provides a molecular blueprint for the design of therapeutics to potentially disrupt the attachment of L/P to its template.
Affiliation(s)
- Jack D Whitehead
- Division of Structural Biology, The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
- Hortense Decool
- Université Paris-Saclay, INRAE, UVSQ, VIM, 78350, Jouy-en-Josas, France
- Cédric Leyrat
- Institut de Génomique Fonctionnelle, Université de Montpellier, CNRS, INSERM, Montpellier, France
- Loic Carrique
- Division of Structural Biology, The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Jenna Fix
- Université Paris-Saclay, INRAE, UVSQ, VIM, 78350, Jouy-en-Josas, France
- Marie Galloux
- Université Paris-Saclay, INRAE, UVSQ, VIM, 78350, Jouy-en-Josas, France.
- Max Renner
- Department of Chemistry, Umeå University, Umeå, Sweden.
- Umeå Centre for Microbial Research, Umeå University, Umeå, Sweden.
37
LaPeruta AJ, Micic J, Woolford Jr. JL. Additional principles that govern the release of pre-ribosomes from the nucleolus into the nucleoplasm in yeast. Nucleic Acids Res 2023; 51:10867-10883. [PMID: 35736211 PMCID: PMC10639060 DOI: 10.1093/nar/gkac430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2021] [Revised: 05/05/2022] [Accepted: 06/20/2022] [Indexed: 11/14/2022] Open
Abstract
During eukaryotic ribosome biogenesis, pre-ribosomes travel from the nucleolus, where assembly is initiated, to the nucleoplasm and then are exported to the cytoplasm, where assembly concludes. Although nuclear export of pre-ribosomes has been extensively investigated, the release of pre-ribosomes from the nucleolus is an understudied phenomenon. Initial data indicate that unfolded rRNA interacts in trans with nucleolar components and that, when rRNA folds due to ribosomal protein (RP) binding, the number of trans interactions drops below the threshold necessary for nucleolar retention. To validate and expand on this idea, we performed a bioinformatic analysis of the protein components of the Saccharomyces cerevisiae ribosome assembly pathway. We found that ribosome biogenesis factors (RiBi factors) contain significantly more predicted trans interacting regions than RPs. We also analyzed cryo-EM structures of ribosome assembly intermediates to determine how nucleolar pre-ribosomes differ from post-nucleolar pre-ribosomes, specifically the capacity of RPs, RiBi factors, and rRNA components to interact in trans. We observed a significant decrease in the theoretical trans-interacting capability of pre-ribosomes between nucleolar and post-nucleolar stages of assembly due to the release of RiBi factors from particles and the folding of rRNA. Here, we provide a mechanism for the release of pre-ribosomes from the nucleolus.
Affiliation(s)
- Amber J LaPeruta
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Jelena Micic
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- John L Woolford Jr.
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
38
Kurgan L, Hu G, Wang K, Ghadermarzi S, Zhao B, Malhis N, Erdős G, Gsponer J, Uversky VN, Dosztányi Z. Tutorial: a guide for the selection of fast and accurate computational tools for the prediction of intrinsic disorder in proteins. Nat Protoc 2023; 18:3157-3172. [PMID: 37740110 DOI: 10.1038/s41596-023-00876-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 12/13/2022] [Accepted: 06/21/2023] [Indexed: 09/24/2023]
Abstract
Intrinsic disorder is instrumental for a wide range of protein functions, and its analysis, using computational predictions from primary structures, complements secondary and tertiary structure-based approaches. In this Tutorial, we provide an overview and comparison of 23 publicly available computational tools with complementary parameters useful for intrinsic disorder prediction, partly relying on results from the Critical Assessment of protein Intrinsic Disorder prediction experiment. We consider factors such as accuracy, runtime, availability and the need for functional insights. The selected tools are available as web servers and downloadable programs, offer state-of-the-art predictions and can be used in a high-throughput manner. We provide examples and instructions for the selected tools to illustrate practical aspects related to the submission, collection and interpretation of predictions, as well as the timing and their limitations. We highlight two predictors for intrinsically disordered proteins, flDPnn as accurate and fast and IUPred as very fast and moderately accurate, while suggesting ANCHOR2 and MoRFchibi as two of the best-performing predictors for intrinsically disordered region binding. We link these tools to additional resources, including databases of predictions and web servers that integrate multiple predictive methods. Altogether, this Tutorial provides a hands-on guide to comparatively evaluating multiple predictors, submitting and collecting their own predictions, and reading and interpreting results. It is suitable for experimentalists and computational biologists interested in accurately and conveniently identifying intrinsic disorder, facilitating the functional characterization of the rapidly growing collections of protein sequences.
Affiliation(s)
- Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
- Gang Hu
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
- Kui Wang
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
- Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
- Nawar Malhis
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
- Gábor Erdős
- MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
- Jörg Gsponer
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada.
- Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Byrd Alzheimer's Center and Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Zsuzsanna Dosztányi
- MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary.
39
Alderson TR, Pritišanac I, Kolarić Đ, Moses AM, Forman-Kay JD. Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2. Proc Natl Acad Sci U S A 2023; 120:e2304302120. [PMID: 37878721 PMCID: PMC10622901 DOI: 10.1073/pnas.2304302120] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 03/22/2023] [Accepted: 08/30/2023] [Indexed: 10/27/2023] Open
Abstract
The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs.
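For readers less familiar with the two metrics quoted above, the following minimal sketch shows how precision and false positive rate are computed from a confusion matrix. All counts are hypothetical and chosen only so that the arithmetic lands on 88% precision at a 10% false positive rate; they are not the study's data.

```python
def precision(true_pos: int, false_pos: int) -> float:
    """Fraction of predicted positives that are real positives."""
    return true_pos / (true_pos + false_pos)


def false_positive_rate(false_pos: int, true_neg: int) -> float:
    """Fraction of real negatives that are wrongly called positive."""
    return false_pos / (false_pos + true_neg)


# Hypothetical benchmark: 88 conditionally folding IDRs correctly flagged,
# 12 non-folding IDRs wrongly flagged, 108 non-folding IDRs correctly rejected.
balanced = {"tp": 88, "fp": 12, "tn": 108}

# Same classifier behaviour (same false positive rate) on a hypothetical set
# with ten times as many non-folding IDRs.
imbalanced = {"tp": 88, "fp": 120, "tn": 1080}

for name, c in (("balanced", balanced), ("imbalanced", imbalanced)):
    print(
        name,
        f"precision={precision(c['tp'], c['fp']):.3f}",
        f"FPR={false_positive_rate(c['fp'], c['tn']):.3f}",
    )
```

The second set of counts shows why the two numbers are reported together: precision depends on how common true positives are in the evaluated set, whereas the false positive rate does not.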
Affiliation(s)
- T. Reid Alderson
- Department of Biochemistry, University of Toronto, Toronto, ON M5S 1A8, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
- Iva Pritišanac
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S 35G, Canada
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Department of Molecular Biology and Biochemistry, Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Medical University of Graz, Graz 8010, Austria
- Đesika Kolarić
- Department of Molecular Biology and Biochemistry, Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Medical University of Graz, Graz 8010, Austria
- Alan M. Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S 35G, Canada
- Julie D. Forman-Kay
- Department of Biochemistry, University of Toronto, Toronto, ON M5S 1A8, Canada
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| |
|
40
|
Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. bioRxiv 2023:2023.10.08.561337. [PMID: 37873463 PMCID: PMC10592693 DOI: 10.1101/2023.10.08.561337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/25/2023]
Abstract
The fidelity of signal transduction requires the binding of regulatory molecules to their cognate targets. However, the crowded cell interior risks off-target interactions between proteins that are functionally unrelated. How such off-target interactions impact fitness is not generally known, but quantifying this is required to understand the constraints faced by cell systems as they evolve. Here, we use the model organism S. cerevisiae to inducibly express tyrosine kinases. Because yeast lacks bona fide tyrosine kinases, most of the resulting tyrosine phosphorylation is spurious. This provides a suitable system to measure the impact of artificial protein interactions on fitness. We engineered 44 yeast strains each expressing a tyrosine kinase, and quantitatively analysed their phosphoproteomes. This analysis resulted in ~30,000 phosphosites mapping to ~3,500 proteins. Examination of the fitness costs in each strain revealed a strong correlation between the number of spurious pY sites and decreased growth. Moreover, the analysis of pY effects on protein structure and on protein function revealed over 1000 pY events that we predict to be deleterious. However, we also find that a large number of the spurious pY sites have a negligible effect on fitness, possibly because of their low stoichiometry. This result is consistent with our evolutionary analyses demonstrating a lack of phosphotyrosine counter-selection in species with bona fide tyrosine kinases. Taken together, our results suggest that, alongside the risk for toxicity, the cell can tolerate a large degree of non-functional crosstalk as interaction networks evolve.
Affiliation(s)
- David Bradley
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexander Hogrebe
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rohan Dandage
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Ugo Dionne
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexis Chang
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Judit Villén
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Christian R Landry
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| |
|
41
|
Pandi B, Brenman S, Black A, Ng DCM, Lau E, Lam MPY. Tissue Usage Preference and Intrinsically Disordered Region Remodeling of Alternative Splicing Derived Proteoforms in the Heart. bioRxiv 2023:2023.10.08.561375. [PMID: 37873130 PMCID: PMC10592692 DOI: 10.1101/2023.10.08.561375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/25/2023]
Abstract
A computational analysis of mass spectrometry data was performed to uncover alternative splicing derived protein variants across chambers of the human heart. Evidence for 216 non-canonical isoforms was apparent in the atrium and the ventricle, including 52 isoforms not documented on SwissProt and recovered using an RNA sequencing derived database. Among non-canonical isoforms, 29 show signs of regulation based on statistically significant preferences in tissue usage, including a ventricular enriched protein isoform of tensin-1 (TNS1) and an atrium-enriched PDZ and LIM Domain 3 (PDLIM3) isoform 2 (PDLIM3-2/ALP-H). Examined variant regions that differ between alternative and canonical isoforms are highly enriched in intrinsically disordered regions, and over two-thirds of such regions are predicted to function in protein binding and/or RNA binding. The analysis here lends further credence to the notion that alternative splicing diversifies the proteome by rewiring intrinsically disordered regions, which are increasingly recognized to play important roles in the generation of biological function from protein sequences.
Affiliation(s)
- Boomathi Pandi
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Stella Brenman
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Alexander Black
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Dominic C. M. Ng
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Edward Lau
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Maggie P. Y. Lam
- Department of Medicine/Division of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Consortium for Fibrosis Research and Translation (CFReT), University of Colorado School of Medicine, Aurora, CO 80045, USA
| |
|
42
|
Khandwala CB, Sarkar P, Schmidt HB, Ma M, Kinnebrew M, Pusapati GV, Patel BB, Tillo D, Lebensohn AM, Rohatgi R. Direct ionic stress sensing and mitigation by the transcription factor NFAT5. bioRxiv 2023:2023.09.23.559074. [PMID: 37886503 PMCID: PMC10602047 DOI: 10.1101/2023.09.23.559074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/28/2023]
Abstract
Homeostatic control of intracellular ionic strength is essential for protein, organelle and genome function, yet mechanisms that sense and enable adaptation to ionic stress remain poorly understood in animals. We find that the transcription factor NFAT5 directly senses solution ionic strength using a C-terminal intrinsically disordered region. Both in intact cells and in a purified system, NFAT5 forms dynamic, reversible biomolecular condensates in response to increasing ionic strength. This self-associative property, conserved from insects to mammals, allows NFAT5 to accumulate in the nucleus and activate genes that restore cellular ion content. Mutations that reduce condensation or those that promote aggregation both reduce NFAT5 activity, highlighting the importance of optimally tuned associative interactions. Remarkably, human NFAT5 alone is sufficient to reconstitute a mammalian transcriptional response to ionic or hypertonic stress in yeast. Thus NFAT5 is both the sensor and effector of a cell-autonomous ionic stress response pathway in animal cells.
Affiliation(s)
- Chandni B. Khandwala
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Parijat Sarkar
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - H. Broder Schmidt
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Mengxiao Ma
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Maia Kinnebrew
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Ganesh V. Pusapati
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Bhaven B. Patel
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Desiree Tillo
- Center for Cancer Research Genomics Core, National Cancer Institute, National Institutes of Health, NIH, Building 37, RM 2056B, Bethesda, MD, 20892, USA
| | - Andres M. Lebensohn
- Laboratory of Cellular and Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, NIH, Building 37, RM 2056B, Bethesda, MD, 20892, USA
| | - Rajat Rohatgi
- Departments of Biochemistry and Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| |
|
43
|
Hu H, Ho D, Tan DS, MacCarthy C, Yu CH, Weng M, Schöler H, Jauch R. Evaluation of the determinants for improved pluripotency induction and maintenance by engineered SOX17. Nucleic Acids Res 2023; 51:8934-8956. [PMID: 37607832 PMCID: PMC10516664 DOI: 10.1093/nar/gkad597] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 09/08/2022] [Revised: 06/30/2023] [Accepted: 07/06/2023] [Indexed: 08/24/2023] Open
Abstract
An engineered SOX17 variant with point mutations within its DNA binding domain termed SOX17FNV is a more potent pluripotency inducer than SOX2, yet the underlying mechanism remains unclear. Although wild-type SOX17 was incapable of inducing pluripotency, SOX17FNV outperformed SOX2 in mouse and human pluripotency reprogramming. In embryonic stem cells, SOX17FNV could replace SOX2 to maintain pluripotency despite considerable sequence differences and upregulated genes expressed in cleavage-stage embryos. Mechanistically, SOX17FNV co-bound OCT4 more cooperatively than SOX2 in the context of the canonical SoxOct DNA element. SOX2, SOX17, and SOX17FNV were all able to bind nucleosome core particles in vitro, which is a prerequisite for pioneer transcription factors. Experiments using purified proteins and in cellular contexts showed that SOX17 variants phase-separated more efficiently than SOX2, suggesting an enhanced ability to self-organise. Systematic deletion analyses showed that the N-terminus of SOX17FNV was dispensable for its reprogramming activity. However, the C-terminus encodes essential domains indicating multivalent interactions that drive transactivation and reprogramming. We defined a minimal SOX17FNV (miniSOX) that can support reprogramming with high activity, reducing the payload of reprogramming cassettes. This study uncovers the mechanisms behind SOX17FNV-induced pluripotency and establishes engineered SOX factors as powerful cell engineering tools.
Affiliation(s)
- Haoqing Hu
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Derek Hoi Hang Ho
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- Centre for Translational Stem Cell Biology, Hong Kong
| | - Daisylyn Senna Tan
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | | | - Cheng-han Yu
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Mingxi Weng
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- Centre for Translational Stem Cell Biology, Hong Kong
| | | | - Ralf Jauch
- School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- Centre for Translational Stem Cell Biology, Hong Kong
| |
|
44
|
Hummel NFC, Markel K, Stefani J, Staller MV, Shih PM. Systematic identification of transcriptional activator domains from non-transcription factor proteins in plants and yeast. bioRxiv 2023:2023.09.12.557247. [PMID: 37745555 PMCID: PMC10515812 DOI: 10.1101/2023.09.12.557247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/26/2023]
Abstract
Transcription factors promote gene expression via trans-regulatory activation domains. Although whole genome scale screens in model organisms (e.g. human, yeast, fly) have helped identify activation domains from transcription factors, such screens have been less extensively used to explore the occurrence of activation domains in non-transcription factor proteins, such as transcriptional coactivators, chromatin regulators and some cytosolic proteins, leaving a blind spot on what role activation domains in these proteins could play in regulating transcription. We utilized the activation domain predictor PADDLE to mine the entire proteomes of two model eukaryotes, Arabidopsis thaliana and Saccharomyces cerevisiae (1). We characterized 18,000 fragments covering predicted activation domains from >800 non-transcription factor genes in both species, and experimentally validated that 89% of proteins contained fragments capable of activating transcription in yeast. Peptides with similar sequence composition show a broad range of activities, which is explained by the arrangement of key amino acids. We also annotated hundreds of nuclear proteins with activation domains as putative coactivators; many of which have never been ascribed any function in plants. Furthermore, our library contains >250 non-nuclear proteins containing peptides with activation domain function across both eukaryotic lineages, suggesting that there are unknown biological roles of these peptides beyond transcription. Finally, we identify and validate short, 'universal' eukaryotic activation domains that activate transcription in both yeast and plants with comparable or stronger performance to state-of-the-art activation domains. Overall, our dual host screen provides a blueprint on how to systematically discover novel genetic parts for synthetic biology that function across a wide diversity of eukaryotes.
Significance Statement: Activation domains promote transcription and play a critical role in regulating gene expression. Although the mapping of activation domains from transcription factors has been carried out in previous genome-wide screens, their occurrence in non-transcription factors has been less explored. We utilize an activation domain predictor to mine the entire proteomes of Arabidopsis thaliana and Saccharomyces cerevisiae for new activation domains on non-transcription factor proteins. We validate peptides derived from >750 non-transcription factor proteins capable of activating transcription, discovering many potentially new coactivators in plants. Importantly, we identify novel genetic parts that can function across both species, representing unique synthetic biology tools.
|
45
|
Velasco-Carneros L, Cuéllar J, Dublang L, Santiago C, Maréchal JD, Martín-Benito J, Maestro M, Fernández-Higuero JÁ, Orozco N, Moro F, Valpuesta JM, Muga A. The self-association equilibrium of DNAJA2 regulates its interaction with unfolded substrate proteins and with Hsc70. Nat Commun 2023; 14:5436. [PMID: 37670029 PMCID: PMC10480186 DOI: 10.1038/s41467-023-41150-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 03/26/2022] [Accepted: 08/24/2023] [Indexed: 09/07/2023] Open
Abstract
J-domain proteins tune the specificity of Hsp70s, engaging them in precise functions. Despite their essential role, the structure and function of many J-domain proteins remain largely unknown. We explore human DNAJA2, finding that it reversibly forms highly-ordered, tubular structures that can be dissociated by Hsc70, the constitutively expressed Hsp70 isoform. Cryoelectron microscopy and mutational studies reveal that different domains are involved in self-association. Oligomer dissociation into dimers potentiates its interaction with unfolded client proteins. The J-domains are accessible to Hsc70 within the tubular structure. They allow binding of closely spaced Hsc70 molecules that could be transferred to the unfolded substrate for its cooperative remodelling, explaining the efficient recovery of DNAJA2-bound clients. The disordered C-terminal domain, comprising the last 52 residues, regulates its holding activity and productive interaction with Hsc70. These in vitro findings suggest that the association equilibrium of DNAJA2 could regulate its interaction with client proteins and Hsc70.
Affiliation(s)
- Lorea Velasco-Carneros
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
- Department of Biochemistry and Molecular Biology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Spain
| | - Jorge Cuéllar
- Department of Macromolecular Structure, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | - Leire Dublang
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
- Department of Biochemistry and Molecular Biology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Spain
| | - César Santiago
- Department of Macromolecular Structure, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | - Jean-Didier Maréchal
- Insilichem, Departament de Química, Universitat Autònoma de Barcelona, (UAB), 08193, Bellaterra (Barcelona), Spain
| | - Jaime Martín-Benito
- Department of Macromolecular Structure, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | - Moisés Maestro
- Department of Macromolecular Structure, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | - José Ángel Fernández-Higuero
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
- Department of Biochemistry and Molecular Biology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Spain
| | - Natalia Orozco
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
| | - Fernando Moro
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
- Department of Biochemistry and Molecular Biology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Spain
| | - José María Valpuesta
- Department of Macromolecular Structure, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | - Arturo Muga
- Biofisika Institute (CSIC, UPV/EHU), University of the Basque Country, 48940, Leioa, Spain
- Department of Biochemistry and Molecular Biology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Spain
| |
|
46
|
Wilson C, Lewis KA, Fitzkee NC, Hough LE, Whitten ST. ParSe 2.0: A web tool to identify drivers of protein phase separation at the proteome level. Protein Sci 2023; 32:e4756. [PMID: 37574757 PMCID: PMC10464302 DOI: 10.1002/pro.4756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/23/2023] [Revised: 08/09/2023] [Accepted: 08/10/2023] [Indexed: 08/15/2023]
Abstract
We have developed an algorithm, ParSe, which accurately identifies from the primary sequence those protein regions likely to exhibit physiological phase separation behavior. Originally, ParSe was designed to test the hypothesis that, for flexible proteins, phase separation potential is correlated to hydrodynamic size. While our results were consistent with that idea, we also found that many different descriptors could successfully differentiate between three classes of protein regions: folded, intrinsically disordered, and phase-separating intrinsically disordered. Consequently, numerous combinations of amino acid property scales can be used to make robust predictions of protein phase separation. Built from that finding, ParSe 2.0 uses an optimal set of property scales to predict domain-level organization and compute a sequence-based prediction of phase separation potential. The algorithm is fast enough to scan the whole of the human proteome in minutes on a single computer and is equally or more accurate than other published predictors in identifying proteins and regions within proteins that drive phase separation. Here, we describe a web application for ParSe 2.0 that may be accessed through a browser by visiting https://stevewhitten.github.io/Parse_v2_FASTA to quickly identify phase-separating proteins within large sequence sets, or by visiting https://stevewhitten.github.io/Parse_v2_web to evaluate individual protein sequences.
Affiliation(s)
- Colorado Wilson
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, USA
- Present address: Department of Pharmacology and Toxicology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, Texas, USA
| | - Karen A. Lewis
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, USA
| | - Nicholas C. Fitzkee
- Department of Chemistry, Mississippi State University, Mississippi State, Mississippi, USA
| | - Loren E. Hough
- Department of Physics, University of Colorado Boulder, Boulder, Colorado, USA
- BioFrontiers Institute, University of Colorado Boulder, Boulder, Colorado, USA
| | - Steven T. Whitten
- Department of Chemistry and Biochemistry, Texas State University, San Marcos, Texas, USA
| |
|
47
|
Christou-Kent M, Cuartero S, Garcia-Cabau C, Ruehle J, Naderi J, Erber J, Neguembor MV, Plana-Carmona M, Alcoverro-Bertran M, De Andres-Aguayo L, Klonizakis A, Julià-Vilella E, Lynch C, Serrano M, Hnisz D, Salvatella X, Graf T, Stik G. CEBPA phase separation links transcriptional activity and 3D chromatin hubs. Cell Rep 2023; 42:112897. [PMID: 37516962 DOI: 10.1016/j.celrep.2023.112897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/28/2022] [Revised: 06/02/2023] [Accepted: 07/14/2023] [Indexed: 08/01/2023] Open
Abstract
Cell identity is orchestrated through an interplay between transcription factor (TF) action and genome architecture. The mechanisms used by TFs to shape three-dimensional (3D) genome organization remain incompletely understood. Here we present evidence that the lineage-instructive TF CEBPA drives extensive chromatin compartment switching and promotes the formation of long-range chromatin hubs during induced B cell-to-macrophage transdifferentiation. Mechanistically, we find that the intrinsically disordered region (IDR) of CEBPA undergoes in vitro phase separation (PS) dependent on aromatic residues. Both overexpressing B cells and native CEBPA-expressing cell types such as primary granulocyte-macrophage progenitors, liver cells, and trophectoderm cells reveal nuclear CEBPA foci and long-range 3D chromatin hubs at CEBPA-bound regions. In short, we show that CEBPA can undergo PS through its IDR, which may underlie in vivo foci formation and suggest a potential role of PS in regulating CEBPA function.
Affiliation(s)
- Marie Christou-Kent
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Sergi Cuartero
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Spain; Germans Trias I Pujol Research Institute (IGTP), Badalona, Spain
| | - Carla Garcia-Cabau
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
| | - Julia Ruehle
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Julian Naderi
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| | - Julia Erber
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Maria Victoria Neguembor
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Marcos Plana-Carmona
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | - Luisa De Andres-Aguayo
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Antonios Klonizakis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | - Cian Lynch
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; Altos Labs, Cambridge Institute of Science, Cambridge CB21 6GP, UK
| | - Manuel Serrano
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; Altos Labs, Cambridge Institute of Science, Cambridge CB21 6GP, UK
| | - Denes Hnisz
- Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| | - Xavier Salvatella
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain; ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
| | - Thomas Graf
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Grégoire Stik
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Spain
| |
|
48
|
Antonietti M, Gonzalez DJT, Djulbegovic M, Dayhoff GW, Uversky VN, Shields CL, Karp CL. Intrinsic disorder in PRAME and its role in uveal melanoma. Cell Commun Signal 2023; 21:222. [PMID: 37626310 PMCID: PMC10463658 DOI: 10.1186/s12964-023-01197-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/26/2023] [Accepted: 06/13/2023] [Indexed: 08/27/2023] Open
Abstract
INTRODUCTION: The PReferentially expressed Antigen in MElanoma (PRAME) protein has been shown to be an independent biomarker for increased risk of metastasis in Class 1 uveal melanomas (UM). Intrinsically disordered proteins and regions of proteins (IDPs/IDPRs) are proteins or protein regions that do not have a well-defined three-dimensional structure and have been linked to neoplastic development. Our study aimed to evaluate the presence of intrinsic disorder in PRAME and the role these structureless regions have in PRAME(+) Class 1 UM.
METHODS: We conducted a bioinformatics study to characterize PRAME's propensity for intrinsic disorder. We first used the AlphaFold tool to qualitatively assess the protein structure of PRAME. Then we used the Compositional Profiler and a set of per-residue intrinsic disorder predictors to quantify the intrinsic disorder. The Database of Disordered Protein Prediction (D2P2) platform, IUPred, FuzDrop, fIDPnn, AUCpred, SPOT-Disorder2, and metapredict V2 allowed us to evaluate the potential functional disorder of PRAME. Additionally, we used the Search Tool for the Retrieval of Interacting Genes (STRING) to analyze PRAME's potential interactions with other proteins.
RESULTS: Our structural analysis showed that PRAME contains intrinsically disordered protein regions (IDPRs), which are structureless and flexible. We found that PRAME is significantly enriched with serine (p-value < 0.05), a disorder-promoting amino acid. PRAME was found to have an average disorder score of 16.49% (i.e., moderately disordered) across six per-residue intrinsic disorder predictors. Our IUPred analysis revealed the presence of disorder-to-order transition (DOT) regions in PRAME near the C-terminus of the protein (residues 475-509). The D2P2 platform predicted the region spanning approximately residues 140 to 175 to be highly concentrated with post-translational modifications (PTMs). FuzDrop predicted the PTM hot spot of PRAME to be a droplet-promoting region and an aggregation hotspot. Finally, our analysis using the STRING tool revealed that PRAME has significantly more interactions with other proteins than expected for randomly selected proteins of the same size, with the ability to interact with 84 different partners (STRING analysis result: p-value < 1.0 × 10^-16; model confidence: 0.400).
CONCLUSION: Our study revealed that PRAME has IDPRs that are possibly linked to its functionality in the context of Class 1 UM. The regions of functionality (i.e., DOT regions, PTM sites, droplet-promoting regions, and aggregation hotspots) are localized to regions of high levels of disorder. PRAME has a complex protein-protein interaction (PPI) network that may be secondary to the structureless features of the polypeptide. Our findings contribute to our understanding of UM and suggest that IDPRs and DOT regions in PRAME may be targeted in developing new therapies for this aggressive cancer.
Affiliation(s)
- Michael Antonietti
- Bascom Palmer Eye Institute, University of Miami, 900 NW 17th Street, Miami, FL, 33136, USA
- Mak Djulbegovic
- Bascom Palmer Eye Institute, University of Miami, 900 NW 17th Street, Miami, FL, 33136, USA
- Guy W Dayhoff
- Department of Chemistry, College of Arts and Sciences, University of South Florida, Tampa, FL, 33612, USA
- Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Carol L Shields
- Ocular Oncology Service, Wills Eye Hospital, Thomas Jefferson University, Philadelphia, PA, USA
- Carol L Karp
- Bascom Palmer Eye Institute, University of Miami, 900 NW 17th Street, Miami, FL, 33136, USA
49
Bernardini A, Mukherjee P, Scheer E, Kamenova I, Antonova S, Mendoza Sanchez PK, Yayli G, Morlet B, Timmers HTM, Tora L. Hierarchical TAF1-dependent co-translational assembly of the basal transcription factor TFIID. Nat Struct Mol Biol 2023; 30:1141-1152. [PMID: 37386215 PMCID: PMC10442232 DOI: 10.1038/s41594-023-01026-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 10/06/2022] [Accepted: 05/31/2023] [Indexed: 07/01/2023]
Abstract
Large heteromeric multiprotein complexes play pivotal roles at every step of gene expression in eukaryotic cells. Among them, the 20-subunit basal transcription factor TFIID nucleates the RNA polymerase II preinitiation complex at gene promoters. Here, by combining systematic RNA-immunoprecipitation (RIP) experiments, single-molecule imaging, proteomics and structure-function analyses, we show that human TFIID biogenesis occurs co-translationally. We discovered that all protein heterodimerization steps happen during protein synthesis. We identify TAF1, the largest protein in the complex, as a critical factor for TFIID assembly. TAF1 acts as a flexible scaffold that drives the co-translational recruitment of TFIID submodules preassembled in the cytoplasm. Altogether, our data suggest a multistep hierarchical model for TFIID biogenesis that culminates with the co-translational assembly of the complex onto the nascent TAF1 polypeptide. We envision that this assembly strategy could be shared with other large heteromeric protein complexes.
Affiliation(s)
- Andrea Bernardini
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- Pooja Mukherjee
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- Innovative Genomics Institute, University of California, Berkeley, CA, USA
- Elisabeth Scheer
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- Ivanka Kamenova
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- Nature Protocols, London, UK
- Simona Antonova
- German Cancer Consortium (DKTK) partner site Freiburg, German Cancer Research Center (DKFZ) and Department of Urology, Medical Center-University of Freiburg, Freiburg, Germany
- The Netherlands Cancer Institute, Amsterdam, the Netherlands
- Paulina Karen Mendoza Sanchez
- German Cancer Consortium (DKTK) partner site Freiburg, German Cancer Research Center (DKFZ) and Department of Urology, Medical Center-University of Freiburg, Freiburg, Germany
- Gizem Yayli
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- Bastien Morlet
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
- H T Marc Timmers
- German Cancer Consortium (DKTK) partner site Freiburg, German Cancer Research Center (DKFZ) and Department of Urology, Medical Center-University of Freiburg, Freiburg, Germany
- László Tora
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Université de Strasbourg, Illkirch, France
50
Oksuz O, Henninger JE, Warneford-Thomson R, Zheng MM, Erb H, Vancura A, Overholt KJ, Hawken SW, Banani SF, Lauman R, Reich LN, Robertson AL, Hannett NM, Lee TI, Zon LI, Bonasio R, Young RA. Transcription factors interact with RNA to regulate genes. Mol Cell 2023; 83:2449-2463.e13. [PMID: 37402367 PMCID: PMC10529847 DOI: 10.1016/j.molcel.2023.06.012] [Citation(s) in RCA: 42] [Impact Index Per Article: 42.0] [Received: 09/26/2022] [Revised: 03/16/2023] [Accepted: 06/06/2023] [Indexed: 07/06/2023]
Abstract
Transcription factors (TFs) orchestrate the gene expression programs that define each cell's identity. The canonical TF accomplishes this with two domains, one that binds specific DNA sequences and the other that binds protein coactivators or corepressors. We find that at least half of TFs also bind RNA, doing so through a previously unrecognized domain with sequence and functional features analogous to the arginine-rich motif of the HIV transcriptional activator Tat. RNA binding contributes to TF function by promoting the dynamic association between DNA, RNA, and TF on chromatin. TF-RNA interactions are a conserved feature important for vertebrate development and disrupted in disease. We propose that the ability to bind DNA, RNA, and protein is a general property of many TFs and is fundamental to their gene regulatory function.
Affiliation(s)
- Ozgur Oksuz
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Robert Warneford-Thomson
- Epigenetics Institute, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
- Ming M Zheng
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Hailey Erb
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Adrienne Vancura
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Kalon J Overholt
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Susana Wilson Hawken
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Program of Computational & Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Salman F Banani
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
- Richard Lauman
- Epigenetics Institute, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
- Lauren N Reich
- Epigenetics Institute, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
- Anne L Robertson
- Stem Cell Program, Division of Hematology/Oncology, Boston Children's Hospital and Dana Farber Cancer Institute, Boston, MA 02115, USA; Howard Hughes Medical Institute, Boston, MA 02115, USA
- Nancy M Hannett
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Tong I Lee
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Leonard I Zon
- Stem Cell Program, Division of Hematology/Oncology, Boston Children's Hospital and Dana Farber Cancer Institute, Boston, MA 02115, USA; Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Boston, MA 02115, USA; Stem Cell and Regenerative Biology Department, Harvard University, Cambridge, MA 02138, USA
- Roberto Bonasio
- Epigenetics Institute, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
- Richard A Young
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA