1
|
Yao YM, Miodownik I, O'Hagan MP, Jbara M, Afek A. Deciphering the dynamic code: DNA recognition by transcription factors in the ever-changing genome. Transcription 2024:1-25. [PMID: 39033307 DOI: 10.1080/21541264.2024.2379161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 07/03/2024] [Indexed: 07/23/2024] Open
Abstract
Transcription factors (TFs) intricately navigate the vast genomic landscape to locate and bind specific DNA sequences for the regulation of gene expression programs. These interactions occur within a dynamic cellular environment, where both DNA and TF proteins experience continual chemical and structural perturbations, including epigenetic modifications, DNA damage, mechanical stress, and post-translational modifications (PTMs). While many of these factors impact TF-DNA binding interactions, understanding their effects remains challenging and incomplete. This review explores the existing literature on these dynamic changes and their potential impact on TF-DNA interactions.
Collapse
Affiliation(s)
- Yumi Minyi Yao
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Irina Miodownik
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Michael P O'Hagan
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Muhammad Jbara
- School of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Ariel Afek
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
2
|
von Ehr J, Oberstrass L, Yazgan E, Schnaubelt LI, Blümel N, McNicoll F, Weigand JE, Zarnack K, Müller-McNicoll M, Korn SM, Schlundt A. Arid5a uses disordered extensions of its core ARID domain for distinct DNA- and RNA-recognition and gene regulation. J Biol Chem 2024; 300:107457. [PMID: 38866324 PMCID: PMC11262183 DOI: 10.1016/j.jbc.2024.107457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 05/23/2024] [Accepted: 06/01/2024] [Indexed: 06/14/2024] Open
Abstract
AT-rich interacting domain (ARID)-containing proteins, Arids, are a heterogeneous DNA-binding protein family involved in transcription regulation and chromatin processing. For the member Arid5a, no exact DNA-binding preference has been experimentally defined so far. Additionally, the protein binds to mRNA motifs for transcript stabilization, supposedly through the DNA-binding ARID domain. To date, however, no unbiased RNA motif definition and clear dissection of nucleic acid-binding through the ARID domain have been undertaken. Using NMR-centered biochemistry, we here define the Arid5a DNA preference. Further, high-throughput in vitro binding reveals a consensus RNA-binding motif engaged by the core ARID domain. Finally, transcriptome-wide binding (iCLIP2) reveals that Arid5a has a weak preference for (A)U-rich regions in pre-mRNA transcripts of factors related to RNA processing. We find that the intrinsically disordered regions flanking the ARID domain modulate the specificity and affinity of DNA binding, while they appear crucial for RNA interactions. Ultimately, our data suggest that Arid5a uses its extended ARID domain for bifunctional gene regulation and that the involvement of IDR extensions is a more general feature of Arids in interacting with different nucleic acids at the chromatin-mRNA interface.
Collapse
Affiliation(s)
- Julian von Ehr
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Frankfurt, Germany; IMPRS on Cellular Biophysics, Frankfurt, Germany
| | - Lasse Oberstrass
- University of Marburg, Department of Pharmacy, Institute of Pharmaceutical Chemistry, Marburg, Germany
| | - Ege Yazgan
- Institute for Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany; Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Lara Ina Schnaubelt
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Frankfurt, Germany
| | - Nicole Blümel
- Institute for Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Francois McNicoll
- Institute for Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Julia E Weigand
- University of Marburg, Department of Pharmacy, Institute of Pharmaceutical Chemistry, Marburg, Germany
| | - Kathi Zarnack
- Institute for Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany; Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Michaela Müller-McNicoll
- Institute for Molecular Biosciences, Goethe University Frankfurt, Frankfurt, Germany; Max-Planck Institute for Biophysics, Frankfurt, Germany
| | - Sophie Marianne Korn
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Frankfurt, Germany; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, USA.
| | - Andreas Schlundt
- Institute for Molecular Biosciences and Biomolecular Resonance Center (BMRZ), Goethe University Frankfurt, Frankfurt, Germany; University of Greifswald, Institute of Biochemistry, Greifswald, Germany.
| |
Collapse
|
3
|
Hurieva B, Kumar DK, Morag R, Lupo O, Carmi M, Barkai N, Jonas F. Disordered sequences of transcription factors regulate genomic binding by integrating diverse sequence grammars and interaction types. Nucleic Acids Res 2024:gkae521. [PMID: 38908024 DOI: 10.1093/nar/gkae521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 04/25/2024] [Accepted: 06/19/2024] [Indexed: 06/24/2024] Open
Abstract
Intrinsically disordered regions (IDRs) guide transcription factors (TFs) to their genomic binding sites, raising the question of how structure-lacking regions encode for complex binding patterns. We investigated this using the TF Gln3, revealing sets of IDR-embedded determinants that direct Gln3 binding to respective groups of functionally related promoters, and enable tuning binding preferences between environmental conditions, phospho-mimicking mutations, and orthologs. Through targeted mutations, we defined the role of short linear motifs (SLiMs) and co-binding TFs (Hap2) in stabilizing Gln3 at respiration-chain promoters, while providing evidence that Gln3 binding at nitrogen-associated promoters is encoded by the IDR amino-acid composition, independent of SLiMs or co-binding TFs. Therefore, despite their apparent simplicity, TF IDRs can direct and regulate complex genomic binding patterns through a combination of SLiM-mediated and composition-encoded interactions.
Collapse
Affiliation(s)
- Bohdana Hurieva
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Divya Krishna Kumar
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Rotem Morag
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Offir Lupo
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Miri Carmi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
- School of Science, Constructor University, 28759 Bremen, Germany
| |
Collapse
|
4
|
Ginell GM, Emenecker RJ, Lotthammer JM, Usher ET, Holehouse AS. Direct prediction of intermolecular interactions driven by disordered regions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.03.597104. [PMID: 38895487 PMCID: PMC11185574 DOI: 10.1101/2024.06.03.597104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Intrinsically disordered regions (IDRs) are critical for a wide variety of cellular functions, many of which involve interactions with partner proteins. Molecular recognition is typically considered through the lens of sequence-specific binding events. However, a growing body of work has shown that IDRs often interact with partners in a manner that does not depend on the precise order of the amino acid order, instead driven by complementary chemical interactions leading to disordered bound-state complexes. Despite this emerging paradigm, we lack tools to describe, quantify, predict, and interpret these types of structurally heterogeneous interactions from the underlying amino acid sequences. Here, we repurpose the chemical physics developed originally for molecular simulations to develop an approach for predicting intermolecular interactions between IDRs and partner proteins. Our approach enables the direct prediction of phase diagrams, the identification of chemically-specific interaction hotspots on IDRs, and a route to develop and test mechanistic hypotheses regarding IDR function in the context of molecular recognition. We use our approach to examine a range of systems and questions to highlight its versatility and applicability.
Collapse
Affiliation(s)
- Garrett M. Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Ryan. J Emenecker
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Jeffrey M. Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Emery T. Usher
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO
- Center for Biomolecular Condensates (CBC), Washington University in St. Louis, St. Louis, MO
| |
Collapse
|
5
|
Ertl HA, Bayala EX, Siddiq MA, Wittkopp PJ. Divergence of Grainy head affects chromatin accessibility, gene expression, and embryonic viability in Drosophila melanogaster. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.07.588430. [PMID: 38645200 PMCID: PMC11030446 DOI: 10.1101/2024.04.07.588430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Pioneer factors are critical for gene regulation and development because they bind chromatin and make DNA more accessible for binding by other transcription factors. The pioneer factor Grainy head (Grh) is present across metazoans and has been shown to retain a role in epithelium development in fruit flies, nematodes, and mice despite extensive divergence in both amino acid sequence and length. Here, we investigate the evolution of Grh function by comparing the effects of the fly ( Drosophila melanogaster ) and worm ( Caenorhabditis elegans ) Grh orthologs on chromatin accessibility, gene expression, embryonic development, and viability in transgenic D. melanogaster . We found that the Caenorhabditis elegans ortholog rescued cuticle development but not full embryonic viability in Drosophila melanogaster grh null mutants. At the molecular level, the C. elegans ortholog only partially rescued chromatin accessibility and gene expression. Divergence in the disordered N-terminus of the Grh protein contributes to these differences in embryonic viability and molecular phenotypes. These data show how pioneer factors can diverge in sequence and function at the molecular level while retaining conserved developmental functions at the organismal level. SUMMARY STATEMENT Despite divergence in a disordered region that affects function at both molecular and organismal levels, the Caenorhabditis elegans Grainy head (Grh) protein rescued cuticle morphology in D. melanogaster embryos.
Collapse
|
6
|
Valyaeva AA, Sheval EV. Nonspecific Interactions in Transcription Regulation and Organization of Transcriptional Condensates. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:688-700. [PMID: 38831505 DOI: 10.1134/s0006297924040084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/19/2023] [Accepted: 11/20/2023] [Indexed: 06/05/2024]
Abstract
Eukaryotic cells are characterized by a high degree of compartmentalization of their internal contents, which ensures precise and controlled regulation of intracellular processes. During many processes, including different stages of transcription, dynamic membraneless compartments termed biomolecular condensates are formed. Transcription condensates contain various transcription factors and RNA polymerase and are formed by high- and low-specificity interactions between the proteins, DNA, and nearby RNA. This review discusses recent data demonstrating important role of nonspecific multivalent protein-protein and RNA-protein interactions in organization and regulation of transcription.
Collapse
Affiliation(s)
- Anna A Valyaeva
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119991, Russia.
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
- Department of Cell Biology and Histology, Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Eugene V Sheval
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
- Department of Cell Biology and Histology, Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
7
|
Shepherdson JL, Granas DM, Li J, Shariff Z, Plassmeyer SP, Holehouse AS, White MA, Cohen BA. Mutational scanning of CRX classifies clinical variants and reveals biochemical properties of the transcriptional effector domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.21.585809. [PMID: 38585983 PMCID: PMC10996540 DOI: 10.1101/2024.03.21.585809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Cone-Rod Homeobox, encoded by CRX, is a transcription factor (TF) essential for the terminal differentiation and maintenance of mammalian photoreceptors. Structurally, CRX comprises an ordered DNA-binding homeodomain and an intrinsically disordered transcriptional effector domain. Although a handful of human variants in CRX have been shown to cause several different degenerative retinopathies with varying cone and rod predominance, as with most human disease genes the vast majority of observed CRX genetic variants are uncharacterized variants of uncertain significance (VUS). We performed a deep mutational scan (DMS) of nearly all possible single amino acid substitution variants in CRX, using an engineered cell-based transcriptional reporter assay. We measured the ability of each CRX missense variant to transactivate a synthetic fluorescent reporter construct in a pooled fluorescence-activated cell sorting assay and compared the activation strength of each variant to that of wild-type CRX to compute an activity score, identifying thousands of variants with altered transcriptional activity. We calculated a statistical confidence for each activity score derived from multiple independent measurements of each variant marked by unique sequence barcodes, curating a high-confidence list of nearly 2,000 variants with significantly altered transcriptional activity compared to wild-type CRX. We evaluated the performance of the DMS assay as a clinical variant classification tool using gold-standard classified human variants from ClinVar, and determined that activity scores could be used to identify pathogenic variants with high specificity. That this performance could be achieved using a synthetic reporter assay in a foreign cell type, even for a highly cell type-specific TF like CRX, suggests that this approach shows promise for DMS of other TFs that function in cell types that are not easily accessible. Per-position average activity scores closely aligned to a predicted structure of the ordered homeodomain and demonstrated position-specific residue requirements. The intrinsically disordered transcriptional effector domain, by contrast, displayed a qualitatively different pattern of substitution effects, following compositional constraints without specific residue position requirements in the peptide chain. The observed compositional constraints of the effector domain were consistent with the acidic exposure model of transcriptional activation. Together, the results of the CRX DMS identify molecular features of the CRX effector domain and demonstrate clinical utility for variant classification.
Collapse
Affiliation(s)
- James L. Shepherdson
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - David M. Granas
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Jie Li
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Zara Shariff
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Stephen P. Plassmeyer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Alex S. Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Center for Biomolecular Condensates, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Michael A. White
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| | - Barak A. Cohen
- Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
- Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110
| |
Collapse
|
8
|
Mindel V, Brodsky S, Cohen A, Manadre W, Jonas F, Carmi M, Barkai N. Intrinsically disordered regions of the Msn2 transcription factor encode multiple functions using interwoven sequence grammars. Nucleic Acids Res 2024; 52:2260-2272. [PMID: 38109289 PMCID: PMC10954448 DOI: 10.1093/nar/gkad1191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/04/2023] [Accepted: 12/11/2023] [Indexed: 12/20/2023] Open
Abstract
Intrinsically disordered regions (IDRs) are abundant in eukaryotic proteins, but their sequence-function relationship remains poorly understood. IDRs of transcription factors (TFs) can direct promoter selection and recruit coactivators, as shown for the budding yeast TF Msn2. To examine how IDRs encode both these functions, we compared genomic binding specificity, coactivator recruitment, and gene induction amongst a large set of designed Msn2-IDR mutants. We find that both functions depend on multiple regions across the > 600AA IDR. Yet, transcription activity was readily disrupted by mutations that showed no effect on the Msn2 binding specificity. Our data attribute this differential sensitivity to the integration of a relaxed, composition-based code directing binding specificity with a more stringent, motif-based code controlling the recruitment of coactivators and transcription activity. Therefore, Msn2 utilizes interwoven sequence grammars for encoding multiple functions, suggesting a new IDR design paradigm of potentially general use.
Collapse
Affiliation(s)
- Vladimir Mindel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sagie Brodsky
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Aileen Cohen
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Wajd Manadre
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Miri Carmi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
9
|
Weinmann R, Frank L, Rippe K. Approaches to characterize chromatin subcompartment organization in the cell nucleus. Curr Opin Struct Biol 2023; 83:102695. [PMID: 37722292 DOI: 10.1016/j.sbi.2023.102695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/05/2023] [Accepted: 08/07/2023] [Indexed: 09/20/2023]
Abstract
The mechanism of self-organization of chromatin subcompartments on the 0.1-1 μm scale and their impact on genome-associated activities has long been a key aspect of research on nuclear organization. Understanding the underlying structure-function relationship, however, remains challenging due to the complex hierarchical structure of chromatin and the polymorphic organization of subcompartments that assemble around it. Towards this goal, approaches to measure local properties and compositional dynamics of chromatin in its endogenous cellular environment are instrumental. Here, we discuss recent advancements in studying these features and their functional implications in protein and RNA enrichment and genome accessibility.
Collapse
Affiliation(s)
- Robin Weinmann
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, Germany; Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, Germany; Faculty of Biosciences, Heidelberg University, Germany
| | - Lukas Frank
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, Germany; Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, Germany
| | - Karsten Rippe
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, Germany; Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, Germany.
| |
Collapse
|