1
|
Reuter LM, Khadayate SP, Mossler A, Liebl K, Faull SV, Karimi MM, Speck C. MCM2-7 loading-dependent ORC release ensures genome-wide origin licensing. Nat Commun 2024; 15:7306. [PMID: 39181881 PMCID: PMC11344781 DOI: 10.1038/s41467-024-51538-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/09/2024] [Indexed: 08/27/2024] Open
Abstract
Origin recognition complex (ORC)-dependent loading of the replicative helicase MCM2-7 onto replication origins in G1-phase forms the basis of replication fork establishment in S-phase. However, how ORC and MCM2-7 facilitate genome-wide DNA licensing is not fully understood. Mapping the molecular footprints of budding yeast ORC and MCM2-7 genome-wide, we discovered that MCM2-7 loading is associated with ORC release from origins and redistribution to non-origin sites. Our bioinformatic analysis revealed that origins are compact units, where a single MCM2-7 double hexamer blocks repetitive loading through steric ORC binding site occlusion. Analyses of A-elements and an improved B2-element consensus motif uncovered that DNA shape, DNA flexibility, and the correct, face-to-face spacing of the two DNA elements are hallmarks of ORC-binding and efficient helicase loading sites. Thus, our work identified fundamental principles for MCM2-7 helicase loading that explain how origin licensing is realised across the genome.
Collapse
Affiliation(s)
- L Maximilian Reuter
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom.
- Institute of Molecular Biology (IMB) gGmbH, Ackermannweg 4, Mainz, Germany.
| | | | - Audrey Mossler
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Korbinian Liebl
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, IL, USA
| | - Sarah V Faull
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Mohammad M Karimi
- MRC London Institute of Medical Sciences (LMS), London, United Kingdom
- Comprehensive Cancer Centre, School of Cancer & Pharmaceutical Sciences, Faculty of Life Sciences & Medicine, King's College London, London, United Kingdom
| | - Christian Speck
- DNA Replication Group, Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom.
- MRC London Institute of Medical Sciences (LMS), London, United Kingdom.
| |
Collapse
|
2
|
Oyarzún-Cisterna A, Gidi C, Raiqueo F, Amigo R, Rivas C, Torrejón M, Gutiérrez JL. General regulatory factors exert differential effects on nucleosome sliding activity of the ISW1a complex. Biol Res 2024; 57:22. [PMID: 38704609 PMCID: PMC11069190 DOI: 10.1186/s40659-024-00500-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024] Open
Abstract
BACKGROUND Chromatin dynamics is deeply involved in processes that require access to DNA, such as transcriptional regulation. Among the factors involved in chromatin dynamics at gene regulatory regions are general regulatory factors (GRFs). These factors contribute to establishment and maintenance of nucleosome-depleted regions (NDRs). These regions are populated by nucleosomes through histone deposition and nucleosome sliding, the latter catalyzed by a number of ATP-dependent chromatin remodeling complexes, including ISW1a. It has been observed that GRFs can act as barriers against nucleosome sliding towards NDRs. However, the relative ability of the different GRFs to hinder sliding activity is currently unknown. RESULTS Considering this, we performed a comparative analysis for the main GRFs, with focus in their ability to modulate nucleosome sliding mediated by ISW1a. Among the GRFs tested in nucleosome remodeling assays, Rap1 was the only factor displaying the ability to hinder the activity of ISW1a. This effect requires location of the Rap1 cognate sequence on linker that becomes entry DNA in the nucleosome remodeling process. In addition, Rap1 was able to hinder nucleosome assembly in octamer transfer assays. Concurrently, Rap1 displayed the highest affinity for and longest dwell time from its target sequence, compared to the other GRFs tested. Consistently, through bioinformatics analyses of publicly available genome-wide data, we found that nucleosome occupancy and histone deposition in vivo are inversely correlated with the affinity of Rap1 for its target sequences in the genome. CONCLUSIONS Our findings point to DNA binding affinity, residence time and location at particular translational positions relative to the nucleosome core as the key features of GRFs underlying their roles played in nucleosome sliding and assembly.
Collapse
Affiliation(s)
- Andrea Oyarzún-Cisterna
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - Cristián Gidi
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - Fernanda Raiqueo
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - Roberto Amigo
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - Camila Rivas
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - Marcela Torrejón
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile
| | - José L Gutiérrez
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, 4070043, Concepción, Chile.
| |
Collapse
|
3
|
Karollus A, Hingerl J, Gankin D, Grosshauser M, Klemon K, Gagneur J. Species-aware DNA language models capture regulatory elements and their evolution. Genome Biol 2024; 25:83. [PMID: 38566111 PMCID: PMC10985990 DOI: 10.1186/s13059-024-03221-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 03/20/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND The rise of large-scale multi-species genome sequencing projects promises to shed new light on how genomes encode gene regulatory instructions. To this end, new algorithms are needed that can leverage conservation to capture regulatory elements while accounting for their evolution. RESULTS Here, we introduce species-aware DNA language models, which we trained on more than 800 species spanning over 500 million years of evolution. Investigating their ability to predict masked nucleotides from context, we show that DNA language models distinguish transcription factor and RNA-binding protein motifs from background non-coding sequence. Owing to their flexibility, DNA language models capture conserved regulatory elements over much further evolutionary distances than sequence alignment would allow. Remarkably, DNA language models reconstruct motif instances bound in vivo better than unbound ones and account for the evolution of motif sequences and their positional constraints, showing that these models capture functional high-order sequence and evolutionary context. We further show that species-aware training yields improved sequence representations for endogenous and MPRA-based gene expression prediction, as well as motif discovery. CONCLUSIONS Collectively, these results demonstrate that species-aware DNA language models are a powerful, flexible, and scalable tool to integrate information from large compendia of highly diverged genomes.
Collapse
Affiliation(s)
- Alexander Karollus
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Johannes Hingerl
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Dennis Gankin
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Martin Grosshauser
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Kristian Klemon
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Julien Gagneur
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Munich Center for Machine Learning, Munich, Germany.
- Institute of Human Genetics, School of Medicine and Health, Technical University of Munich, Munich, Germany.
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany.
- Munich Data Science Institute, Technical University of Munich, Garching, Germany.
| |
Collapse
|
4
|
Azhar M, Xu C, Jiang X, Li W, Cao Y, Zhu X, Xing X, Wu L, Zou J, Meng L, Cheng Y, Han W, Bao J. The arginine methyltransferase Prmt1 coordinates the germline arginine methylome essential for spermatogonial homeostasis and male fertility. Nucleic Acids Res 2023; 51:10428-10450. [PMID: 37739418 PMCID: PMC10602896 DOI: 10.1093/nar/gkad769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 08/30/2023] [Accepted: 09/08/2023] [Indexed: 09/24/2023] Open
Abstract
Arginine methylation, catalyzed by the protein arginine methyltransferases (PRMTs), is a common post-translational protein modification (PTM) that is engaged in a plethora of biological events. However, little is known about how the methylarginine-directed signaling functions in germline development. In this study, we discover that Prmt1 is predominantly distributed in the nuclei of spermatogonia but weakly in the spermatocytes throughout mouse spermatogenesis. By exploiting a combination of three Cre-mediated Prmt1 knockout mouse lines, we unravel that Prmt1 is essential for spermatogonial establishment and maintenance, and that Prmt1-catalyzed asymmetric methylarginine coordinates inherent transcriptional homeostasis within spermatogonial cells. In conjunction with high-throughput CUT&Tag profiling and modified mini-bulk Smart-seq2 analyses, we unveil that the Prmt1-deposited H4R3me2a mark is permissively enriched at promoter and exon/intron regions, and sculpts a distinctive transcriptomic landscape as well as the alternative splicing pattern, in the mouse spermatogonia. Collectively, our study provides the genetic and mechanistic evidence that connects the Prmt1-deposited methylarginine signaling to the establishment and maintenance of a high-fidelity transcriptomic identity in orchestrating spermatogonial development in the mammalian germline.
Collapse
Affiliation(s)
- Muhammad Azhar
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Caoling Xu
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Xue Jiang
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Wenqing Li
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Yuzhu Cao
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Xiaoli Zhu
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Xuemei Xing
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
| | - Limin Wu
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
| | - Jiaqi Zou
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Lan Meng
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Yu Cheng
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Wenjie Han
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| | - Jianqiang Bao
- Department of Obstetrics and Gynecology, Reproductive and Genetic Hospital, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hefei National Laboratory for Physical Sciences at Microscale, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China (USTC), Anhui, China
| |
Collapse
|
5
|
Alexandari AM, Horton CA, Shrikumar A, Shah N, Li E, Weilert M, Pufall MA, Zeitlinger J, Fordyce PM, Kundaje A. De novo distillation of thermodynamic affinity from deep learning regulatory sequence models of in vivo protein-DNA binding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.11.540401. [PMID: 37214836 PMCID: PMC10197627 DOI: 10.1101/2023.05.11.540401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Transcription factors (TF) are proteins that bind DNA in a sequence-specific manner to regulate gene transcription. Despite their unique intrinsic sequence preferences, in vivo genomic occupancy profiles of TFs differ across cellular contexts. Hence, deciphering the sequence determinants of TF binding, both intrinsic and context-specific, is essential to understand gene regulation and the impact of regulatory, non-coding genetic variation. Biophysical models trained on in vitro TF binding assays can estimate intrinsic affinity landscapes and predict occupancy based on TF concentration and affinity. However, these models cannot adequately explain context-specific, in vivo binding profiles. Conversely, deep learning models, trained on in vivo TF binding assays, effectively predict and explain genomic occupancy profiles as a function of complex regulatory sequence syntax, albeit without a clear biophysical interpretation. To reconcile these complementary models of in vitro and in vivo TF binding, we developed Affinity Distillation (AD), a method that extracts thermodynamic affinities de-novo from deep learning models of TF chromatin immunoprecipitation (ChIP) experiments by marginalizing away the influence of genomic sequence context. Applied to neural networks modeling diverse classes of yeast and mammalian TFs, AD predicts energetic impacts of sequence variation within and surrounding motifs on TF binding as measured by diverse in vitro assays with superior dynamic range and accuracy compared to motif-based methods. Furthermore, AD can accurately discern affinities of TF paralogs. Our results highlight thermodynamic affinity as a key determinant of in vivo binding, suggest that deep learning models of in vivo binding implicitly learn high-resolution affinity landscapes, and show that these affinities can be successfully distilled using AD. This new biophysical interpretation of deep learning models enables high-throughput in silico experiments to explore the influence of sequence context and variation on both intrinsic affinity and in vivo occupancy.
Collapse
Affiliation(s)
- Amr M. Alexandari
- Department of Computer Science, Stanford University, Stanford, CA 94305
| | | | - Avanti Shrikumar
- Department of Earth System Science, Stanford University, Stanford, CA 94305
| | - Nilay Shah
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Eileen Li
- Department of Genetics, Stanford University, Stanford, CA 94305
| | - Melanie Weilert
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Miles A. Pufall
- Department of Biochemistry, Carver College of Medicine, University of Iowa, Iowa City, Iowa 52242, USA
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA
- The University of Kansas Medical Center, Kansas City, KS, USA
| | - Polly M. Fordyce
- Department of Genetics, Stanford University, Stanford, CA 94305
- Department of Bioengineering, Stanford University, Stanford, CA 94305
- ChEM-H Institute, Stanford University, Stanford, CA 94305
- Chan Zuckerberg Biohub, San Francisco, CA 94110
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA 94305
- Department of Genetics, Stanford University, Stanford, CA 94305
| |
Collapse
|
6
|
Donovan BT, Chen H, Eek P, Meng Z, Jipa C, Tan S, Bai L, Poirier MG. Basic helix-loop-helix pioneer factors interact with the histone octamer to invade nucleosomes and generate nucleosome-depleted regions. Mol Cell 2023; 83:1251-1263.e6. [PMID: 36996811 PMCID: PMC10182836 DOI: 10.1016/j.molcel.2023.03.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 01/13/2023] [Accepted: 03/06/2023] [Indexed: 03/31/2023]
Abstract
Nucleosomes drastically limit transcription factor (TF) occupancy, while pioneer transcription factors (PFs) somehow circumvent this nucleosome barrier. In this study, we compare nucleosome binding of two conserved S. cerevisiae basic helix-loop-helix (bHLH) TFs, Cbf1 and Pho4. A cryo-EM structure of Cbf1 in complex with the nucleosome reveals that the Cbf1 HLH region can electrostatically interact with exposed histone residues within a partially unwrapped nucleosome. Single-molecule fluorescence studies show that the Cbf1 HLH region facilitates efficient nucleosome invasion by slowing its dissociation rate relative to DNA through interactions with histones, whereas the Pho4 HLH region does not. In vivo studies show that this enhanced binding provided by the Cbf1 HLH region enables nucleosome invasion and ensuing repositioning. These structural, single-molecule, and in vivo studies reveal the mechanistic basis of dissociation rate compensation by PFs and how this translates to facilitating chromatin opening inside cells.
Collapse
Affiliation(s)
- Benjamin T Donovan
- Biophysics Graduate Program, The Ohio State University, Columbus, OH 43210, USA
| | - Hengye Chen
- Department of Biochemistry and Molecular Biology, Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802, USA
| | - Priit Eek
- Department of Biochemistry and Molecular Biology, Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802, USA
| | - Zhiyuan Meng
- Biophysics Graduate Program, The Ohio State University, Columbus, OH 43210, USA
| | - Caroline Jipa
- Department of Physics, The Ohio State University, Columbus, OH 43210, USA
| | - Song Tan
- Department of Biochemistry and Molecular Biology, Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802, USA; Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA.
| | - Lu Bai
- Department of Biochemistry and Molecular Biology, Center for Eukaryotic Gene Regulation, The Pennsylvania State University, University Park, PA 16802, USA; Department of Physics, The Pennsylvania State University, University Park, PA 16802, USA.
| | - Michael G Poirier
- Biophysics Graduate Program, The Ohio State University, Columbus, OH 43210, USA; Department of Physics, The Ohio State University, Columbus, OH 43210, USA; Department of Chemistry & Biochemistry, The Ohio State University, Columbus, OH 43210, USA.
| |
Collapse
|
7
|
Barissi S, Sala A, Wieczór M, Battistini F, Orozco M. DNAffinity: a machine-learning approach to predict DNA binding affinities of transcription factors. Nucleic Acids Res 2022; 50:9105-9114. [PMID: 36018808 PMCID: PMC9458447 DOI: 10.1093/nar/gkac708] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 07/21/2022] [Accepted: 08/08/2022] [Indexed: 12/24/2022] Open
Abstract
We present a physics-based machine learning approach to predict in vitro transcription factor binding affinities from structural and mechanical DNA properties directly derived from atomistic molecular dynamics simulations. The method is able to predict affinities obtained with techniques as different as uPBM, gcPBM and HT-SELEX with an excellent performance, much better than existing algorithms. Due to its nature, the method can be extended to epigenetic variants, mismatches, mutations, or any non-coding nucleobases. When complemented with chromatin structure information, our in vitro trained method provides also good estimates of in vivo binding sites in yeast.
Collapse
Affiliation(s)
| | | | - Miłosz Wieczór
- Institute for Research in Biomedicine (IRB Barcelona). The Barcelona Institute of Science and Technology. Baldiri Reixac 10–12, 08028 Barcelona, Spain,Department of Physical Chemistry. Gdansk University of Technology, 80-233 Gdańsk, Poland
| | | | - Modesto Orozco
- Correspondence may also be addressed to Modesto Orozco. Tel: +34 934 037 156;
| |
Collapse
|
8
|
Kang Y, Jung WJ, Brent MR. Predicting which genes will respond to transcription factor perturbations. G3 (BETHESDA, MD.) 2022; 12:jkac144. [PMID: 35666184 PMCID: PMC9339286 DOI: 10.1093/g3journal/jkac144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 05/25/2022] [Indexed: 11/13/2022]
Abstract
The ability to predict which genes will respond to the perturbation of a transcription factor serves as a benchmark for our systems-level understanding of transcriptional regulatory networks. In previous work, machine learning models have been trained to predict static gene expression levels in a biological sample by using data from the same or similar samples, including data on their transcription factor binding locations, histone marks, or DNA sequence. We report on a different challenge-training machine learning models to predict which genes will respond to the perturbation of a transcription factor without using any data from the perturbed cells. We find that existing transcription factor location data (ChIP-seq) from human cells have very little detectable utility for predicting which genes will respond to perturbation of a transcription factor. Features of genes, including their preperturbation expression level and expression variation, are very useful for predicting responses to perturbation of any transcription factor. This shows that some genes are poised to respond to transcription factor perturbations and others are resistant, shedding light on why it has been so difficult to predict responses from binding locations. Certain histone marks, including H3K4me1 and H3K4me3, have some predictive power when located downstream of the transcription start site. However, the predictive power of histone marks is much less than that of gene expression level and expression variation. Sequence-based or epigenetic properties of genes strongly influence their tendency to respond to direct transcription factor perturbations, partially explaining the oft-noted difficulty of predicting responsiveness from transcription factor binding location data. These molecular features are largely reflected in and summarized by the gene's expression level and expression variation. Code is available at https://github.com/BrentLab/TFPertRespExplainer.
Collapse
Affiliation(s)
- Yiming Kang
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA
- Department of Computer Science and Engineering, Washington University, St. Louis, MO 63108, USA
| | - Wooseok J Jung
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA
- Department of Computer Science and Engineering, Washington University, St. Louis, MO 63108, USA
| | - Michael R Brent
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA
- Department of Computer Science and Engineering, Washington University, St. Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|
9
|
Krieger G, Lupo O, Wittkopp P, Barkai N. Evolution of transcription factor binding through sequence variations and turnover of binding sites. Genome Res 2022; 32:1099-1111. [PMID: 35618416 PMCID: PMC9248875 DOI: 10.1101/gr.276715.122] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 05/20/2022] [Indexed: 01/08/2023]
Abstract
Variations in noncoding regulatory sequences play a central role in evolution. Interpreting such variations, however, remains difficult even in the context of defined attributes such as transcription factor (TF) binding sites. Here, we systematically link variations in cis-regulatory sequences to TF binding by profiling the allele-specific binding of 27 TFs expressed in a yeast hybrid, in which two related genomes are present within the same nucleus. TFs localize preferentially to sites containing their known consensus motifs but occupy only a small fraction of the motif-containing sites available within the genomes. Differential binding of TFs to the orthologous alleles was well explained by variations that alter motif sequence, whereas differences in chromatin accessibility between alleles were of little apparent effect. Motif variations that abolished binding when present in only one allele were still bound when present in both alleles, suggesting evolutionary compensation, with a potential role for sequence conservation at the motif's vicinity. At the level of the full promoter, we identify cases of binding-site turnover, in which binding sites are reciprocally gained and lost, yet most interspecific differences remained uncompensated. Our results show the flexibility of TFs to bind imprecise motifs and the fast evolution of TF binding sites between related species.
Collapse
Affiliation(s)
- Gat Krieger
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Offir Lupo
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Patricia Wittkopp
- Department of Ecology and Evolutionary Biology, Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
10
|
Karl LA, Peritore M, Galanti L, Pfander B. DNA Double Strand Break Repair and Its Control by Nucleosome Remodeling. Front Genet 2022; 12:821543. [PMID: 35096025 PMCID: PMC8790285 DOI: 10.3389/fgene.2021.821543] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 12/23/2021] [Indexed: 12/12/2022] Open
Abstract
DNA double strand breaks (DSBs) are repaired in eukaryotes by one of several cellular mechanisms. The decision-making process controlling DSB repair takes place at the step of DNA end resection, the nucleolytic processing of DNA ends, which generates single-stranded DNA overhangs. Dependent on the length of the overhang, a corresponding DSB repair mechanism is engaged. Interestingly, nucleosomes-the fundamental unit of chromatin-influence the activity of resection nucleases and nucleosome remodelers have emerged as key regulators of DSB repair. Nucleosome remodelers share a common enzymatic mechanism, but for global genome organization specific remodelers have been shown to exert distinct activities. Specifically, different remodelers have been found to slide and evict, position or edit nucleosomes. It is an open question whether the same remodelers exert the same function also in the context of DSBs. Here, we will review recent advances in our understanding of nucleosome remodelers at DSBs: to what extent nucleosome sliding, eviction, positioning and editing can be observed at DSBs and how these activities affect the DSB repair decision.
Collapse
Affiliation(s)
- Leonhard Andreas Karl
- Resarch Group DNA Replication and Genome Integrity, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Martina Peritore
- Resarch Group DNA Replication and Genome Integrity, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Lorenzo Galanti
- Resarch Group DNA Replication and Genome Integrity, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Boris Pfander
- Resarch Group DNA Replication and Genome Integrity, Max Planck Institute of Biochemistry, Martinsried, Germany
| |
Collapse
|
11
|
Li M, Hu Q, Collins G, Parida M, Ball CB, Price DH, Meier JL. Cytomegalovirus late transcription factor target sequence diversity orchestrates viral early to late transcription. PLoS Pathog 2021; 17:e1009796. [PMID: 34339482 PMCID: PMC8360532 DOI: 10.1371/journal.ppat.1009796] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 08/12/2021] [Accepted: 07/12/2021] [Indexed: 11/23/2022] Open
Abstract
Beta- and gammaherpesviruses late transcription factors (LTFs) target viral promoters containing a TATT sequence to drive transcription after viral DNA replication has begun. Human cytomegalovirus (HCMV), a betaherpesvirus, uses the UL87 LTF to bind both TATT and host RNA polymerase II (Pol II), whereas the UL79 LTF has been suggested to drive productive elongation. Here we apply integrated functional genomics (dTag system, PRO-Seq, ChIP-Seq, and promoter function assays) to uncover the contribution of diversity in LTF target sequences in determining degree and scope to which LTFs drive viral transcription. We characterize the DNA sequence patterns in LTF-responsive and -unresponsive promoter populations, determine where and when Pol II initiates transcription, identify sites of LTF binding genome-wide, and quantify change in nascent transcripts from individual promoters in relation to core promoter sequences, LTF loss, stage of infection, and viral DNA replication. We find that HCMV UL79 and UL87 LTFs function concordantly to initiate transcription from over half of all active viral promoters in late infection, while not appreciably affecting host transcription. Both LTFs act on and bind to viral early-late and late kinetic-class promoters. Over one-third of these core promoters lack the TATT and instead have a TATAT, TGTT, or YRYT. The TATT and non-TATT motifs are part of a sequence block with a sequence code that correlates with promoter transcription level. LTF occupancy of a TATATA palindrome shared by back-to-back promoters is linked to bidirectional transcription. We conclude that diversity in LTF target sequences shapes the LTF-transformative program that drives the viral early-to-late transcription switch. Herpesviruses have a group of genes earmarked for expression late in the infection. Beta- and gammaherpesviruses utilize a six-member set of viral late transcription factors to selectively activate these genes by binding to a DNA sequence signature in gene promoters. We made an unexpected discovery that a wider range of differences in sequence signatures configures the late gene expression program for human cytomegalovirus, a beta-herpesvirus of global public health importance. Diversity in signature patterns expands promoter targets and probably pre-sets amount of individual promoter output. A unique palindromic sequence signature is linked to the activation of back-to-back promoters at multiple locations in the viral genome. We deduce that diversity in late transcription factor targets functionally orchestrates the rollout of a productive late-stage infection. This may be a generalizable feature adopted by beta- and gammaherpesvirus subfamilies.
Collapse
Affiliation(s)
- Ming Li
- Iowa City Veterans Affairs Health Care System, Iowa City, Iowa, United States of America
- Department of Internal Medicine University of Iowa, Iowa City, Iowa, United States of America
| | - Qiaolin Hu
- Iowa City Veterans Affairs Health Care System, Iowa City, Iowa, United States of America
- Department of Internal Medicine University of Iowa, Iowa City, Iowa, United States of America
| | - Geoffrey Collins
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
| | - Mrutyunjaya Parida
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
| | - Christopher B. Ball
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
| | - David H. Price
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, United States of America
| | - Jeffery L. Meier
- Iowa City Veterans Affairs Health Care System, Iowa City, Iowa, United States of America
- Department of Internal Medicine University of Iowa, Iowa City, Iowa, United States of America
- Department of Epidemiology, University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
12
|
Transcriptional control of ribosome biogenesis in yeast: links to growth and stress signals. Biochem Soc Trans 2021; 49:1589-1599. [PMID: 34240738 PMCID: PMC8421047 DOI: 10.1042/bst20201136] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/15/2022]
Abstract
Ribosome biogenesis requires prodigious transcriptional output in rapidly growing yeast cells and is highly regulated in response to both growth and stress signals. This minireview focuses on recent developments in our understanding of this regulatory process, with an emphasis on the 138 ribosomal protein genes (RPGs) themselves and a group of >200 ribosome biogenesis (RiBi) genes whose products contribute to assembly but are not part of the ribosome. Expression of most RPGs depends upon Rap1, a pioneer transcription factor (TF) required for the binding of a pair of RPG-specific TFs called Fhl1 and Ifh1. RPG expression is correlated with Ifh1 promoter binding, whereas Rap1 and Fhl1 remain promoter-associated upon stress-induced down regulation. A TF called Sfp1 has also been implicated in RPG regulation, though recent work reveals that its primary function is in activation of RiBi and other growth-related genes. Sfp1 plays an important regulatory role at a small number of RPGs where Rap1–Fhl1–Ifh1 action is subsidiary or non-existent. In addition, nearly half of all RPGs are bound by Hmo1, which either stabilizes or re-configures Fhl1–Ifh1 binding. Recent studies identified the proline rotamase Fpr1, known primarily for its role in rapamycin-mediated inhibition of the TORC1 kinase, as an additional TF at RPG promoters. Fpr1 also affects Fhl1–Ifh1 binding, either independently or in cooperation with Hmo1. Finally, a major recent development was the discovery of a protein homeostasis mechanism driven by unassembled ribosomal proteins, referred to as the Ribosome Assembly Stress Response (RASTR), that controls RPG transcription through the reversible condensation of Ifh1.
Collapse
|
13
|
Oberbeckmann E, Niebauer V, Watanabe S, Farnung L, Moldt M, Schmid A, Cramer P, Peterson CL, Eustermann S, Hopfner KP, Korber P. Ruler elements in chromatin remodelers set nucleosome array spacing and phasing. Nat Commun 2021; 12:3232. [PMID: 34050140 PMCID: PMC8163753 DOI: 10.1038/s41467-021-23015-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 03/13/2021] [Indexed: 01/09/2023] Open
Abstract
Arrays of regularly spaced nucleosomes dominate chromatin and are often phased by alignment to reference sites like active promoters. How the distances between nucleosomes (spacing), and between phasing sites and nucleosomes are determined remains unclear, and specifically, how ATP-dependent chromatin remodelers impact these features. Here, we used genome-wide reconstitution to probe how Saccharomyces cerevisiae ATP-dependent remodelers generate phased arrays of regularly spaced nucleosomes. We find that remodelers bear a functional element named the 'ruler' that determines spacing and phasing in a remodeler-specific way. We use structure-based mutagenesis to identify and tune the ruler element residing in the Nhp10 and Arp8 modules of the INO80 remodeler complex. Generally, we propose that a remodeler ruler regulates nucleosome sliding direction bias in response to (epi)genetic information. This finally conceptualizes how remodeler-mediated nucleosome dynamics determine stable steady-state nucleosome positioning relative to other nucleosomes, DNA bound factors, DNA ends and DNA sequence elements.
Collapse
Affiliation(s)
- Elisa Oberbeckmann
- Division of Molecular Biology, Biomedical Center (BMC), Faculty of Medicine, LMU Munich, Martinsried, Germany
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Vanessa Niebauer
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
- Department of Biochemistry, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Shinya Watanabe
- Program of Molecular Medicine, University of Massachusetts, Worcester, MA, USA
| | - Lucas Farnung
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
- Department of Cell Biology, Blavatnik Institute, Harvard Medical School, Boston, USA
| | - Manuela Moldt
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
- Department of Biochemistry, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Andrea Schmid
- Division of Molecular Biology, Biomedical Center (BMC), Faculty of Medicine, LMU Munich, Martinsried, Germany
| | - Patrick Cramer
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Craig L Peterson
- Program of Molecular Medicine, University of Massachusetts, Worcester, MA, USA
| | - Sebastian Eustermann
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany.
- Department of Biochemistry, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany.
- European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Heidelberg, Germany.
| | - Karl-Peter Hopfner
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany.
- Department of Biochemistry, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany.
| | - Philipp Korber
- Division of Molecular Biology, Biomedical Center (BMC), Faculty of Medicine, LMU Munich, Martinsried, Germany.
| |
Collapse
|
14
|
Genome information processing by the INO80 chromatin remodeler positions nucleosomes. Nat Commun 2021; 12:3231. [PMID: 34050142 PMCID: PMC8163841 DOI: 10.1038/s41467-021-23016-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 04/07/2021] [Indexed: 11/26/2022] Open
Abstract
The fundamental molecular determinants by which ATP-dependent chromatin remodelers organize nucleosomes across eukaryotic genomes remain largely elusive. Here, chromatin reconstitutions on physiological, whole-genome templates reveal how remodelers read and translate genomic information into nucleosome positions. Using the yeast genome and the multi-subunit INO80 remodeler as a paradigm, we identify DNA shape/mechanics encoded signature motifs as sufficient for nucleosome positioning and distinct from known DNA sequence preferences of histones. INO80 processes such information through an allosteric interplay between its core- and Arp8-modules that probes mechanical properties of nucleosomal and linker DNA. At promoters, INO80 integrates this readout of DNA shape/mechanics with a readout of co-evolved sequence motifs via interaction with general regulatory factors bound to these motifs. Our findings establish a molecular mechanism for robust and yet adjustable +1 nucleosome positioning and, more generally, remodelers as information processing hubs that enable active organization and allosteric regulation of the first level of chromatin. DNA sequence preferences or statistical positioning of histones has not explained genomic patterns of nucleosome organisation in vivo. Here, the authors establish DNA shape/mechanics as key elements that have evolved together with binding sites of DNA sequence-specific barriers so that such information directs nucleosome positioning by chromatin remodelers.
Collapse
|
15
|
An all-to-all approach to the identification of sequence-specific readers for epigenetic DNA modifications on cytosine. Nat Commun 2021; 12:795. [PMID: 33542217 PMCID: PMC7862700 DOI: 10.1038/s41467-021-20950-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 12/14/2020] [Indexed: 12/31/2022] Open
Abstract
Epigenetic modifications of DNA play important roles in many biological processes. Identifying readers of these epigenetic marks is a critical step towards understanding the underlying mechanisms. Here, we present an all-to-all approach, dubbed digital affinity profiling via proximity ligation (DAPPL), to simultaneously profile human TF-DNA interactions using mixtures of random DNA libraries carrying different epigenetic modifications (i.e., 5-methylcytosine, 5-hydroxymethylcytosine, 5-formylcytosine, and 5-carboxylcytosine) on CpG dinucleotides. Many proteins that recognize consensus sequences carrying these modifications in symmetric and/or hemi-modified forms are identified. We further demonstrate that the modifications in different sequence contexts could either enhance or suppress TF binding activity. Moreover, many modifications can affect TF binding specificity. Furthermore, symmetric modifications show a stronger effect in either enhancing or suppressing TF-DNA interactions than hemi-modifications. Finally, in vivo evidence suggests that USF1 and USF2 might regulate transcription via hydroxymethylcytosine-binding activity in weak enhancers in human embryonic stem cells. Identifying readers of epigenetic marks is a critical step for understanding the role of epigenetic marks in biology. Here, the authors applied DAPPL, an all-to-all approach to profile the interactions between TFs and epigenetic modified DNA libraries.
Collapse
|
16
|
Ash1 and Tup1 dependent repression of the Saccharomyces cerevisiae HO promoter requires activator-dependent nucleosome eviction. PLoS Genet 2020; 16:e1009133. [PMID: 33382702 PMCID: PMC7806131 DOI: 10.1371/journal.pgen.1009133] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 01/13/2021] [Accepted: 11/25/2020] [Indexed: 11/30/2022] Open
Abstract
Transcriptional regulation of the Saccharomyces cerevisiae HO gene is highly complex, requiring a balance of multiple activating and repressing factors to ensure that only a few transcripts are produced in mother cells within a narrow window of the cell cycle. Here, we show that the Ash1 repressor associates with two DNA sequences that are usually concealed within nucleosomes in the HO promoter and recruits the Tup1 corepressor and the Rpd3 histone deacetylase, both of which are required for full repression in daughters. Genome-wide ChIP identified greater than 200 additional sites of co-localization of these factors, primarily within large, intergenic regions from which they could regulate adjacent genes. Most Ash1 binding sites are in nucleosome depleted regions (NDRs), while a small number overlap nucleosomes, similar to HO. We demonstrate that Ash1 binding to the HO promoter does not occur in the absence of the Swi5 transcription factor, which recruits coactivators that evict nucleosomes, including the nucleosomes obscuring the Ash1 binding sites. In the absence of Swi5, artificial nucleosome depletion allowed Ash1 to bind, demonstrating that nucleosomes are inhibitory to Ash1 binding. The location of binding sites within nucleosomes may therefore be a mechanism for limiting repressive activity to periods of nucleosome eviction that are otherwise associated with activation of the promoter. Our results illustrate that activation and repression can be intricately connected, and events set in motion by an activator may also ensure the appropriate level of repression and reset the promoter for the next activation cycle. Nucleosomes inhibit both gene expression and DNA-binding by regulatory factors. Here we examine the role of nucleosomes in regulating the binding of repressive transcription factors to the complex promoter for the yeast HO gene. Ash1 is a sequence-specific DNA-binding protein, and we show that it recruits the Tup1 global repressive factor to the HO promoter. Using a method to determine where Ash1 and Tup1 are bound to DNA throughout the genome, we discovered that Tup1 is also present at most places where Ash1 binds. The majority of these sites are in “Nucleosome Depleted Regions,” or NDRs, where the absence of chromatin makes factor binding easier. We discovered that the HO promoter is an exception, in that the two places where Ash1 binds overlap nucleosomes. Activation of the HO promoter is a complex, multi-step process, and we demonstrated that chromatin factors transiently evict these nucleosomes from the HO promoter during the cell cycle, allowing Ash1 to bind and recruit Tup1. Thus, activators must evict nucleosomes from the promoter to allow the repressive machinery to bind.
Collapse
|
17
|
Zencir S, Dilg D, Rueda MP, Shore D, Albert B. Mechanisms coordinating ribosomal protein gene transcription in response to stress. Nucleic Acids Res 2020; 48:11408-11420. [PMID: 33084907 PMCID: PMC7672434 DOI: 10.1093/nar/gkaa852] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/17/2020] [Accepted: 10/11/2020] [Indexed: 11/14/2022] Open
Abstract
While expression of ribosomal protein genes (RPGs) in the budding yeast has been extensively studied, a longstanding enigma persists regarding their co-regulation under fluctuating growth conditions. Most RPG promoters display one of two distinct arrangements of a core set of transcription factors (TFs) and are further differentiated by the presence or absence of the HMGB protein Hmo1. However, a third group of promoters appears not to be bound by any of these proteins, raising the question of how the whole suite of genes is co-regulated. We demonstrate here that all RPGs are regulated by two distinct, but complementary mechanisms driven by the TFs Ifh1 and Sfp1, both of which are required for maximal expression in optimal conditions and coordinated downregulation upon stress. At the majority of RPG promoters, Ifh1-dependent regulation predominates, whereas Sfp1 plays the major role at all other genes. We also uncovered an unexpected protein homeostasis-dependent binding property of Hmo1 at RPG promoters. Finally, we show that the Ifh1 paralog Crf1, previously described as a transcriptional repressor, can act as a constitutive RPG activator. Our study provides a more complete picture of RPG regulation and may serve as a paradigm for unravelling RPG regulation in multicellular eukaryotes.
Collapse
Affiliation(s)
- Sevil Zencir
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, 30 quai Ernest-Ansermet, 1211 Geneva 4, Switzerland
| | - Daniel Dilg
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, 30 quai Ernest-Ansermet, 1211 Geneva 4, Switzerland
| | - Maria Paula Rueda
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, 30 quai Ernest-Ansermet, 1211 Geneva 4, Switzerland
| | - David Shore
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, 30 quai Ernest-Ansermet, 1211 Geneva 4, Switzerland
| | - Benjamin Albert
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, 30 quai Ernest-Ansermet, 1211 Geneva 4, Switzerland
| |
Collapse
|
18
|
de Jonge WJ, Brok M, Lijnzaad P, Kemmeren P, Holstege FCP. Genome-wide off-rates reveal how DNA binding dynamics shape transcription factor function. Mol Syst Biol 2020; 16:e9885. [PMID: 33280256 PMCID: PMC7586999 DOI: 10.15252/msb.20209885] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 09/06/2020] [Accepted: 09/10/2020] [Indexed: 11/25/2022] Open
Abstract
Protein-DNA interactions are dynamic, and these dynamics are an important aspect of chromatin-associated processes such as transcription or replication. Due to a lack of methods to study on- and off-rates across entire genomes, protein-DNA interaction dynamics have not been studied extensively. Here, we determine in vivo off-rates for the Saccharomyces cerevisiae chromatin organizing factor Abf1, at 191 sites simultaneously across the yeast genome. Average Abf1 residence times span a wide range, varying between 4.2 and 33 min. Sites with different off-rates are associated with different functional characteristics. This includes their transcriptional dependency on Abf1, nucleosome positioning and the size of the nucleosome-free region, as well as the ability to roadblock RNA polymerase II for termination. The results show how off-rates contribute to transcription factor function and that DIVORSEQ (Determining In Vivo Off-Rates by SEQuencing) is a meaningful way of investigating protein-DNA binding dynamics genome-wide.
Collapse
Affiliation(s)
- Wim J de Jonge
- Princess Máxima Center for Pediatric OncologyUtrechtThe Netherlands
| | - Mariël Brok
- Princess Máxima Center for Pediatric OncologyUtrechtThe Netherlands
| | - Philip Lijnzaad
- Princess Máxima Center for Pediatric OncologyUtrechtThe Netherlands
| | - Patrick Kemmeren
- Princess Máxima Center for Pediatric OncologyUtrechtThe Netherlands
| | | |
Collapse
|
19
|
Hainer SJ, Kaplan CD. Specialized RSC: Substrate Specificities for a Conserved Chromatin Remodeler. Bioessays 2020; 42:e2000002. [PMID: 32490565 PMCID: PMC7329613 DOI: 10.1002/bies.202000002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 03/11/2020] [Indexed: 01/16/2023]
Abstract
The remodel the structure of chromatin (RSC) nucleosome remodeling complex is a conserved chromatin regulator with roles in chromatin organization, especially over nucleosome depleted regions therefore functioning in gene expression. Recent reports in Saccharomyces cerevisiae have identified specificities in RSC activity toward certain types of nucleosomes. RSC has now been shown to preferentially evict nucleosomes containing the histone variant H2A.Z in vitro. Furthermore, biochemical activities of distinct RSC complexes has been found to differ when their nucleosome substrate is partially unraveled. Mammalian BAF complexes, the homologs of yeast RSC and SWI/SNF complexes, are also linked to nucleosomes with H2A.Z, but this relationship may be complex and extent of conservation remains to be determined. The interplay of remodelers with specific nucleosome substrates and regulation of remodeler outcomes by nucleosome composition are tantalizing questions given the wave of structural data emerging for RSC and other SWI/SNF family remodelers.
Collapse
Affiliation(s)
- Sarah J Hainer
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Craig D Kaplan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| |
Collapse
|
20
|
de Jonge WJ, Brok M, Kemmeren P, Holstege FCP. An Optimized Chromatin Immunoprecipitation Protocol for Quantification of Protein-DNA Interactions. STAR Protoc 2020; 1:100020. [PMID: 32685929 PMCID: PMC7357673 DOI: 10.1016/j.xpro.2020.100020] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Transcription factors are important regulators of cell fate and function. Knowledge about where transcription factors are bound in the genome is crucial for understanding their function. A common method to study protein-DNA interactions is chromatin immunoprecipitation (ChIP). Here, we present a revised ChIP protocol to determine protein-DNA interactions for the yeast Saccharomyces cerevisiae. We optimized several aspects of the procedure, including cross-linking and quenching, cell lysis, and immunoprecipitation steps. This protocol facilitates sensitive and reproducible quantitation of protein-DNA interactions. For complete details on the use and execution of this protocol, please refer to (de Jonge et al., 2019). Chromatin immunoprecipitation protocol to quantify protein-DNA interactions Optimized for sensitivity and robustness Optimized for quantitative comparisons between experiments, e.g., in time series Highlights common ChIP pitfalls, variable steps, and how to increase reproducibility
Collapse
Affiliation(s)
- Wim J de Jonge
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Mariël Brok
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Patrick Kemmeren
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Frank C P Holstege
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| |
Collapse
|
21
|
Dhillon N, Shelansky R, Townshend B, Jain M, Boeger H, Endy D, Kamakaka R. Permutational analysis of Saccharomyces cerevisiae regulatory elements. Synth Biol (Oxf) 2020; 5:ysaa007. [PMID: 32775697 PMCID: PMC7402160 DOI: 10.1093/synbio/ysaa007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 05/12/2020] [Accepted: 05/29/2020] [Indexed: 01/24/2023] Open
Abstract
Gene expression in Saccharomyces cerevisiae is regulated at multiple levels. Genomic and epigenomic mapping of transcription factors and chromatin factors has led to the delineation of various modular regulatory elements—enhancers (upstream activating sequences), core promoters, 5′ untranslated regions (5′ UTRs) and transcription terminators/3′ untranslated regions (3′ UTRs). However, only a few of these elements have been tested in combinations with other elements and the functional interactions between the different modular regulatory elements remain under explored. We describe a simple and rapid approach to build a combinatorial library of regulatory elements and have used this library to study 26 different enhancers, core promoters, 5′ UTRs and transcription terminators/3′ UTRs to estimate the contribution of individual regulatory parts in gene expression. Our combinatorial analysis shows that while enhancers initiate gene expression, core promoters modulate the levels of enhancer-mediated expression and can positively or negatively affect expression from even the strongest enhancers. Principal component analysis (PCA) indicates that enhancer and promoter function can be explained by a single principal component while UTR function involves multiple functional components. The PCA also highlights outliers and suggest differences in mechanisms of regulation by individual elements. Our data also identify numerous regulatory cassettes composed of different individual regulatory elements that exhibit equivalent gene expression levels. These data thus provide a catalog of elements that could in future be used in the design of synthetic regulatory circuits.
Collapse
Affiliation(s)
- Namrita Dhillon
- Department of MCD Biology, University of California, Santa Cruz, CA, USA
| | - Robert Shelansky
- Department of MCD Biology, University of California, Santa Cruz, CA, USA
| | - Brent Townshend
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Miten Jain
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Hinrich Boeger
- Department of MCD Biology, University of California, Santa Cruz, CA, USA
| | - Drew Endy
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Rohinton Kamakaka
- Department of MCD Biology, University of California, Santa Cruz, CA, USA
| |
Collapse
|
22
|
Qiu C, Jin H, Vvedenskaya I, Llenas JA, Zhao T, Malik I, Visbisky AM, Schwartz SL, Cui P, Čabart P, Han KH, Lai WKM, Metz RP, Johnson CD, Sze SH, Pugh BF, Nickels BE, Kaplan CD. Universal promoter scanning by Pol II during transcription initiation in Saccharomyces cerevisiae. Genome Biol 2020; 21:132. [PMID: 32487207 PMCID: PMC7265651 DOI: 10.1186/s13059-020-02040-0] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 05/08/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The majority of eukaryotic promoters utilize multiple transcription start sites (TSSs). How multiple TSSs are specified at individual promoters across eukaryotes is not understood for most species. In Saccharomyces cerevisiae, a pre-initiation complex (PIC) comprised of Pol II and conserved general transcription factors (GTFs) assembles and opens DNA upstream of TSSs. Evidence from model promoters indicates that the PIC scans from upstream to downstream to identify TSSs. Prior results suggest that TSS distributions at promoters where scanning occurs shift in a polar fashion upon alteration in Pol II catalytic activity or GTF function. RESULTS To determine the extent of promoter scanning across promoter classes in S. cerevisiae, we perturb Pol II catalytic activity and GTF function and analyze their effects on TSS usage genome-wide. We find that alterations to Pol II, TFIIB, or TFIIF function widely alter the initiation landscape consistent with promoter scanning operating at all yeast promoters, regardless of promoter class. Promoter architecture, however, can determine the extent of promoter sensitivity to altered Pol II activity in ways that are predicted by a scanning model. CONCLUSIONS Our observations coupled with previous data validate key predictions of the scanning model for Pol II initiation in yeast, which we term the shooting gallery. In this model, Pol II catalytic activity and the rate and processivity of Pol II scanning together with promoter sequence determine the distribution of TSSs and their usage.
Collapse
Affiliation(s)
- Chenxi Qiu
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Present Address: Department of Medicine, Division of Translational Therapeutics, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA
| | - Huiyan Jin
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Irina Vvedenskaya
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854, USA
- Department of Genetics, Rutgers University, Piscataway, NJ, 08854, USA
| | - Jordi Abante Llenas
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, 77843-3128, USA
- Present Address: Whitaker Biomedical Engineering Institute, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Tingting Zhao
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Indranil Malik
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Present Address: Department of Neurology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Alex M Visbisky
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Scott L Schwartz
- Genomics and Bioinformatics Service, Texas A&M AgriLife, College Station, TX, 77845, USA
| | - Ping Cui
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Pavel Čabart
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Present Address: First Faculty of Medicine, Charles University, BIOCEV, 252 42, Vestec, Czech Republic
| | - Kang Hoo Han
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, 16802, USA
| | - William K M Lai
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, 16802, USA
- Present Address: Department of Molecular Biology and Genetics, 458 Biotechnology, Cornell University, New York, 14853, USA
| | - Richard P Metz
- Genomics and Bioinformatics Service, Texas A&M AgriLife, College Station, TX, 77845, USA
| | - Charles D Johnson
- Genomics and Bioinformatics Service, Texas A&M AgriLife, College Station, TX, 77845, USA
| | - Sing-Hoi Sze
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX, 77843-3127, USA
| | - B Franklin Pugh
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA, 16802, USA
- Present Address: Department of Molecular Biology and Genetics, 458 Biotechnology, Cornell University, New York, 14853, USA
| | - Bryce E Nickels
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854, USA
- Department of Genetics, Rutgers University, Piscataway, NJ, 08854, USA
| | - Craig D Kaplan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
23
|
Goldshtein M, Mellul M, Deutch G, Imashimizu M, Takeuchi K, Meshorer E, Ram O, Lukatsky DB. Transcription Factor Binding in Embryonic Stem Cells Is Constrained by DNA Sequence Repeat Symmetry. Biophys J 2020; 118:2015-2026. [PMID: 32101712 DOI: 10.1016/j.bpj.2020.02.009] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 02/05/2020] [Accepted: 02/10/2020] [Indexed: 01/21/2023] Open
Abstract
Transcription factor (TF) recognition is dictated by the underlying DNA motif sequence specific for each TF. Here, we reveal that DNA sequence repeat symmetry plays a central role in defining TF-DNA-binding preferences. In particular, we find that different TFs bind similar symmetry patterns in the context of different developmental layers. Most TFs possess dominant preferences for similar DNA repeat symmetry types. However, in some cases, preferences of specific TFs are changed during differentiation, suggesting the importance of information encoded outside of known motif regions. Histone modifications also exhibit strong preferences for similar DNA repeat symmetry patterns unique to each type of modification. Next, using an in vivo reporter assay, we show that gene expression in embryonic stem cells can be positively modulated by the presence of genomic and computationally designed DNA oligonucleotides containing identified nonconsensus-repetitive sequence elements. This supports the hypothesis that certain nonconsensus-repetitive patterns possess a functional ability to regulate gene expression. We also performed a solution NMR experiment to probe the stability of double-stranded DNA via imino proton resonances for several double-stranded DNA sequences characterized by different repetitive patterns. We suggest that such local stability might play a key role in determining TF-DNA binding preferences. Overall, our findings show that despite the enormous sequence complexity of the TF-DNA binding landscape in differentiating embryonic stem cells, this landscape can be quantitatively characterized in simple terms using the notion of DNA sequence repeat symmetry.
Collapse
Affiliation(s)
- Matan Goldshtein
- Avram and Stella Goldstein-Goren Department of Biotechnology Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Meir Mellul
- Department of Biological Chemistry, The Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Gai Deutch
- Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Masahiko Imashimizu
- Molecular Profiling Research Center for Drug Discovery, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Koh Takeuchi
- Molecular Profiling Research Center for Drug Discovery, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Eran Meshorer
- Department of Genetics, The Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel; The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Oren Ram
- Department of Biological Chemistry, The Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel.
| | - David B Lukatsky
- Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel.
| |
Collapse
|
24
|
Kang Y, Patel NR, Shively C, Recio PS, Chen X, Wranik BJ, Kim G, McIsaac RS, Mitra R, Brent MR. Dual threshold optimization and network inference reveal convergent evidence from TF binding locations and TF perturbation responses. Genome Res 2020; 30:459-471. [PMID: 32060051 PMCID: PMC7111528 DOI: 10.1101/gr.259655.119] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 02/11/2020] [Indexed: 12/22/2022]
Abstract
A high-confidence map of the direct, functional targets of each transcription factor (TF) requires convergent evidence from independent sources. Two significant sources of evidence are TF binding locations and the transcriptional responses to direct TF perturbations. Systematic data sets of both types exist for yeast and human, but they rarely converge on a common set of direct, functional targets for a TF. Even the few genes that are both bound and responsive may not be direct functional targets. Our analysis shows that when there are many nonfunctional binding sites and many indirect targets, nonfunctional sites are expected to occur in the cis-regulatory DNA of indirect targets by chance. To address this problem, we introduce dual threshold optimization (DTO), a new method for setting significance thresholds on binding and perturbation-response data, and show that it improves convergence. It also enables comparison of binding data to perturbation-response data that have been processed by network inference algorithms, which further improves convergence. The combination of dual threshold optimization and network inference greatly expands the high-confidence TF network map in both yeast and human. Next, we analyze a comprehensive new data set measuring the transcriptional response shortly after inducing overexpression of a yeast TF. We also present a new yeast binding location data set obtained by transposon calling cards and compare it to recent ChIP-exo data. These new data sets improve convergence and expand the high-confidence network synergistically.
Collapse
Affiliation(s)
- Yiming Kang
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| | - Nikhil R Patel
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| | - Christian Shively
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Pamela Samantha Recio
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Xuhua Chen
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Bernd J Wranik
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - Griffin Kim
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - R Scott McIsaac
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - Robi Mitra
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Michael R Brent
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| |
Collapse
|
25
|
Oberbeckmann E, Wolff M, Krietenstein N, Heron M, Ellins JL, Schmid A, Krebs S, Blum H, Gerland U, Korber P. Absolute nucleosome occupancy map for the Saccharomyces cerevisiae genome. Genome Res 2019; 29:1996-2009. [PMID: 31694866 PMCID: PMC6886505 DOI: 10.1101/gr.253419.119] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 10/31/2019] [Indexed: 12/23/2022]
Abstract
Mapping of nucleosomes, the basic DNA packaging unit in eukaryotes, is fundamental for understanding genome regulation because nucleosomes modulate DNA access by their positioning along the genome. A cell-population nucleosome map requires two observables: nucleosome positions along the DNA ("Where?") and nucleosome occupancies across the population ("In how many cells?"). All available genome-wide nucleosome mapping techniques are yield methods because they score either nucleosomal (e.g., MNase-seq, chemical cleavage-seq) or nonnucleosomal (e.g., ATAC-seq) DNA but lose track of the total DNA population for each genomic region. Therefore, they only provide nucleosome positions and maybe compare relative occupancies between positions, but cannot measure absolute nucleosome occupancy, which is the fraction of all DNA molecules occupied at a given position and time by a nucleosome. Here, we established two orthogonal and thereby cross-validating approaches to measure absolute nucleosome occupancy across the Saccharomyces cerevisiae genome via restriction enzymes and DNA methyltransferases. The resulting high-resolution (9-bp) map shows uniform absolute occupancies. Most nucleosome positions are occupied in most cells: 97% of all nucleosomes called by chemical cleavage-seq have a mean absolute occupancy of 90 ± 6% (±SD). Depending on nucleosome position calling procedures, there are 57,000 to 60,000 nucleosomes per yeast cell. The few low absolute occupancy nucleosomes do not correlate with highly transcribed gene bodies, but correlate with increased presence of the nucleosome-evicting chromatin structure remodeling (RSC) complex, and are enriched upstream of highly transcribed or regulated genes. Our work provides a quantitative method and reference frame in absolute terms for future chromatin studies.
Collapse
Affiliation(s)
- Elisa Oberbeckmann
- Molecular Biology Division, Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany
| | - Michael Wolff
- Physik Department, Technische Universität München, 85748 Garching, Germany
| | - Nils Krietenstein
- Molecular Biology Division, Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany.,Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA
| | - Mark Heron
- Quantitative and Computational Biology, Max Planck Institute for Biophysical Chemistry, 37077 Göttingen, Germany.,Gene Center, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, 81377 Munich, Germany
| | - Jessica L Ellins
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, United Kingdom
| | - Andrea Schmid
- Molecular Biology Division, Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany
| | - Stefan Krebs
- Laboratory of Functional Genome Analysis (LAFUGA), Gene Center, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, 81377 Munich, Germany
| | - Helmut Blum
- Laboratory of Functional Genome Analysis (LAFUGA), Gene Center, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, 81377 Munich, Germany
| | - Ulrich Gerland
- Physik Department, Technische Universität München, 85748 Garching, Germany
| | - Philipp Korber
- Molecular Biology Division, Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, 82152 Planegg-Martinsried, Germany
| |
Collapse
|
26
|
Homotypic cooperativity and collective binding are determinants of bHLH specificity and function. Proc Natl Acad Sci U S A 2019; 116:16143-16152. [PMID: 31341088 DOI: 10.1073/pnas.1818015116] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Eukaryotic cells express transcription factor (TF) paralogues that bind to nearly identical DNA sequences in vitro but bind at different genomic loci and perform different functions in vivo. Predicting how 2 paralogous TFs bind in vivo using DNA sequence alone is an important open problem. Here, we analyzed 2 yeast bHLH TFs, Cbf1p and Tye7p, which have highly similar binding preferences in vitro, yet bind at almost completely nonoverlapping target loci in vivo. We dissected the determinants of specificity for these 2 proteins by making a number of chimeric TFs in which we swapped different domains of Cbf1p and Tye7p and determined the effects on in vivo binding and cellular function. From these experiments, we learned that the Cbf1p dimer achieves its specificity by binding cooperatively with other Cbf1p dimers bound nearby. In contrast, we found that Tye7p achieves its specificity by binding cooperatively with 3 other DNA-binding proteins, Gcr1p, Gcr2p, and Rap1p. Remarkably, most promoters (63%) that are bound by Tye7p do not contain a consensus Tye7p binding site. Using this information, we were able to build simple models to accurately discriminate bound and unbound genomic loci for both Cbf1p and Tye7p. We then successfully reprogrammed the human bHLH NPAS2 to bind Cbf1p in vivo targets and a Tye7p target intergenic region to be bound by Cbf1p. These results demonstrate that the genome-wide binding targets of paralogous TFs can be discriminated using sequence information, and provide lessons about TF specificity that can be applied across the phylogenetic tree.
Collapse
|
27
|
Ghosh RP, Shi Q, Yang L, Reddick MP, Nikitina T, Zhurkin VB, Fordyce P, Stasevich TJ, Chang HY, Greenleaf WJ, Liphardt JT. Satb1 integrates DNA binding site geometry and torsional stress to differentially target nucleosome-dense regions. Nat Commun 2019; 10:3221. [PMID: 31324780 PMCID: PMC6642133 DOI: 10.1038/s41467-019-11118-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 06/20/2019] [Indexed: 01/12/2023] Open
Abstract
The Satb1 genome organizer regulates multiple cellular and developmental processes. It is not yet clear how Satb1 selects different sets of targets throughout the genome. Here we have used live-cell single molecule imaging and deep sequencing to assess determinants of Satb1 binding-site selectivity. We have found that Satb1 preferentially targets nucleosome-dense regions and can directly bind consensus motifs within nucleosomes. Some genomic regions harbor multiple, regularly spaced Satb1 binding motifs (typical separation ~1 turn of the DNA helix) characterized by highly cooperative binding. The Satb1 homeodomain is dispensable for high affinity binding but is essential for specificity. Finally, we find that Satb1-DNA interactions are mechanosensitive. Increasing negative torsional stress in DNA enhances Satb1 binding and Satb1 stabilizes base unpairing regions against melting by molecular machines. The ability of Satb1 to control diverse biological programs may reflect its ability to combinatorially use multiple site selection criteria.
Collapse
Affiliation(s)
- Rajarshi P Ghosh
- Bioengineering, Stanford University, Stanford, CA, 94305, USA
- BioX Institute, Stanford University, Stanford, CA, 94305, USA
- ChEM-H, Stanford University, Stanford, CA, 94305, USA
- Cell Biology Division, Stanford Cancer Institute, Stanford, CA, 94305, USA
| | - Quanming Shi
- Bioengineering, Stanford University, Stanford, CA, 94305, USA
- BioX Institute, Stanford University, Stanford, CA, 94305, USA
- ChEM-H, Stanford University, Stanford, CA, 94305, USA
- Cell Biology Division, Stanford Cancer Institute, Stanford, CA, 94305, USA
| | - Linfeng Yang
- Bioengineering, Stanford University, Stanford, CA, 94305, USA
- BioX Institute, Stanford University, Stanford, CA, 94305, USA
- ChEM-H, Stanford University, Stanford, CA, 94305, USA
- Cell Biology Division, Stanford Cancer Institute, Stanford, CA, 94305, USA
| | - Michael P Reddick
- Bioengineering, Stanford University, Stanford, CA, 94305, USA
- BioX Institute, Stanford University, Stanford, CA, 94305, USA
- ChEM-H, Stanford University, Stanford, CA, 94305, USA
- Cell Biology Division, Stanford Cancer Institute, Stanford, CA, 94305, USA
- Chemical Engineering, Stanford University, Stanford, CA, 94305, USA
| | - Tatiana Nikitina
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Victor B Zhurkin
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Polly Fordyce
- Bioengineering, Stanford University, Stanford, CA, 94305, USA
- ChEM-H, Stanford University, Stanford, CA, 94305, USA
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA, 94158, USA
| | - Timothy J Stasevich
- Department of Biochemistry and Molecular Biology and the Institute for Genome Architecture and Function, Colorado State University, Fort Collins, CO, USA
| | - Howard Y Chang
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA, 94305, USA
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA
| | - William J Greenleaf
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
- Department of Applied Physics, Stanford University, Stanford, United States
| | - Jan T Liphardt
- Bioengineering, Stanford University, Stanford, CA, 94305, USA.
- BioX Institute, Stanford University, Stanford, CA, 94305, USA.
- ChEM-H, Stanford University, Stanford, CA, 94305, USA.
- Cell Biology Division, Stanford Cancer Institute, Stanford, CA, 94305, USA.
| |
Collapse
|
28
|
Cohen DM, Lim HW, Won KJ, Steger DJ. Shared nucleotide flanks confer transcriptional competency to bZip core motifs. Nucleic Acids Res 2019; 46:8371-8384. [PMID: 30085281 PMCID: PMC6144830 DOI: 10.1093/nar/gky681] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Accepted: 07/17/2018] [Indexed: 12/31/2022] Open
Abstract
Sequence-specific DNA binding recruits transcription factors (TFs) to the genome to regulate gene expression. Here, we perform high resolution mapping of CEBP proteins to determine how sequence dictates genomic occupancy. We demonstrate a fundamental difference between the sequence repertoire utilized by CEBPs in vivo versus the palindromic sequence preference reported by classical in vitro models, by identifying a palindromic motif at <1% of the genomic binding sites. On the native genome, CEBPs bind a diversity of related 10 bp sequences resulting from the fusion of degenerate and canonical half-sites. Altered DNA specificity of CEBPs in cells occurs through heterodimerization with other bZip TFs, and approximately 40% of CEBP-binding sites in primary human cells harbor motifs characteristic of CEBP heterodimers. In addition, we uncover an important role for sequence bias at core-motif-flanking bases for CEBPs and demonstrate that flanking bases regulate motif function across mammalian bZip TFs. Favorable flanking bases confer efficient TF occupancy and transcriptional activity, and DNA shape may explain how the flanks alter TF binding. Importantly, motif optimization within the 10-mer is strongly correlated with cell-type-independent recruitment of CEBPβ, providing key insight into how sequence sub-optimization affects genomic occupancy of widely expressed CEBPs across cell types.
Collapse
Affiliation(s)
- Daniel M Cohen
- Division of Endocrinology, Diabetes, and Metabolism, Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.,The Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Hee-Woong Lim
- The Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kyoung-Jae Won
- The Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.,Department of Genetics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.,Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark
| | - David J Steger
- Division of Endocrinology, Diabetes, and Metabolism, Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.,The Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
29
|
Albert B, Tomassetti S, Gloor Y, Dilg D, Mattarocci S, Kubik S, Hafner L, Shore D. Sfp1 regulates transcriptional networks driving cell growth and division through multiple promoter-binding modes. Genes Dev 2019; 33:288-293. [PMID: 30804227 PMCID: PMC6411004 DOI: 10.1101/gad.322040.118] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/17/2018] [Indexed: 12/19/2022]
Abstract
In this study, Albert et al. investigated the mechanisms by which the yeast Sfp1 protein coordinates both cell division and growth. They demonstrate that Sfp1 directly controls genes required for ribosome production and many other growth-promoting processes. The yeast Sfp1 protein regulates both cell division and growth but how it coordinates these processes is poorly understood. We demonstrate that Sfp1 directly controls genes required for ribosome production and many other growth-promoting processes. Remarkably, the complete set of Sfp1 target genes is revealed only by a combination of ChIP (chromatin immunoprecipitation) and ChEC (chromatin endogenous cleavage) methods, which uncover two promoter binding modes, one requiring a cofactor and the other a DNA-recognition motif. Glucose-regulated Sfp1 binding at cell cycle “START” genes suggests that Sfp1 controls cell size by coordinating expression of genes implicated in mass accumulation and cell division.
Collapse
Affiliation(s)
- Benjamin Albert
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Susanna Tomassetti
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Yvonne Gloor
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Daniel Dilg
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Stefano Mattarocci
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Slawomir Kubik
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Lukas Hafner
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - David Shore
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| |
Collapse
|