1
|
Ronzio M, Bernardini A, Taglietti V, Ceribelli M, Donati G, Gallo A, Pavesi G, Dellabona P, Casorati G, Messina G, Mantovani R, Dolfini D. Genomic binding of NF-Y in mouse and human cells. Genomics 2024; 116:110895. [PMID: 39025317 DOI: 10.1016/j.ygeno.2024.110895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 06/12/2024] [Accepted: 07/13/2024] [Indexed: 07/20/2024]
Abstract
NF-Y is a Transcription Factor that regulates transcription through binding to the CCAAT-box. To understand its strategy, we analyzed 16 ChIP-seq datasets from human and mouse cells. Shared loci, mostly located in promoters of expressed genes of cell cycle, metabolism and gene expression pathways, are associated with histone marks of active chromatin and specific modules of TFs. Other peaks are in enhancers and Transposable Elements -TE- of retroviral origin in human and mouse. We evaluated the relationship with USF1, a common synergistic partner in promoters and MLT1 TEs, upon NF-YB inactivation: USF1 binding decreases in promoters, modestly in MLT1, suggesting a pioneering role of NF-Y in formers, not in the latters. These data define a common set of NF-Y functional targets across different mammalian cell types, suggesting a pioneering role in promoters with respect to TEs.
Collapse
Affiliation(s)
- Mirko Ronzio
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Andrea Bernardini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | | | - Michele Ceribelli
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Giacomo Donati
- Dipartimento di Scienze della Vita e Biologia dei Sistemi, Università degli Studi di Torino, Torino, Italy
| | - Alberto Gallo
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Giulio Pavesi
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Paolo Dellabona
- Experimental Immunology Unit. Division of Immunology, Transplantation and Infectious Diseases, San Raffaele Scientific Institute, Milano, Italy
| | - Giulia Casorati
- Experimental Immunology Unit. Division of Immunology, Transplantation and Infectious Diseases, San Raffaele Scientific Institute, Milano, Italy
| | - Graziella Messina
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Roberto Mantovani
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Diletta Dolfini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy.
| |
Collapse
|
2
|
Bentsen M, Heger V, Schultheis H, Kuenne C, Looso M. TF-COMB - discovering grammar of transcription factor binding sites. Comput Struct Biotechnol J 2022; 20:4040-4051. [PMID: 35983231 PMCID: PMC9358416 DOI: 10.1016/j.csbj.2022.07.025] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 07/12/2022] [Indexed: 02/07/2023] Open
Abstract
Cooperativity between transcription factors is important to regulate target gene expression. In particular, the binding grammar of TFs in relation to each other, as well as in the context of other genomic elements, is crucial for TF functionality. However, tools to easily uncover co-occurrence between DNA-binding proteins, and investigate the regulatory modules of TFs, are limited. Here we present TF-COMB (Transcription Factor Co-Occurrence using Market Basket analysis) - a tool to investigate co-occurring TFs and binding grammar within regulatory regions. We found that TF-COMB can accurately identify known co-occurring TFs from ChIP-seq data, as well as uncover preferential localization to other genomic elements. With the use of ATAC-seq footprinting and TF motif locations, we found that TFs exhibit both preferred orientation and distance in relation to each other, and that these are biologically significant. Finally, we extended the analysis to not only investigate individual TF pairs, but also TF pairs in the context of networks, which enabled the investigation of TF complexes and TF hubs. In conclusion, TF-COMB is a flexible tool to investigate various aspects of TF binding grammar.
Collapse
Affiliation(s)
- Mette Bentsen
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Vanessa Heger
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Hendrik Schultheis
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Carsten Kuenne
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Mario Looso
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Corresponding author at: Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany.
| |
Collapse
|
3
|
Suryatenggara J, Yong KJ, Tenen DE, Tenen DG, Bassal MA. ChIP-AP: an integrated analysis pipeline for unbiased ChIP-seq analysis. Brief Bioinform 2021; 23:6489109. [PMID: 34965583 PMCID: PMC8769893 DOI: 10.1093/bib/bbab537] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 11/02/2021] [Accepted: 11/19/2021] [Indexed: 12/15/2022] Open
Abstract
Chromatin immunoprecipitation coupled with sequencing (ChIP-seq) is a technique used to identify protein–DNA interaction sites through antibody pull-down, sequencing and analysis; with enrichment ‘peak’ calling being the most critical analytical step. Benchmarking studies have consistently shown that peak callers have distinct selectivity and specificity characteristics that are not additive and seldom completely overlap in many scenarios, even after parameter optimization. We therefore developed ChIP-AP, an integrated ChIP-seq analysis pipeline utilizing four independent peak callers, which seamlessly processes raw sequencing files to final result. This approach enables (1) better gauging of peak confidence through detection by multiple algorithms, and (2) more thoroughly surveys the binding landscape by capturing peaks not detected by individual callers. Final analysis results are then integrated into a single output table, enabling users to explore their data by applying selectivity and sensitivity thresholds that best address their biological questions, without needing any additional reprocessing. ChIP-AP therefore presents investigators with a more comprehensive coverage of the binding landscape without requiring additional wet-lab observations.
Collapse
Affiliation(s)
- Jeremiah Suryatenggara
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, 117599, Singapore
| | - Kol Jia Yong
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, 117599, Singapore.,Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117597, Singapore
| | | | - Daniel G Tenen
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, 117599, Singapore.,Harvard Stem Cell Institute, Boston, 02138, USA
| | - Mahmoud A Bassal
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, 117599, Singapore.,Harvard Stem Cell Institute, Boston, 02138, USA
| |
Collapse
|
4
|
Bernardini A, Lorenzo M, Chaves-Sanjuan A, Swuec P, Pigni M, Saad D, Konarev PV, Graewert MA, Valentini E, Svergun DI, Nardini M, Mantovani R, Gnesutta N. The USR domain of USF1 mediates NF-Y interactions and cooperative DNA binding. Int J Biol Macromol 2021; 193:401-413. [PMID: 34673109 DOI: 10.1016/j.ijbiomac.2021.10.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 10/07/2021] [Accepted: 10/08/2021] [Indexed: 10/20/2022]
Abstract
The trimeric CCAAT-binding NF-Y is a "pioneer" Transcription Factor -TF- known to cooperate with neighboring TFs to regulate gene expression. Genome-wide analyses detected a precise stereo-alignment -10/12 bp- of CCAAT with E-box elements and corresponding colocalization of NF-Y with basic-Helix-Loop-Helix (bHLH) TFs. We dissected here NF-Y interactions with USF1 and MAX. USF1, but not MAX, cooperates in DNA binding with NF-Y. NF-Y and USF1 synergize to activate target promoters. Reconstruction of complexes by structural means shows independent DNA binding of MAX, whereas USF1 has extended contacts with NF-Y, involving the USR, a USF-specific amino acid sequence stretch required for trans-activation. The USR is an intrinsically disordered domain and adopts different conformations based on E-box-CCAAT distances. Deletion of the USR abolishes cooperative DNA binding with NF-Y. Our data indicate that the functionality of certain unstructured domains involves adapting to small variation in stereo-alignments of the multimeric TFs sites.
Collapse
Affiliation(s)
- Andrea Bernardini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | - Mariangela Lorenzo
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | | | - Paolo Swuec
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | - Matteo Pigni
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | - Dana Saad
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | - Petr V Konarev
- A.V. Shubnikov Institute of Crystallography, Federal Scientific Research Centre "Crystallography and Photonics" of Russian Academy of Science, Moscow 119333, Russian Federation
| | | | - Erica Valentini
- European Molecular Biology Laboratory, Hamburg Unit, Hamburg 22607, Germany
| | - Dmitri I Svergun
- European Molecular Biology Laboratory, Hamburg Unit, Hamburg 22607, Germany
| | - Marco Nardini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
| | - Roberto Mantovani
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy.
| | - Nerina Gnesutta
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy.
| |
Collapse
|
5
|
Ronzio M, Bernardini A, Pavesi G, Mantovani R, Dolfini D. On the NF-Y regulome as in ENCODE (2019). PLoS Comput Biol 2020; 16:e1008488. [PMID: 33370256 PMCID: PMC7793273 DOI: 10.1371/journal.pcbi.1008488] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 01/08/2021] [Accepted: 11/04/2020] [Indexed: 11/19/2022] Open
Abstract
NF-Y is a trimeric Transcription Factor -TF- which binds with high selectivity to the conserved CCAAT element. Individual ChIP-seq analysis as well as ENCODE have progressively identified locations shared by other TFs. Here, we have analyzed data introduced by ENCODE over the last five years in K562, HeLa-S3 and GM12878, including several chromatin features, as well RNA-seq profiling of HeLa cells after NF-Y inactivation. We double the number of sequence-specific TFs and co-factors reported. We catalogue them in 4 classes based on co-association criteria, infer target genes categorizations, identify positional bias of binding sites and gene expression changes. Larger and novel co-associations emerge, specifically concerning subunits of repressive complexes as well as RNA-binding proteins. On the one hand, these data better define NF-Y association with single members of major classes of TFs, on the other, they suggest that it might have a wider role in the control of mRNA production. The ongoing ENCODE consortium represents a useful compendium of locations of TFs, chromatin marks, gene expression data. In previous reports, we identified modules of CCAAT-binding NF-Y with individual TFs. Here, we analyzed all 363 factors currently present: 68 with enrichment of CCAAT in their locations, 38 with overlap of peaks. New sequence-specific TFs, co-activators and co-repressors are reported. Co-association patterns correspond to specific targeted genes categorizations and gene expression changes, as assessed by RNA-seq after NF-Y inactivation. These data widen and better define a coherent model of synergy of NF-Y with selected groups of TFs and co-factors.
Collapse
Affiliation(s)
- Mirko Ronzio
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Andrea Bernardini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Giulio Pavesi
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Roberto Mantovani
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
| | - Diletta Dolfini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Milano, Italy
- * E-mail:
| |
Collapse
|