1
|
Qin T, Lee C, Li S, Cavalcante RG, Orchard P, Yao H, Zhang H, Wang S, Patil S, Boyle AP, Sartor MA. Comprehensive enhancer-target gene assignments improve gene set level interpretation of genome-wide regulatory data. Genome Biol 2022; 23:105. [PMID: 35473573 PMCID: PMC9044877 DOI: 10.1186/s13059-022-02668-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 04/06/2022] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Revealing the gene targets of distal regulatory elements is challenging yet critical for interpreting regulome data. Experiment-derived enhancer-gene links are restricted to a small set of enhancers and/or cell types, while the accuracy of genome-wide approaches remains elusive due to the lack of a systematic evaluation. We combined multiple spatial and in silico approaches for defining enhancer locations and linking them to their target genes aggregated across >500 cell types, generating 1860 human genome-wide distal enhancer-to-target gene definitions (EnTDefs). To evaluate performance, we used gene set enrichment (GSE) testing on 87 independent ENCODE ChIP-seq datasets of 34 transcription factors (TFs) and assessed concordance of results with known TF Gene Ontology annotations, and other benchmarks. RESULTS The top ranked 741 (40%) EnTDefs significantly outperform the common, naïve approach of linking distal regions to the nearest genes, and the top 10 EnTDefs perform well when applied to ChIP-seq data of other cell types. The GSE-based ranking of EnTDefs is highly concordant with ranking based on overlap with curated benchmarks of enhancer-gene interactions. Both our top general EnTDef and cell-type-specific EnTDefs significantly outperform seven independent computational and experiment-based enhancer-gene pair datasets. We show that using our top EnTDefs for GSE with either genome-wide DNA methylation or ATAC-seq data is able to better recapitulate the biological processes changed in gene expression data performed in parallel for the same experiment than our lower-ranked EnTDefs. CONCLUSIONS Our findings illustrate the power of our approach to provide genome-wide interpretation regardless of cell type.
Collapse
Affiliation(s)
- Tingting Qin
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.
| | - Christopher Lee
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Biostatistics, School of Public Health, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Shiting Li
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Raymond G Cavalcante
- Biomedical Research Core Facilities, Epigenomics Core, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Peter Orchard
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Heming Yao
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Hanrui Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Shuze Wang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Snehal Patil
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Alan P Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Maureen A Sartor
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.
- Department of Biostatistics, School of Public Health, University of Michigan Medical School, Ann Arbor, MI, USA.
| |
Collapse
|
2
|
O'Connor T, Grant CE, Bodén M, Bailey TL. T-Gene: improved target gene prediction. Bioinformatics 2020; 36:3902-3904. [PMID: 32246829 DOI: 10.1093/bioinformatics/btaa227] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 03/04/2020] [Accepted: 03/30/2020] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Identifying the genes regulated by a given transcription factor (TF) (its 'target genes') is a key step in developing a comprehensive understanding of gene regulation. Previously, we developed a method (CisMapper) for predicting the target genes of a TF based solely on the correlation between a histone modification at the TF's binding site and the expression of the gene across a set of tissues or cell lines. That approach is limited to organisms for which extensive histone and expression data are available, and does not explicitly incorporate the genomic distance between the TF and the gene. RESULTS We present the T-Gene algorithm, which overcomes these limitations. It can be used to predict which genes are most likely to be regulated by a TF, and which of the TF's binding sites are most likely involved in regulating particular genes. T-Gene calculates a novel score that combines distance and histone/expression correlation, and we show that this score accurately predicts when a regulatory element bound by a TF is in contact with a gene's promoter, achieving median precision above 60%. T-Gene is easy to use via its web server or as a command-line tool, and can also make accurate predictions (median precision above 40%) based on distance alone when extensive histone/expression data is not available for the organism. T-Gene provides an estimate of the statistical significance of each of its predictions. AVAILABILITY AND IMPLEMENTATION The T-Gene web server, source code, histone/expression data and genome annotation files are provided at http://meme-suite.org. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Charles E Grant
- Department of Genome Sciences, University of Washington, Seattle, WA 98195-5065
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane 4072, Australia
| | - Timothy L Bailey
- Department of Pharmacology, University of Nevada, Reno, NV 89557, USA
| |
Collapse
|
3
|
Machine learning uncovers cell identity regulator by histone code. Nat Commun 2020; 11:2696. [PMID: 32483223 PMCID: PMC7264183 DOI: 10.1038/s41467-020-16539-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 05/09/2020] [Indexed: 01/13/2023] Open
Abstract
Conversion between cell types, e.g., by induced expression of master transcription factors, holds great promise for cellular therapy. Our ability to manipulate cell identity is constrained by incomplete information on cell identity genes (CIGs) and their expression regulation. Here, we develop CEFCIG, an artificial intelligent framework to uncover CIGs and further define their master regulators. On the basis of machine learning, CEFCIG reveals unique histone codes for transcriptional regulation of reported CIGs, and utilizes these codes to predict CIGs and their master regulators with high accuracy. Applying CEFCIG to 1,005 epigenetic profiles, our analysis uncovers the landscape of regulation network for identity genes in individual cell or tissue types. Together, this work provides insights into cell identity regulation, and delivers a powerful technique to facilitate regenerative medicine.
Collapse
|
4
|
Dubois V, Gheeraert C, Vankrunkelsven W, Dubois‐Chevalier J, Dehondt H, Bobowski‐Gerard M, Vinod M, Zummo FP, Güiza F, Ploton M, Dorchies E, Pineau L, Boulinguiez A, Vallez E, Woitrain E, Baugé E, Lalloyer F, Duhem C, Rabhi N, van Kesteren RE, Chiang C, Lancel S, Duez H, Annicotte J, Paumelle R, Vanhorebeek I, Van den Berghe G, Staels B, Lefebvre P, Eeckhoute J. Endoplasmic reticulum stress actively suppresses hepatic molecular identity in damaged liver. Mol Syst Biol 2020; 16:e9156. [PMID: 32407006 PMCID: PMC7224309 DOI: 10.15252/msb.20199156] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 04/09/2020] [Accepted: 04/14/2020] [Indexed: 02/06/2023] Open
Abstract
Liver injury triggers adaptive remodeling of the hepatic transcriptome for repair/regeneration. We demonstrate that this involves particularly profound transcriptomic alterations where acute induction of genes involved in handling of endoplasmic reticulum stress (ERS) is accompanied by partial hepatic dedifferentiation. Importantly, widespread hepatic gene downregulation could not simply be ascribed to cofactor squelching secondary to ERS gene induction, but rather involves a combination of active repressive mechanisms. ERS acts through inhibition of the liver-identity (LIVER-ID) transcription factor (TF) network, initiated by rapid LIVER-ID TF protein loss. In addition, induction of the transcriptional repressor NFIL3 further contributes to LIVER-ID gene repression. Alteration to the liver TF repertoire translates into compromised activity of regulatory regions characterized by the densest co-recruitment of LIVER-ID TFs and decommissioning of BRD4 super-enhancers driving hepatic identity. While transient repression of the hepatic molecular identity is an intrinsic part of liver repair, sustained disequilibrium between the ERS and LIVER-ID transcriptional programs is linked to liver dysfunction as shown using mouse models of acute liver injury and livers from deceased human septic patients.
Collapse
Affiliation(s)
- Vanessa Dubois
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
- Present address:
Clinical and Experimental EndocrinologyDepartment of Chronic Diseases, Metabolism and Ageing (CHROMETA)KU LeuvenLeuvenBelgium
| | - Céline Gheeraert
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Wouter Vankrunkelsven
- Clinical Division and Laboratory of Intensive Care MedicineDepartment of Cellular and Molecular MedicineKU LeuvenLeuvenBelgium
| | | | - Hélène Dehondt
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | | | - Manjula Vinod
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | | | - Fabian Güiza
- Clinical Division and Laboratory of Intensive Care MedicineDepartment of Cellular and Molecular MedicineKU LeuvenLeuvenBelgium
| | - Maheul Ploton
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Emilie Dorchies
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Laurent Pineau
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Alexis Boulinguiez
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Emmanuelle Vallez
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Eloise Woitrain
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Eric Baugé
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Fanny Lalloyer
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Christian Duhem
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Nabil Rabhi
- UMR 8199 ‐ EGIDCNRSInstitut Pasteur de LilleUniversity of LilleLilleFrance
| | - Ronald E van Kesteren
- Center for Neurogenomics and Cognitive ResearchNeuroscience Campus AmsterdamVU UniversityAmsterdamThe Netherlands
| | - Cheng‐Ming Chiang
- Simmons Comprehensive Cancer CenterDepartments of Biochemistry and PharmacologyUniversity of Texas Southwestern Medical CenterDallasTXUSA
| | - Steve Lancel
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Hélène Duez
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | | | - Réjane Paumelle
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Ilse Vanhorebeek
- Clinical Division and Laboratory of Intensive Care MedicineDepartment of Cellular and Molecular MedicineKU LeuvenLeuvenBelgium
| | - Greet Van den Berghe
- Clinical Division and Laboratory of Intensive Care MedicineDepartment of Cellular and Molecular MedicineKU LeuvenLeuvenBelgium
| | - Bart Staels
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Philippe Lefebvre
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| | - Jérôme Eeckhoute
- Inserm, CHU LilleInstitut Pasteur de LilleU1011‐EGIDUniversity of LilleLilleFrance
| |
Collapse
|
5
|
Kim HJ, Osteil P, Humphrey SJ, Cinghu S, Oldfield AJ, Patrick E, Wilkie EE, Peng G, Suo S, Jothi R, Tam PPL, Yang P. Transcriptional network dynamics during the progression of pluripotency revealed by integrative statistical learning. Nucleic Acids Res 2020; 48:1828-1842. [PMID: 31853542 PMCID: PMC7038952 DOI: 10.1093/nar/gkz1179] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 12/02/2019] [Accepted: 12/09/2019] [Indexed: 12/12/2022] Open
Abstract
The developmental potential of cells, termed pluripotency, is highly dynamic and progresses through a continuum of naive, formative and primed states. Pluripotency progression of mouse embryonic stem cells (ESCs) from naive to formative and primed state is governed by transcription factors (TFs) and their target genes. Genomic techniques have uncovered a multitude of TF binding sites in ESCs, yet a major challenge lies in identifying target genes from functional binding sites and reconstructing dynamic transcriptional networks underlying pluripotency progression. Here, we integrated time-resolved ‘trans-omic’ datasets together with TF binding profiles and chromatin conformation data to identify target genes of a panel of TFs. Our analyses revealed that naive TF target genes are more likely to be TFs themselves than those of formative TFs, suggesting denser hierarchies among naive TFs. We also discovered that formative TF target genes are marked by permissive epigenomic signatures in the naive state, indicating that they are poised for expression prior to the initiation of pluripotency transition to the formative state. Finally, our reconstructed transcriptional networks pinpointed the precise timing from naive to formative pluripotency progression and enabled the spatiotemporal mapping of differentiating ESCs to their in vivo counterparts in developing embryos.
Collapse
Affiliation(s)
- Hani Jieun Kim
- Charles Perkins Centre, School of Mathematics and Statistics, University of Sydney, Sydney, NSW 2006, Australia.,Computational Systems Biology Group, Children's Medical Research Institute, University of Sydney, Westmead, NSW 2145, Australia.,School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, NSW 2006, Australia
| | - Pierre Osteil
- School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, NSW 2006, Australia.,Embryology Unit, Children's Medical Research Institute, University of Sydney, Westmead, NSW 2145, Australia
| | - Sean J Humphrey
- Charles Perkins Centre, School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Senthilkumar Cinghu
- Epigenetics & Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Andrew J Oldfield
- Institute of Human Genetics, CNRS, University of Montpellier, Montpellier, France
| | - Ellis Patrick
- Charles Perkins Centre, School of Mathematics and Statistics, University of Sydney, Sydney, NSW 2006, Australia.,School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, NSW 2006, Australia.,Westmead Institute for Medical Research, University of Sydney, Westmead, NSW 2145, Australia
| | - Emilie E Wilkie
- Embryology Unit, Children's Medical Research Institute, University of Sydney, Westmead, NSW 2145, Australia
| | - Guangdun Peng
- CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China, and Guangzhou Regenerative Medicine and Health Guangdong Laboratory (GRMH-GDL), Guangzhou 510005, China
| | - Shengbao Suo
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health, Boston, MA 02215, USA
| | - Raja Jothi
- Epigenetics & Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Patrick P L Tam
- School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, NSW 2006, Australia.,Embryology Unit, Children's Medical Research Institute, University of Sydney, Westmead, NSW 2145, Australia
| | - Pengyi Yang
- Charles Perkins Centre, School of Mathematics and Statistics, University of Sydney, Sydney, NSW 2006, Australia.,Computational Systems Biology Group, Children's Medical Research Institute, University of Sydney, Westmead, NSW 2145, Australia.,School of Medical Sciences, Faculty of Medicine and Health, University of Sydney, NSW 2006, Australia
| |
Collapse
|
6
|
Lee C, Wang K, Qin T, Sartor MA. Testing Proximity of Genomic Regions to Transcription Start Sites and Enhancers Complements Gene Set Enrichment Testing. Front Genet 2020; 11:199. [PMID: 32211031 PMCID: PMC7069355 DOI: 10.3389/fgene.2020.00199] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 02/20/2020] [Indexed: 11/13/2022] Open
Abstract
Large sets of genomic regions are generated by the initial analysis of various genome-wide sequencing data, such as ChIP-seq and ATAC-seq experiments. Gene set enrichment (GSE) methods are commonly employed to determine the pathways associated with them. Given the pathways and other gene sets (e.g., GO terms) of significance, it is of great interest to know the extent to which each is driven by binding near transcription start sites (TSS) or near enhancers. Currently, no tool performs such an analysis. Here, we present a method that addresses this question to complement GSE methods for genomic regions. Specifically, the new method tests whether the genomic regions in a gene set are significantly closer to a TSS (or to an enhancer) than expected by chance given the total list of genomic regions, using a non-parametric test. Combining the results from a GSE test with our novel method provides additional information regarding the mode of regulation of each pathway, and additional evidence that the pathway is truly enriched. We illustrate our new method with a large set of ENCODE ChIP-seq data, using the chipenrich Bioconductor package. The results show that our method is a powerful complementary approach to help researchers interpret large sets of genomic regions.
Collapse
Affiliation(s)
- Christopher Lee
- Department of Computational Medicine and Bioinformatics, School of Medicine, University of Michigan, Ann Arbor, MI, United States
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, United States
| | - Kai Wang
- Department of Computational Medicine and Bioinformatics, School of Medicine, University of Michigan, Ann Arbor, MI, United States
| | - Tingting Qin
- Department of Computational Medicine and Bioinformatics, School of Medicine, University of Michigan, Ann Arbor, MI, United States
| | - Maureen A. Sartor
- Department of Computational Medicine and Bioinformatics, School of Medicine, University of Michigan, Ann Arbor, MI, United States
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
7
|
Fraser J, Essebier A, Brown AS, Davila RA, Harkins D, Zalucki O, Shapiro LP, Penzes P, Wainwright BJ, Scott MP, Gronostajski RM, Bodén M, Piper M, Harvey TJ. Common Regulatory Targets of NFIA, NFIX and NFIB during Postnatal Cerebellar Development. CEREBELLUM (LONDON, ENGLAND) 2020; 19:89-101. [PMID: 31838646 PMCID: PMC7815246 DOI: 10.1007/s12311-019-01089-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Transcriptional regulation plays a central role in controlling neural stem and progenitor cell proliferation and differentiation during neurogenesis. For instance, transcription factors from the nuclear factor I (NFI) family have been shown to co-ordinate neural stem and progenitor cell differentiation within multiple regions of the embryonic nervous system, including the neocortex, hippocampus, spinal cord and cerebellum. Knockout of individual Nfi genes culminates in similar phenotypes, suggestive of common target genes for these transcription factors. However, whether or not the NFI family regulates common suites of genes remains poorly defined. Here, we use granule neuron precursors (GNPs) of the postnatal murine cerebellum as a model system to analyse regulatory targets of three members of the NFI family: NFIA, NFIB and NFIX. By integrating transcriptomic profiling (RNA-seq) of Nfia- and Nfix-deficient GNPs with epigenomic profiling (ChIP-seq against NFIA, NFIB and NFIX, and DNase I hypersensitivity assays), we reveal that these transcription factors share a large set of potential transcriptional targets, suggestive of complementary roles for these NFI family members in promoting neural development.
Collapse
Affiliation(s)
- James Fraser
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia
| | - Alexandra Essebier
- The School of Chemistry and Molecular Bioscience, The University of Queensland, Brisbane, 4072, Australia
| | - Alexander S Brown
- Department of Developmental Biology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Raul Ayala Davila
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia
| | - Danyon Harkins
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia
| | - Oressia Zalucki
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia
| | - Lauren P Shapiro
- Department of Physiology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Peter Penzes
- Department of Physiology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Brandon J Wainwright
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, 4072, Australia
| | - Matthew P Scott
- Department of Developmental Biology, School of Medicine, Stanford University, Stanford, CA, USA
| | - Richard M Gronostajski
- Department of Biochemistry, Program in Genetics, Genomics and Bioinformatics, Center of Excellence in Bioinformatics and Life Sciences, State University of New York at Buffalo, Buffalo, NY, USA
| | - Mikael Bodén
- The School of Chemistry and Molecular Bioscience, The University of Queensland, Brisbane, 4072, Australia
| | - Michael Piper
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia.
- Queensland Brain Institute, The University of Queensland, Brisbane, 4072, Australia.
| | - Tracey J Harvey
- The School of Biomedical Sciences, The University of Queensland, Brisbane, 4072, Australia.
| |
Collapse
|
8
|
Belokopytova PS, Nuriddinov MA, Mozheiko EA, Fishman D, Fishman V. Quantitative prediction of enhancer-promoter interactions. Genome Res 2019; 30:72-84. [PMID: 31804952 PMCID: PMC6961579 DOI: 10.1101/gr.249367.119] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 11/25/2019] [Indexed: 11/24/2022]
Abstract
Recent experimental and computational efforts have provided large data sets describing three-dimensional organization of mouse and human genomes and showed the interconnection between the expression profile, epigenetic state, and spatial interactions of loci. These interconnections were utilized to infer the spatial organization of chromatin, including enhancer–promoter contacts, from one-dimensional epigenetic marks. Here, we show that the predictive power of some of these algorithms is overestimated due to peculiar properties of the biological data. We propose an alternative approach, which provides high-quality predictions of chromatin interactions using information on gene expression and CTCF-binding alone. Using multiple metrics, we confirmed that our algorithm could efficiently predict the three-dimensional architecture of both normal and rearranged genomes.
Collapse
Affiliation(s)
- Polina S Belokopytova
- Institute of Cytology and Genetics SB RAS 630090, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia 630090
| | | | | | - Daniil Fishman
- Novosibirsk State University, Novosibirsk, Russia 630090
| | - Veniamin Fishman
- Institute of Cytology and Genetics SB RAS 630090, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia 630090
| |
Collapse
|
9
|
Ibn-Salem J, Andrade-Navarro MA. 7C: Computational Chromosome Conformation Capture by Correlation of ChIP-seq at CTCF motifs. BMC Genomics 2019; 20:777. [PMID: 31653198 PMCID: PMC6814980 DOI: 10.1186/s12864-019-6088-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 09/09/2019] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Knowledge of the three-dimensional structure of the genome is necessary to understand how gene expression is regulated. Recent experimental techniques such as Hi-C or ChIA-PET measure long-range chromatin interactions genome-wide but are experimentally elaborate, have limited resolution and such data is only available for a limited number of cell types and tissues. RESULTS While ChIP-seq was not designed to detect chromatin interactions, the formaldehyde treatment in the ChIP-seq protocol cross-links proteins with each other and with DNA. Consequently, also regions that are not directly bound by the targeted TF but interact with the binding site via chromatin looping are co-immunoprecipitated and sequenced. This produces minor ChIP-seq signals at loop anchor regions close to the directly bound site. We use the position and shape of ChIP-seq signals around CTCF motif pairs to predict whether they interact or not. We implemented this approach in a prediction method, termed Computational Chromosome Conformation Capture by Correlation of ChIP-seq at CTCF motifs (7C). We applied 7C to all CTCF motif pairs within 1 Mb in the human genome and validated predicted interactions with high-resolution Hi-C and ChIA-PET. A single ChIP-seq experiment from known architectural proteins (CTCF, Rad21, Znf143) but also from other TFs (like TRIM22 or RUNX3) predicts loops accurately. Importantly, 7C predicts loops in cell types and for TF ChIP-seq datasets not used in training. CONCLUSION 7C predicts chromatin loops which can help to associate TF binding sites to regulated genes. Furthermore, profiling of hundreds of ChIP-seq datasets results in novel candidate factors functionally involved in chromatin looping. Our method is available as an R/Bioconductor package: http://bioconductor.org/packages/sevenC .
Collapse
Affiliation(s)
- Jonas Ibn-Salem
- Faculty of Biology, Johannes Gutenberg University of Mainz, 55128, Mainz, Germany.
| | | |
Collapse
|
10
|
Essebier A, Lamprecht M, Piper M, Bodén M. Bioinformatics approaches to predict target genes from transcription factor binding data. Methods 2017; 131:111-119. [DOI: 10.1016/j.ymeth.2017.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 08/29/2017] [Accepted: 09/03/2017] [Indexed: 12/28/2022] Open
|
11
|
Lecoutre S, Oger F, Pourpe C, Butruille L, Marousez L, Dickes-Coopman A, Laborie C, Guinez C, Lesage J, Vieau D, Junien C, Eberlé D, Gabory A, Eeckhoute J, Breton C. Maternal obesity programs increased leptin gene expression in rat male offspring via epigenetic modifications in a depot-specific manner. Mol Metab 2017; 6:922-930. [PMID: 28752055 PMCID: PMC5518658 DOI: 10.1016/j.molmet.2017.05.010] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Revised: 05/15/2017] [Accepted: 05/22/2017] [Indexed: 12/13/2022] Open
Abstract
Objective According to the Developmental Origin of Health and Disease (DOHaD) concept, maternal obesity and accelerated growth in neonates predispose offspring to white adipose tissue (WAT) accumulation. In rodents, adipogenesis mainly develops during lactation. The mechanisms underlying the phenomenon known as developmental programming remain elusive. We previously reported that adult rat offspring from high-fat diet-fed dams (called HF) exhibited hypertrophic adipocyte, hyperleptinemia and increased leptin mRNA levels in a depot-specific manner. We hypothesized that leptin upregulation occurs via epigenetic malprogramming, which takes place early during development of WAT. Methods As a first step, we identified in silico two potential enhancers located upstream and downstream of the leptin transcription start site that exhibit strong dynamic epigenomic remodeling during adipocyte differentiation. We then focused on epigenetic modifications (methylation, hydroxymethylation, and histone modifications) of the promoter and the two potential enhancers regulating leptin gene expression in perirenal (pWAT) and inguinal (iWAT) fat pads of HF offspring during lactation (postnatal days 12 (PND12) and 21 (PND21)) and in adulthood. Results PND12 is an active period for epigenomic remodeling in both deposits especially in the upstream enhancer, consistent with leptin gene induction during adipogenesis. Unlike iWAT, some of these epigenetic marks were still observable in pWAT of weaned HF offspring. Retained marks were only visible in pWAT of 9-month-old HF rats that showed a persistent “expandable” phenotype. Conclusions Consistent with the DOHaD hypothesis, persistent epigenetic remodeling occurs at regulatory regions especially within intergenic sequences, linked to higher leptin gene expression in adult HF offspring in a depot-specific manner. The white adipose tissue is an important target of developmental programming. Higher leptin gene expression occurs in offspring from obese dams in a depot-specific manner. Leptin upregulation occurs via epigenetic malprogramming during development of adipose tissue. Persistent genomic epigenetic remodeling occurs in adipose tissue of offspring from obese dams. Intergenic regions were more affected than the leptin promoter region in offspring of obese dams.
Collapse
Affiliation(s)
- Simon Lecoutre
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Frederik Oger
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Charlène Pourpe
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Laura Butruille
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Lucie Marousez
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Anne Dickes-Coopman
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Christine Laborie
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Céline Guinez
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Jean Lesage
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Didier Vieau
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Claudine Junien
- UMR BDR, INRA, ENVA, Université Paris Saclay, 78350, Jouy-en-Josas, France; UVSQ, Université Versailles-Saint-Quentin-en-Yvelines, France
| | - Delphine Eberlé
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France
| | - Anne Gabory
- UMR BDR, INRA, ENVA, Université Paris Saclay, 78350, Jouy-en-Josas, France
| | - Jérôme Eeckhoute
- Univ. Lille, Inserm, CHU Lille, Institut Pasteur de Lille, U1011-EGID, F-59000 Lille, France
| | - Christophe Breton
- Univ. Lille, EA4489, Équipe Malnutrition Maternelle et Programmation des Maladies Métaboliques, F-59000 Lille, France.
| |
Collapse
|