1
|
Rapakoulia T, Lopez Ruiz De Vargas S, Omgba PA, Laupert V, Ulitsky I, Vingron M. CENTRE: a gradient boosting algorithm for Cell-type-specific ENhancer-Target pREdiction. Bioinformatics 2023; 39:btad687. [PMID: 37982748 PMCID: PMC10666202 DOI: 10.1093/bioinformatics/btad687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 10/11/2023] [Accepted: 11/17/2023] [Indexed: 11/21/2023] Open
Abstract
MOTIVATION Identifying target promoters of active enhancers is a crucial step for realizing gene regulation and deciphering phenotypes and diseases. Up to now, several computational methods were developed to predict enhancer gene interactions, but they require either many epigenomic and transcriptomic experimental assays to generate cell-type (CT)-specific predictions or a single experiment applied to a large cohort of CTs to extract correlations between activities of regulatory elements. Thus, inferring CT-specific enhancer gene interactions in unstudied or poorly annotated CTs becomes a laborious and costly task. RESULTS Here, we aim to infer CT-specific enhancer target interactions, using minimal experimental input. We introduce Cell-specific ENhancer Target pREdiction (CENTRE), a machine learning framework that predicts enhancer target interactions in a CT-specific manner, using only gene expression and ChIP-seq data for three histone modifications for the CT of interest. CENTRE exploits the wealth of available datasets and extracts cell-type agnostic statistics to complement the CT-specific information. CENTRE is thoroughly tested across many datasets and CTs and achieves equivalent or superior performance than existing algorithms that require massive experimental data. AVAILABILITY AND IMPLEMENTATION CENTRE's open-source code is available at GitHub via https://github.com/slrvv/CENTRE.
Collapse
Affiliation(s)
| | | | | | - Verena Laupert
- Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Igor Ulitsky
- Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
- Department of Immunology and Regenerative Biology, Weizmann Institute of Science, Rehovot 76100, Israel
- Department of Molecular Neuroscience, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Martin Vingron
- Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| |
Collapse
|
2
|
Ringel AR, Szabo Q, Chiariello AM, Chudzik K, Schöpflin R, Rothe P, Mattei AL, Zehnder T, Harnett D, Laupert V, Bianco S, Hetzel S, Glaser J, Phan MHQ, Schindler M, Ibrahim DM, Paliou C, Esposito A, Prada-Medina CA, Haas SA, Giere P, Vingron M, Wittler L, Meissner A, Nicodemi M, Cavalli G, Bantignies F, Mundlos S, Robson MI. Repression and 3D-restructuring resolves regulatory conflicts in evolutionarily rearranged genomes. Cell 2022; 185:3689-3704.e21. [PMID: 36179666 PMCID: PMC9567273 DOI: 10.1016/j.cell.2022.09.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 06/03/2022] [Accepted: 08/30/2022] [Indexed: 01/26/2023]
Abstract
Regulatory landscapes drive complex developmental gene expression, but it remains unclear how their integrity is maintained when incorporating novel genes and functions during evolution. Here, we investigated how a placental mammal-specific gene, Zfp42, emerged in an ancient vertebrate topologically associated domain (TAD) without adopting or disrupting the conserved expression of its gene, Fat1. In ESCs, physical TAD partitioning separates Zfp42 and Fat1 with distinct local enhancers that drive their independent expression. This separation is driven by chromatin activity and not CTCF/cohesin. In contrast, in embryonic limbs, inactive Zfp42 shares Fat1's intact TAD without responding to active Fat1 enhancers. However, neither Fat1 enhancer-incompatibility nor nuclear envelope-attachment account for Zfp42's unresponsiveness. Rather, Zfp42's promoter is rendered inert to enhancers by context-dependent DNA methylation. Thus, diverse mechanisms enabled the integration of independent Zfp42 regulation in the Fat1 locus. Critically, such regulatory complexity appears common in evolution as, genome wide, most TADs contain multiple independently expressed genes.
Collapse
Affiliation(s)
- Alessa R Ringel
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany; Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Quentin Szabo
- Institute of Human Genetics, University of Montpellier, CNRS, Montpellier, France
| | - Andrea M Chiariello
- Dipartimento di Fisica, Università di Napoli Federico II and INFN Napoli, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Konrad Chudzik
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Robert Schöpflin
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Patricia Rothe
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Alexandra L Mattei
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
| | - Tobias Zehnder
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Dermot Harnett
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Verena Laupert
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Simona Bianco
- Dipartimento di Fisica, Università di Napoli Federico II and INFN Napoli, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Sara Hetzel
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Juliane Glaser
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Mai H Q Phan
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Charité-Universitätsmedizin Berlin, BCRT-Berlin Institute of Health Center for Regenerative Therapies, Berlin, Germany
| | - Magdalena Schindler
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Daniel M Ibrahim
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany; Charité-Universitätsmedizin Berlin, BCRT-Berlin Institute of Health Center for Regenerative Therapies, Berlin, Germany
| | - Christina Paliou
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide, Seville, Spain
| | - Andrea Esposito
- Dipartimento di Fisica, Università di Napoli Federico II and INFN Napoli, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Cesar A Prada-Medina
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Stefan A Haas
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Peter Giere
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, Germany
| | - Martin Vingron
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Lars Wittler
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Alexander Meissner
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Mario Nicodemi
- Dipartimento di Fisica, Università di Napoli Federico II and INFN Napoli, Complesso Universitario di Monte Sant'Angelo, Naples, Italy; Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Giacomo Cavalli
- Institute of Human Genetics, University of Montpellier, CNRS, Montpellier, France
| | - Frédéric Bantignies
- Institute of Human Genetics, University of Montpellier, CNRS, Montpellier, France
| | - Stefan Mundlos
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany; Charité-Universitätsmedizin Berlin, BCRT-Berlin Institute of Health Center for Regenerative Therapies, Berlin, Germany.
| | - Michael I Robson
- Max Planck Institute for Molecular Genetics, Berlin, Germany; Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany; Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|