1
|
Yang JH, Hansen AS. Enhancer selectivity in space and time: from enhancer-promoter interactions to promoter activation. Nat Rev Mol Cell Biol 2024; 25:574-591. [PMID: 38413840 DOI: 10.1038/s41580-024-00710-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/30/2024] [Indexed: 02/29/2024]
Abstract
The primary regulators of metazoan gene expression are enhancers, originally functionally defined as DNA sequences that can activate transcription at promoters in an orientation-independent and distance-independent manner. Despite being crucial for gene regulation in animals, what mechanisms underlie enhancer selectivity for promoters, and more fundamentally, how enhancers interact with promoters and activate transcription, remain poorly understood. In this Review, we first discuss current models of enhancer-promoter interactions in space and time and how enhancers affect transcription activation. Next, we discuss different mechanisms that mediate enhancer selectivity, including repression, biochemical compatibility and regulation of 3D genome structure. Through 3D polymer simulations, we illustrate how the ability of 3D genome folding mechanisms to mediate enhancer selectivity strongly varies for different enhancer-promoter interaction mechanisms. Finally, we discuss how recent technical advances may provide new insights into mechanisms of enhancer-promoter interactions and how technical biases in methods such as Hi-C and Micro-C and imaging techniques may affect their interpretation.
Collapse
Affiliation(s)
- Jin H Yang
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Anders S Hansen
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Koch Institute for Integrative Cancer Research, Cambridge, MA, USA.
| |
Collapse
|
2
|
Hu W, Li Y, Wu Y, Guan L, Li M. A deep learning model for DNA enhancer prediction based on nucleotide position aware feature encoding. iScience 2024; 27:110030. [PMID: 38868182 PMCID: PMC11167433 DOI: 10.1016/j.isci.2024.110030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 04/23/2024] [Accepted: 05/16/2024] [Indexed: 06/14/2024] Open
Abstract
Enhancers, genomic DNA elements, regulate neighboring gene expression crucial for biological processes like cell differentiation and stress response. However, current machine learning methods for predicting DNA enhancers often underutilize hidden features in gene sequences, limiting model accuracy. Hence, this article proposes the PDCNN model, a deep learning-based enhancer prediction method. PDCNN extracts statistical nucleotide representations from gene sequences, discerning positional distribution information of nucleotides in modifier-like DNA sequences. With a convolutional neural network structure, PDCNN employs dual convolutional and fully connected layers. The cross-entropy loss function iteratively updates using a gradient descent algorithm, enhancing prediction accuracy. Model parameters are fine-tuned to select optimal combinations for training, achieving over 95% accuracy. Comparative analysis with traditional methods and existing models demonstrates PDCNN's robust feature extraction capability. It outperforms advanced machine learning methods in identifying DNA enhancers, presenting an effective method with broad implications for genomics, biology, and medical research.
Collapse
Affiliation(s)
- Wenxing Hu
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China
| | - Yelin Li
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China
| | - Yan Wu
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China
| | - Lixin Guan
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China
| | - Mengshan Li
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China
| |
Collapse
|
3
|
Kim KL, Rahme GJ, Goel VY, El Farran CA, Hansen AS, Bernstein BE. Dissection of a CTCF topological boundary uncovers principles of enhancer-oncogene regulation. Mol Cell 2024; 84:1365-1376.e7. [PMID: 38452764 PMCID: PMC10997458 DOI: 10.1016/j.molcel.2024.02.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/03/2024] [Accepted: 02/08/2024] [Indexed: 03/09/2024]
Abstract
Enhancer-gene communication is dependent on topologically associating domains (TADs) and boundaries enforced by the CCCTC-binding factor (CTCF) insulator, but the underlying structures and mechanisms remain controversial. Here, we investigate a boundary that typically insulates fibroblast growth factor (FGF) oncogenes but is disrupted by DNA hypermethylation in gastrointestinal stromal tumors (GISTs). The boundary contains an array of CTCF sites that enforce adjacent TADs, one containing FGF genes and the other containing ANO1 and its putative enhancers, which are specifically active in GIST and its likely cell of origin. We show that coordinate disruption of four CTCF motifs in the boundary fuses the adjacent TADs, allows the ANO1 enhancer to contact FGF3, and causes its robust induction. High-resolution micro-C maps reveal specific contact between transcription initiation sites in the ANO1 enhancer and FGF3 promoter that quantitatively scales with FGF3 induction such that modest changes in contact frequency result in strong changes in expression, consistent with a causal relationship.
Collapse
Affiliation(s)
- Kyung Lock Kim
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02215, USA
| | - Gilbert J Rahme
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02215, USA
| | - Viraat Y Goel
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Koch Institute for Integrative Cancer Research, Cambridge, MA 02139, USA
| | - Chadi A El Farran
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02215, USA
| | - Anders S Hansen
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Koch Institute for Integrative Cancer Research, Cambridge, MA 02139, USA
| | - Bradley E Bernstein
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02215, USA.
| |
Collapse
|
4
|
Mir BA, Rehman MU, Tayara H, Chong KT. Improving Enhancer Identification with a Multi-Classifier Stacked Ensemble Model. J Mol Biol 2023; 435:168314. [PMID: 37852600 DOI: 10.1016/j.jmb.2023.168314] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 10/06/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023]
Abstract
Enhancers are DNA regions that are responsible for controlling the expression of genes. Enhancers are usually found upstream or downstream of a gene, or even inside a gene's intron region, but are normally located at a distant location from the genes they control. By integrating experimental and computational approaches, it is possible to uncover enhancers within DNA sequences, which possess regulatory properties. Experimental techniques such as ChIP-seq and ATAC-seq can identify genomic regions that are associated with transcription factors or accessible to regulatory proteins. On the other hand, computational techniques can predict enhancers based on sequence features and epigenetic modifications. In our study, we have developed a multi-classifier stacked ensemble (MCSE-enhancer) model that can accurately identify enhancers. We utilized feature descriptors from various physiochemical properties as input for our six baseline classifiers and built a stacked classifier, which outperformed previous enhancer classification techniques in terms of accuracy, specificity, sensitivity, and Mathew's correlation coefficient. Our model achieved an accuracy of 81.5%, representing a 2-3% improvement over existing models.
Collapse
Affiliation(s)
- Bilal Ahmad Mir
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea.
| | - Mobeen Ur Rehman
- Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi 127788, United Arab Emirates.
| | - Hilal Tayara
- School of international Engineering and Science, Jeonbuk National University, Jeonju 54896, South Korea.
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea; Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, South Korea.
| |
Collapse
|
5
|
Schaeffer M, Nollmann M. Contributions of 3D chromatin structure to cell-type-specific gene regulation. Curr Opin Genet Dev 2023; 79:102032. [PMID: 36893484 DOI: 10.1016/j.gde.2023.102032] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 02/05/2023] [Accepted: 02/06/2023] [Indexed: 03/09/2023]
Abstract
Eukaryotic genomes are organized in 3D in a multiscale manner, and different mechanisms acting at each of these scales can contribute to transcriptional regulation. However, the large single-cell variability in 3D chromatin structures represents a challenge to understand how transcription may be differentially regulated between cell types in a robust and efficient manner. Here, we describe the different mechanisms by which 3D chromatin structure was shown to contribute to cell-type-specific transcriptional regulation. Excitingly, several novel methodologies able to measure 3D chromatin conformation and transcription in single cells in their native tissue context, or to detect the dynamics of cis-regulatory interactions, are starting to allow quantitative dissection of chromatin structure noise and relate it to how transcription may be regulated between different cell types and cell states.
Collapse
Affiliation(s)
- Marie Schaeffer
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U1054, Montpellier, France
| | - Marcelo Nollmann
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U1054, Montpellier, France.
| |
Collapse
|
6
|
Basith S, Hasan MM, Lee G, Wei L, Manavalan B. Integrative machine learning framework for the identification of cell-specific enhancers from the human genome. Brief Bioinform 2021; 22:6315815. [PMID: 34226917 DOI: 10.1093/bib/bbab252] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 06/08/2021] [Accepted: 06/14/2021] [Indexed: 02/06/2023] Open
Abstract
Enhancers are deoxyribonucleic acid (DNA) fragments which when bound by transcription factors enhance the transcription of related genes. Due to its sporadic distribution and similar fractions, identification of enhancers from the human genome seems a daunting task. Compared to the traditional experimental approaches, computational methods with easy-to-use platforms could be efficiently applied to annotate enhancers' functions and physiological roles. In this aspect, several bioinformatics tools have been developed to identify enhancers. Despite their spectacular performances, existing methods have certain drawbacks and limitations, including fixed length of sequences being utilized for model development and cell-specificity negligence. A novel predictor would be beneficial in the context of genome-wide enhancer prediction by addressing the above-mentioned issues. In this study, we constructed new datasets for eight different cell types. Utilizing these data, we proposed an integrative machine learning (ML)-based framework called Enhancer-IF for identifying cell-specific enhancers. Enhancer-IF comprehensively explores a wide range of heterogeneous features with five commonly used ML methods (random forest, extremely randomized tree, multilayer perceptron, support vector machine and extreme gradient boosting). Specifically, these five classifiers were trained with seven encodings and obtained 35 baseline models. The output of these baseline models was integrated and again inputted to five classifiers for the construction of five meta-models. Finally, the integration of five meta-models through ensemble learning improved the model robustness. Our proposed approach showed an excellent prediction performance compared to the baseline models on both training and independent datasets in different cell types, thus highlighting the superiority of our approach in the identification of the enhancers. We assume that Enhancer-IF will be a valuable tool for screening and identifying potential enhancers from the human DNA sequences.
Collapse
Affiliation(s)
- Shaherin Basith
- Department of Physiology, Ajou University School of Medicine, Republic of Korea
| | - Md Mehedi Hasan
- Tulane University, USA.,Kyushu Institute of Technology, Japan
| | - Gwang Lee
- Department of Physiology, Ajou University School of Medicine, Republic of Korea
| | - Leyi Wei
- Xiamen University, China.,Shandong University, China
| | | |
Collapse
|
7
|
Starzer AM, Berghoff AS, Hamacher R, Tomasich E, Feldmann K, Hatziioannou T, Traint S, Lamm W, Noebauer-Huhmann IM, Furtner J, Müllauer L, Amann G, Bauer S, Schildhaus HU, Preusser M, Heller G, Brodowicz T. Tumor DNA methylation profiles correlate with response to anti-PD-1 immune checkpoint inhibitor monotherapy in sarcoma patients. J Immunother Cancer 2021; 9:jitc-2020-001458. [PMID: 33762319 PMCID: PMC7993298 DOI: 10.1136/jitc-2020-001458] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/19/2021] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Some sarcomas respond to immune checkpoint inhibition, but predictive biomarkers are unknown. We analyzed tumor DNA methylation profiles in relation to immunological parameters and response to anti-programmed cell death 1 (anti-PD-1) immune checkpoint inhibitor (ICI) therapy in patients with sarcoma. PATIENTS AND METHODS We retrospectively identified adult patients who had received anti-PD-1 ICI therapy for recurrent sarcoma in two independent centers. We performed (1) blinded radiological response evaluation according to immune response evaluation criteria in solid tumors (iRECIST) ; (2) tumor DNA methylation profiling of >850,000 probes using Infinium MethylationEPIC microarrays; (3) analysis of tumor-infiltrating immune cell subsets (CD3, CD8, CD45RO, FOXP3) and intratumoral expression of immune checkpoint molecules (PD-L1, PD-1, LAG-3) using immunohistochemistry; and (4) evaluation of blood-based systemic inflammation scores (neutrophil-to-lymphocyte ratio, leucocyte-to-lymphocyte ratio, monocyte-to-lymphocyte ratio, platelet-to-lymphocyte ratio). Response to anti-PD-1 ICI therapy was bioinformatically and statistically correlated with DNA methylation profiles and immunological data. RESULTS 35 patients (median age of 50 (23-81) years; 18 females, 17 males; 27 soft tissue sarcomas; 8 osteosarcomas) were included in this study. The objective response rate to anti-PD-1 ICI therapy was 22.9% with complete responses in 3 out of 35 and partial responses in 5 out of 35 patients. Adjustment of DNA methylation data for tumor-infiltrating immune cells resulted in identification of methylation differences between responders and non-responders to anti-PD-1 ICI. 2453 differentially methylated CpG sites (DMPs; 2043 with decreased and 410 with increased methylation) were identified. Clustering of sarcoma samples based on these DMPs revealed two main clusters: methylation cluster 1 (MC1) consisted of 73% responders and methylation cluster 2 (MC2) contained only non-responders to anti-PD-1 ICI. Median progression-free survival from anti-PD-1 therapy start of MC1 and MC2 patients was 16.5 and 1.9 months, respectively (p=0.001). Median overall survival of these patients was 34.4 and 8.0 months, respectively (p=0.029). The most prominent DNA methylation differences were found in pathways implicated in Rap1 signaling, focal adhesion, adherens junction Phosphoinositide 3-kinase (PI3K)-Akt signaling and extracellular matrix (ECM)-receptor interaction. CONCLUSIONS Our data demonstrate that tumor DNA methylation profiles may serve as a predictive marker for response to anti-PD-1 ICI therapy in sarcoma.
Collapse
Affiliation(s)
- Angelika M Starzer
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria.,Christian Doppler Laboratory for Personalized Immunotherapy, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Anna S Berghoff
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria.,Christian Doppler Laboratory for Personalized Immunotherapy, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Rainer Hamacher
- Department of Medical Oncology, Sarcoma Center, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Duisburg, Germany
| | - Erwin Tomasich
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Katharina Feldmann
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Teresa Hatziioannou
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Stefan Traint
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Wolfgang Lamm
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Iris M Noebauer-Huhmann
- Department of Biomedical Imaging and Image-guided Therapy, Division of General and Paediatric Radiology, Medical University of Vienna, Vienna, Austria
| | - Julia Furtner
- Department of Biomedical Imaging and Image-guided Therapy, Division of General and Paediatric Radiology, Medical University of Vienna, Vienna, Austria
| | - Leonhard Müllauer
- Department of Pathology, Medical University of Vienna, Vienna, Austria
| | - Gabriele Amann
- Department of Pathology, Medical University of Vienna, Vienna, Austria
| | - Sebastian Bauer
- Department of Medical Oncology, Sarcoma Center, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Duisburg, Germany
| | | | - Matthias Preusser
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria.,Christian Doppler Laboratory for Personalized Immunotherapy, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Gerwin Heller
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| | - Thomas Brodowicz
- Division of Oncology, Department of Medicine I, Medical University of Vienna, Vienna, Austria
| |
Collapse
|
8
|
|