1
|
Wall BPG, Nguyen M, Harrell JC, Dozmorov MG. Machine and Deep Learning Methods for Predicting 3D Genome Organization. Methods Mol Biol 2025; 2856:357-400. [PMID: 39283464 DOI: 10.1007/978-1-0716-4136-1_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Three-dimensional (3D) chromatin interactions, such as enhancer-promoter interactions (EPIs), loops, topologically associating domains (TADs), and A/B compartments, play critical roles in a wide range of cellular processes by regulating gene expression. Recent development of chromatin conformation capture technologies has enabled genome-wide profiling of various 3D structures, even with single cells. However, current catalogs of 3D structures remain incomplete and unreliable due to differences in technology, tools, and low data resolution. Machine learning methods have emerged as an alternative to obtain missing 3D interactions and/or improve resolution. Such methods frequently use genome annotation data (ChIP-seq, DNAse-seq, etc.), DNA sequencing information (k-mers and transcription factor binding site (TFBS) motifs), and other genomic properties to learn the associations between genomic features and chromatin interactions. In this review, we discuss computational tools for predicting three types of 3D interactions (EPIs, chromatin interactions, and TAD boundaries) and analyze their pros and cons. We also point out obstacles to the computational prediction of 3D interactions and suggest future research directions.
Collapse
Affiliation(s)
- Brydon P G Wall
- Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA, USA
| | - My Nguyen
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA
| | - J Chuck Harrell
- Department of Pathology, Virginia Commonwealth University, Richmond, VA, USA
- Massey Comprehensive Cancer Center, Virginia Commonwealth University, Richmond, VA, USA
- Center for Pharmaceutical Engineering, Virginia Commonwealth University, Richmond, VA, USA
| | - Mikhail G Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA.
- Department of Pathology, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
2
|
Wall BPG, Nguyen M, Harrell JC, Dozmorov MG. Machine and deep learning methods for predicting 3D genome organization. ARXIV 2024:arXiv:2403.03231v1. [PMID: 38495565 PMCID: PMC10942493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Three-Dimensional (3D) chromatin interactions, such as enhancer-promoter interactions (EPIs), loops, Topologically Associating Domains (TADs), and A/B compartments play critical roles in a wide range of cellular processes by regulating gene expression. Recent development of chromatin conformation capture technologies has enabled genome-wide profiling of various 3D structures, even with single cells. However, current catalogs of 3D structures remain incomplete and unreliable due to differences in technology, tools, and low data resolution. Machine learning methods have emerged as an alternative to obtain missing 3D interactions and/or improve resolution. Such methods frequently use genome annotation data (ChIP-seq, DNAse-seq, etc.), DNA sequencing information (k-mers, Transcription Factor Binding Site (TFBS) motifs), and other genomic properties to learn the associations between genomic features and chromatin interactions. In this review, we discuss computational tools for predicting three types of 3D interactions (EPIs, chromatin interactions, TAD boundaries) and analyze their pros and cons. We also point out obstacles of computational prediction of 3D interactions and suggest future research directions.
Collapse
Affiliation(s)
- Brydon P. G. Wall
- Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - My Nguyen
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, 23298, USA
| | - J. Chuck Harrell
- Department of Pathology, Virginia Commonwealth University, Richmond, VA, 23284, USA
- Massey Comprehensive Cancer Center, Virginia Commonwealth University, Richmond, VA 23298, USA
- Center for Pharmaceutical Engineering, Virginia Commonwealth University, Richmond, VA 23298, USA
| | - Mikhail G. Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, 23298, USA
- Department of Pathology, Virginia Commonwealth University, Richmond, VA, 23284, USA
| |
Collapse
|
3
|
Mayayo-Peralta I, Gregoricchio S, Schuurman K, Yavuz S, Zaalberg A, Kojic A, Abbott N, Geverts B, Beerthuijzen S, Siefert J, Severson TM, van Baalen M, Hoekman L, Lieftink C, Altelaar M, Beijersbergen RL, Houtsmuller A, Prekovic S, Zwart W. PAXIP1 and STAG2 converge to maintain 3D genome architecture and facilitate promoter/enhancer contacts to enable stress hormone-dependent transcription. Nucleic Acids Res 2023; 51:9576-9593. [PMID: 37070193 PMCID: PMC10570044 DOI: 10.1093/nar/gkad267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 03/03/2023] [Accepted: 04/12/2023] [Indexed: 04/19/2023] Open
Abstract
How steroid hormone receptors (SHRs) regulate transcriptional activity remains partly understood. Upon activation, SHRs bind the genome together with a co-regulator repertoire, crucial to induce gene expression. However, it remains unknown which components of the SHR-recruited co-regulator complex are essential to drive transcription following hormonal stimuli. Through a FACS-based genome-wide CRISPR screen, we functionally dissected the Glucocorticoid Receptor (GR) complex. We describe a functional cross-talk between PAXIP1 and the cohesin subunit STAG2, critical for regulation of gene expression by GR. Without altering the GR cistrome, PAXIP1 and STAG2 depletion alter the GR transcriptome, by impairing the recruitment of 3D-genome organization proteins to the GR complex. Importantly, we demonstrate that PAXIP1 is required for stability of cohesin on chromatin, its localization to GR-occupied sites, and maintenance of enhancer-promoter interactions. In lung cancer, where GR acts as tumor suppressor, PAXIP1/STAG2 loss enhances GR-mediated tumor suppressor activity by modifying local chromatin interactions. All together, we introduce PAXIP1 and STAG2 as novel co-regulators of GR, required to maintain 3D-genome architecture and drive the GR transcriptional programme following hormonal stimuli.
Collapse
Affiliation(s)
- Isabel Mayayo-Peralta
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Sebastian Gregoricchio
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Karianne Schuurman
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Selçuk Yavuz
- Erasmus Optical Imaging Center, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherland
| | - Anniek Zaalberg
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Aleksandar Kojic
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Nina Abbott
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Bart Geverts
- Erasmus Optical Imaging Center, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherland
- Department of Pathology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Suzanne Beerthuijzen
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Joseph Siefert
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Tesa M Severson
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Martijn van Baalen
- Flow Cytometry Facility, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Liesbeth Hoekman
- Proteomics Facility, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Cor Lieftink
- Division of Molecular Carcinogenesis, The NKI Robotics and Screening Centre, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Maarten Altelaar
- Proteomics Facility, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research, Utrecht Institute for Pharmaceutical Sciences, Utrecht University and Netherlands Proteomics Centre, Utrecht, The Netherlands
| | - Roderick L Beijersbergen
- Division of Molecular Carcinogenesis, The NKI Robotics and Screening Centre, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Adriaan B Houtsmuller
- Erasmus Optical Imaging Center, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherland
| | - Stefan Prekovic
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Wilbert Zwart
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Laboratory of Chemical Biology and Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
| |
Collapse
|
4
|
Xu H, Yi X, Fan X, Wu C, Wang W, Chu X, Zhang S, Dong X, Wang Z, Wang J, Zhou Y, Zhao K, Yao H, Zheng N, Wang J, Chen Y, Plewczynski D, Sham PC, Chen K, Huang D, Li MJ. Inferring CTCF-binding patterns and anchored loops across human tissues and cell types. PATTERNS (NEW YORK, N.Y.) 2023; 4:100798. [PMID: 37602215 PMCID: PMC10436006 DOI: 10.1016/j.patter.2023.100798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 01/25/2023] [Accepted: 06/20/2023] [Indexed: 08/22/2023]
Abstract
CCCTC-binding factor (CTCF) is a transcription regulator with a complex role in gene regulation. The recognition and effects of CTCF on DNA sequences, chromosome barriers, and enhancer blocking are not well understood. Existing computational tools struggle to assess the regulatory potential of CTCF-binding sites and their impact on chromatin loop formation. Here we have developed a deep-learning model, DeepAnchor, to accurately characterize CTCF binding using high-resolution genomic/epigenomic features. This has revealed distinct chromatin and sequence patterns for CTCF-mediated insulation and looping. An optimized implementation of a previous loop model based on DeepAnchor score excels in predicting CTCF-anchored loops. We have established a compendium of CTCF-anchored loops across 52 human tissue/cell types, and this suggests that genomic disruption of these loops could be a general mechanism of disease pathogenesis. These computational models and resources can help investigate how CTCF-mediated cis-regulatory elements shape context-specific gene regulation in cell development and disease progression.
Collapse
Affiliation(s)
- Hang Xu
- Department of Epidemiology and Biostatistics, Key Laboratory of Prevention and Control of Human Major Diseases (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
- Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A∗STAR), Singapore 138648, Singapore
| | - Xianfu Yi
- Department of Bioinformatics, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Xutong Fan
- Department of Bioinformatics, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Chengyue Wu
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Wei Wang
- Department of Epidemiology and Biostatistics, Key Laboratory of Prevention and Control of Human Major Diseases (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Xinlei Chu
- Department of Epidemiology and Biostatistics, Key Laboratory of Prevention and Control of Human Major Diseases (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Shijie Zhang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Xiaobao Dong
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Zhao Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Jianhua Wang
- Department of Bioinformatics, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Yao Zhou
- Department of Bioinformatics, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Ke Zhao
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Hongcheng Yao
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, The University of Hong Kong, Hong Kong 999077, China
| | - Nan Zheng
- Department of Network Security and Informatization, Tianjin Medical University, Tianjin 300070, China
| | - Junwen Wang
- Department of Health Sciences Research and Center for Individualized Medicine, Mayo Clinic, Scottsdale, AZ 85259, USA
| | - Yupeng Chen
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| | - Dariusz Plewczynski
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
| | - Pak Chung Sham
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, The University of Hong Kong, Hong Kong 999077, China
| | - Kexin Chen
- Department of Epidemiology and Biostatistics, Key Laboratory of Prevention and Control of Human Major Diseases (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Dandan Huang
- Wuxi School of Medicine, Jiangnan University, Wuxi 214122, China
| | - Mulin Jun Li
- Department of Epidemiology and Biostatistics, Key Laboratory of Prevention and Control of Human Major Diseases (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
- Department of Bioinformatics, The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin 300070, China
| |
Collapse
|
5
|
Sweet DR, Padmanabhan R, Liao X, Dashora HR, Tang X, Nayak L, Jain R, De Val S, Vinayachandran V, Jain MK. Krüppel-Like Factors Orchestrate Endothelial Gene Expression Through Redundant and Non-Redundant Enhancer Networks. J Am Heart Assoc 2023; 12:e024303. [PMID: 36789992 PMCID: PMC10111506 DOI: 10.1161/jaha.121.024303] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
Background Proper function of endothelial cells is critical for vascular integrity and organismal survival. Studies over the past 2 decades have identified 2 members of the KLF (Krüppel-like factor) family of proteins, KLF2 and KLF4, as nodal regulators of endothelial function. Strikingly, inducible postnatal deletion of both KLF2 and KLF4 resulted in widespread vascular leak, coagulopathy, and rapid death. Importantly, while transcriptomic studies revealed profound alterations in gene expression, the molecular mechanisms underlying these changes remain poorly understood. Here, we seek to determine mechanisms of KLF2 and KLF4 transcriptional control in multiple vascular beds to further understand their roles as critical endothelial regulators. Methods and Results We integrate chromatin occupancy and transcription studies from multiple transgenic mouse models to demonstrate that KLF2 and KLF4 have overlapping yet distinct binding patterns and transcriptional targets in heart and lung endothelium. Mechanistically, KLFs use open chromatin regions in promoters and enhancers and bind in context-specific patterns that govern transcription in microvasculature. Importantly, this occurs during homeostasis in vivo without additional exogenous stimuli. Conclusions Together, this work provides mechanistic insight behind the well-described transcriptional and functional heterogeneity seen in vascular populations, while also establishing tools into exploring microvascular endothelial dynamics in vivo.
Collapse
Affiliation(s)
- David R Sweet
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH.,Department of Pathology Case Western Reserve University Cleveland OH
| | - Roshan Padmanabhan
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH
| | - Xudong Liao
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH
| | - Himanshu R Dashora
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH
| | - Xinmiao Tang
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH
| | - Lalitha Nayak
- Division of Hematology and Oncology University Hospitals Cleveland Medical Center Cleveland OH
| | - Rajan Jain
- Department of Cell and Developmental Biology, Perelman School of Medicine University of Pennsylvania Philadelphia PA
| | - Sarah De Val
- Department of Physiology, Anatomy and Genetics University of Oxford UK
| | - Vinesh Vinayachandran
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH
| | - Mukesh K Jain
- Case Cardiovascular Research Institute, Case Western Reserve University, and Harrington Heart and Vascular Institute University Hospitals Cleveland Medical Center Cleveland OH.,Division of Biology and Medicine Warren Alpert Medical School of Brown University Providence RI
| |
Collapse
|
6
|
Bentsen M, Heger V, Schultheis H, Kuenne C, Looso M. TF-COMB - discovering grammar of transcription factor binding sites. Comput Struct Biotechnol J 2022; 20:4040-4051. [PMID: 35983231 PMCID: PMC9358416 DOI: 10.1016/j.csbj.2022.07.025] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 07/12/2022] [Indexed: 02/07/2023] Open
Abstract
Cooperativity between transcription factors is important to regulate target gene expression. In particular, the binding grammar of TFs in relation to each other, as well as in the context of other genomic elements, is crucial for TF functionality. However, tools to easily uncover co-occurrence between DNA-binding proteins, and investigate the regulatory modules of TFs, are limited. Here we present TF-COMB (Transcription Factor Co-Occurrence using Market Basket analysis) - a tool to investigate co-occurring TFs and binding grammar within regulatory regions. We found that TF-COMB can accurately identify known co-occurring TFs from ChIP-seq data, as well as uncover preferential localization to other genomic elements. With the use of ATAC-seq footprinting and TF motif locations, we found that TFs exhibit both preferred orientation and distance in relation to each other, and that these are biologically significant. Finally, we extended the analysis to not only investigate individual TF pairs, but also TF pairs in the context of networks, which enabled the investigation of TF complexes and TF hubs. In conclusion, TF-COMB is a flexible tool to investigate various aspects of TF binding grammar.
Collapse
Affiliation(s)
- Mette Bentsen
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Vanessa Heger
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Hendrik Schultheis
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Carsten Kuenne
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Mario Looso
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Corresponding author at: Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany.
| |
Collapse
|
7
|
Rozenwald MB, Galitsyna AA, Sapunov GV, Khrameeva EE, Gelfand MS. A machine learning framework for the prediction of chromatin folding in Drosophila using epigenetic features. PeerJ Comput Sci 2020; 6:e307. [PMID: 33816958 PMCID: PMC7924456 DOI: 10.7717/peerj-cs.307] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 09/30/2020] [Indexed: 05/03/2023]
Abstract
Technological advances have lead to the creation of large epigenetic datasets, including information about DNA binding proteins and DNA spatial structure. Hi-C experiments have revealed that chromosomes are subdivided into sets of self-interacting domains called Topologically Associating Domains (TADs). TADs are involved in the regulation of gene expression activity, but the mechanisms of their formation are not yet fully understood. Here, we focus on machine learning methods to characterize DNA folding patterns in Drosophila based on chromatin marks across three cell lines. We present linear regression models with four types of regularization, gradient boosting, and recurrent neural networks (RNN) as tools to study chromatin folding characteristics associated with TADs given epigenetic chromatin immunoprecipitation data. The bidirectional long short-term memory RNN architecture produced the best prediction scores and identified biologically relevant features. Distribution of protein Chriz (Chromator) and histone modification H3K4me3 were selected as the most informative features for the prediction of TADs characteristics. This approach may be adapted to any similar biological dataset of chromatin features across various cell lines and species. The code for the implemented pipeline, Hi-ChiP-ML, is publicly available: https://github.com/MichalRozenwald/Hi-ChIP-ML.
Collapse
Affiliation(s)
- Michal B. Rozenwald
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
| | | | - Grigory V. Sapunov
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
- Intento, Inc., Berkeley, CA, USA
| | | | - Mikhail S. Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| |
Collapse
|
8
|
Belokopytova PS, Nuriddinov MA, Mozheiko EA, Fishman D, Fishman V. Quantitative prediction of enhancer-promoter interactions. Genome Res 2019; 30:72-84. [PMID: 31804952 PMCID: PMC6961579 DOI: 10.1101/gr.249367.119] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 11/25/2019] [Indexed: 11/24/2022]
Abstract
Recent experimental and computational efforts have provided large data sets describing three-dimensional organization of mouse and human genomes and showed the interconnection between the expression profile, epigenetic state, and spatial interactions of loci. These interconnections were utilized to infer the spatial organization of chromatin, including enhancer–promoter contacts, from one-dimensional epigenetic marks. Here, we show that the predictive power of some of these algorithms is overestimated due to peculiar properties of the biological data. We propose an alternative approach, which provides high-quality predictions of chromatin interactions using information on gene expression and CTCF-binding alone. Using multiple metrics, we confirmed that our algorithm could efficiently predict the three-dimensional architecture of both normal and rearranged genomes.
Collapse
Affiliation(s)
- Polina S Belokopytova
- Institute of Cytology and Genetics SB RAS 630090, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia 630090
| | | | | | - Daniil Fishman
- Novosibirsk State University, Novosibirsk, Russia 630090
| | - Veniamin Fishman
- Institute of Cytology and Genetics SB RAS 630090, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia 630090
| |
Collapse
|