1
|
Tang L, Liao J, Hill MC, Hu J, Zhao Y, Ellinor P, Li M. MMCT-Loop: a mix model-based pipeline for calling targeted 3D chromatin loops. Nucleic Acids Res 2024; 52:e25. [PMID: 38281134 PMCID: PMC10954456 DOI: 10.1093/nar/gkae029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 12/03/2023] [Accepted: 01/12/2024] [Indexed: 01/30/2024] Open
Abstract
Protein-specific Chromatin Conformation Capture (3C)-based technologies have become essential for identifying distal genomic interactions with critical roles in gene regulation. The standard techniques include Chromatin Interaction Analysis by Paired-End Tag (ChIA-PET), in situ Hi-C followed by chromatin immunoprecipitation (HiChIP) also known as PLAC-seq. To identify chromatin interactions from these data, a variety of computational methods have emerged. Although these state-of-art methods address many issues with loop calling, only few methods can fit different data types simultaneously, and the accuracy as well as the efficiency these approaches remains limited. Here we have generated a pipeline, MMCT-Loop, which ensures the accurate identification of strong loops as well as dynamic or weak loops through a mixed model. MMCT-Loop outperforms existing methods in accuracy, and the detected loops show higher activation functionality. To highlight the utility of MMCT-Loop, we applied it to conformational data derived from neural stem cell (NSCs) and uncovered several previously unidentified regulatory regions for key master regulators of stem cell identity. MMCT-Loop is an accurate and efficient loop caller for targeted conformation capture data, which supports raw data or pre-processed valid pairs as input, the output interactions are formatted and easily uploaded to a genome browser for visualization.
Collapse
Affiliation(s)
- Li Tang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Jiaqi Liao
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Matthew C Hill
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02129, USA
- Cardiovascular Disease Initiative, The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Jiaxin Hu
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Yichao Zhao
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Patrick T Ellinor
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02129, USA
- Cardiovascular Disease Initiative, The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
2
|
Lou S, Lin S. An in silico procedure for generating protein-mediated chromatin interaction data and comparison of significant interaction calling methods. PLoS One 2024; 19:e0287521. [PMID: 38232107 PMCID: PMC10793909 DOI: 10.1371/journal.pone.0287521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 06/07/2023] [Indexed: 01/19/2024] Open
Abstract
The ability to simulate high-throughput data with high fidelity to real experimental data is fundamental for benchmarking methods used to detect true long-range chromatin interactions mediated by a specific protein. Yet, such tools are not currently available. To fill this gap, we develop an in silico experimental procedure, ChIA-Sim, which imitates the experimental procedures that produce real ChIA-PET, Hi-ChIP, or PLAC-seq data. We show the fidelity of ChIA-Sim to real data by using guiding characteristics of several real datasets to generate data using the simulation procedure. We also used ChIA-Sim data to demonstrate the use of our in silico procedure in benchmarking methods for significant interactions analysis by evaluating four methods for significant interaction calling (SIC). In particular, we assessed each method's performance in terms of correct identification of long-range interactions. We further analyzed four experimental datasets from publicly available databases and shew that the trend of the results are consistent with those seen in data generated from ChIA-Sim. This serves as additional evidence that ChIA-Sim closely resembles data produced from the experimental protocols it models after.
Collapse
Affiliation(s)
- Shuyuan Lou
- Interdisciplinary Ph.D. Program in Biostatistics, The Ohio State University, Columbus, OH, United States of America
| | - Shili Lin
- Interdisciplinary Ph.D. Program in Biostatistics, The Ohio State University, Columbus, OH, United States of America
- Department of Statistics, The Ohio State University, Columbus, OH, United States of America
- Translational Data Analytics Institute, The Ohio State University, Columbus, OH, United States of America
| |
Collapse
|
3
|
Wei C, Jia L, Huang X, Tan J, Wang M, Niu J, Hou Y, Sun J, Zeng P, Wang J, Qing L, Ma L, Liu X, Tang X, Li F, Jiang S, Liu J, Li T, Fan L, Sun Y, Gao J, Li C, Ding J. CTCF organizes inter-A compartment interactions through RYBP-dependent phase separation. Cell Res 2022; 32:744-760. [PMID: 35768498 PMCID: PMC9343660 DOI: 10.1038/s41422-022-00676-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 05/10/2022] [Indexed: 12/13/2022] Open
Abstract
Chromatin is spatially organized into three-dimensional structures at different levels including A/B compartments, topologically associating domains and loops. The canonical CTCF-mediated loop extrusion model can explain the formation of loops. However, the organization mechanisms underlying long-range chromatin interactions such as interactions between A-A compartments are still poorly understood. Here we show that different from the canonical loop extrusion model, RYBP-mediated phase separation of CTCF organizes inter-A compartment interactions. Based on this model, we designed and verified an induced CTCF phase separation system in embryonic stem cells (ESCs), which facilitated inter-A compartment interactions, improved self-renewal of ESCs and inhibited their differentiation toward neural progenitor cells. These findings support a novel and non-canonical role of CTCF in organizing long-range chromatin interactions via phase separation.
Collapse
Affiliation(s)
- Chao Wei
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Lumeng Jia
- School of Life Sciences, Peking University, Beijing, China
| | - Xiaona Huang
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Jin Tan
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Mulan Wang
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Jing Niu
- School of Medicine, Tsinghua University, Beijing, China
| | - Yingping Hou
- Peking-Tsinghua Center for Life Sciences; Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Jun Sun
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Pengguihang Zeng
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Department of Cell Biology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, Guangdong, China
| | - Jia Wang
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Li Qing
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Lin Ma
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Xinyi Liu
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Xiuxiao Tang
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Fenjie Li
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Department of Pediatric Surgery, Guangdong Provincial Key Laboratory of Research in Structural Birth Defect Disease, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, Guangdong, China
| | - Shaoshuai Jiang
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Jingxin Liu
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China.,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Tingting Li
- State Key Laboratory of Proteomics, National Center of Biomedical Analysis, Institute of Basic Medical Sciences, Beijing, China
| | - Lili Fan
- Guangzhou Key Laboratory of Formula-Pattern of Traditional Chinese Medicine, School of Traditional Chinese Medicine, Jinan University, Guangzhou, Guangdong, China
| | - Yujie Sun
- School of Life Sciences, Peking University, Beijing, China.,State Key Laboratory of Membrane Biology, Biomedical pioneering innovation center (BIOPIC), Peking University, Beijing, China
| | - Juntao Gao
- MOE Key Laboratory of Bioinformatics; Bioinformatics Division, BNRist; Department of Automation; Center for Synthetic & Systems Biology, Tsinghua University, Beijing, China
| | - Cheng Li
- School of Life Sciences, Peking University, Beijing, China. .,Center for Bioinformatics, Center for Statistical Science, Peking University, Beijing, China.
| | - Junjun Ding
- RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China. .,Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, Guangdong, China. .,Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.
| |
Collapse
|
4
|
Tang L, Hill MC, Ellinor PT, Li M. Bacon: a comprehensive computational benchmarking framework for evaluating targeted chromatin conformation capture-specific methodologies. Genome Biol 2022; 23:30. [PMID: 35063001 PMCID: PMC8780810 DOI: 10.1186/s13059-021-02597-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 12/30/2021] [Indexed: 01/10/2023] Open
Abstract
Chromatin conformation capture (3C)-based technologies have enabled the accurate detection of topological genomic interactions, and the adoption of ChIP techniques to 3C-based protocols makes it possible to identify long-range interactions. To analyze these large and complex datasets, computational methods are undergoing rapid and expansive evolution. Thus, a thorough evaluation of these analytical pipelines is necessary to identify which commonly used algorithms and processing pipelines need to be improved. Here we present a comprehensive benchmark framework, Bacon, to evaluate the performance of several computational methods. Finally, we provide practical recommendations for users working with HiChIP and/or ChIA-PET analyses.
Collapse
Affiliation(s)
- Li Tang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Matthew C Hill
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02129, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Patrick T Ellinor
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02129, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China.
| |
Collapse
|
5
|
Zhang MQ. A personal journey on cracking the genomic codes. QUANTITATIVE BIOLOGY 2021. [DOI: 10.15302/j-qb-021-0245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
6
|
Arega Y, Jiang H, Wang S, Zhang J, Niu X, Li G. ChIAMM: A Mixture Model for Statistical Analysis of Long-Range Chromatin Interactions From ChIA-PET Experiments. Front Genet 2021; 11:616160. [PMID: 33381154 PMCID: PMC7767989 DOI: 10.3389/fgene.2020.616160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 11/11/2020] [Indexed: 11/13/2022] Open
Abstract
Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) is an important experimental method for detecting specific protein-mediated chromatin loops genome-wide at high resolution. Here, we proposed a new statistical approach with a mixture model, chromatin interaction analysis using mixture model (ChIAMM), to detect significant chromatin interactions from ChIA-PET data. The statistical model is cast into a Bayesian framework to consider more systematic biases: the genomic distance, local enrichment, mappability, and GC content. Using different ChIA-PET datasets, we evaluated the performance of ChIAMM and compared it with the existing methods, including ChIA-PET Tool, ChiaSig, Mango, ChIA-PET2, and ChIAPoP. The result showed that the new approach performed better than most top existing methods in detecting significant chromatin interactions in ChIA-PET experiments.
Collapse
Affiliation(s)
- Yibeltal Arega
- Agricultural Bioinformatics Key Laboratory of Hubei Province, Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Hao Jiang
- Agricultural Bioinformatics Key Laboratory of Hubei Province, Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Shuangqi Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Jingwen Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Xiaohui Niu
- Agricultural Bioinformatics Key Laboratory of Hubei Province, Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Guoliang Li
- Agricultural Bioinformatics Key Laboratory of Hubei Province, Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan, China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| |
Collapse
|
7
|
Fernandez LR, Gilgenast TG, Phillips-Cremins JE. 3DeFDR: statistical methods for identifying cell type-specific looping interactions in 5C and Hi-C data. Genome Biol 2020; 21:219. [PMID: 32859248 PMCID: PMC7496221 DOI: 10.1186/s13059-020-02061-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Accepted: 05/27/2020] [Indexed: 11/18/2022] Open
Abstract
An important unanswered question in chromatin biology is the extent to which long-range looping interactions change across developmental models, genetic perturbations, drug treatments, and disease states. Computational tools for rigorous assessment of cell type-specific loops across multiple biological conditions are needed. We present 3DeFDR, a simple and effective statistical tool for classifying dynamic loops across biological conditions from Chromosome-Conformation-Capture-Carbon-Copy (5C) and Hi-C data. Our work provides a statistical framework and open-source coding libraries for sensitive detection of cell type-specific loops in high-resolution 5C and Hi-C data from multiple cellular conditions.
Collapse
Affiliation(s)
- Lindsey R Fernandez
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Thomas G Gilgenast
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jennifer E Phillips-Cremins
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
8
|
Vardaxis I, Drabløs F, Rye MB, Lindqvist BH. MACPET: model-based analysis for ChIA-PET. Biostatistics 2020; 21:625-639. [PMID: 30698663 PMCID: PMC7308020 DOI: 10.1093/biostatistics/kxy084] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 12/13/2018] [Accepted: 12/16/2018] [Indexed: 11/16/2022] Open
Abstract
We present model-based analysis for ChIA-PET (MACPET), which analyzes paired-end read sequences provided by ChIA-PET for finding binding sites of a protein of interest. MACPET uses information from both tags of each PET and searches for binding sites in a two-dimensional space, while taking into account different noise levels in different genomic regions. MACPET shows favorable results compared with MACS in terms of motif occurrence and spatial resolution. Furthermore, significant binding sites discovered by MACPET are involved in a higher number of significant three-dimensional interactions than those discovered by MACS. MACPET is freely available on Bioconductor. ChIA-PET; MACPET; Model-based clustering; Paired-end tags; Peak-calling algorithm.
Collapse
Affiliation(s)
- Ioannis Vardaxis
- Department of Mathematical Sciences, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Finn Drabløs
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Morten B Rye
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, N-7491 Trondheim, Norway and Clinic of Surgery, St. Olavs Hospital, Trondheim University Hospital, N-7030 Trondheim, Norway
| | - Bo Henry Lindqvist
- Department of Mathematical Sciences, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| |
Collapse
|
9
|
Krismer K, Guo Y, Gifford DK. IDR2D identifies reproducible genomic interactions. Nucleic Acids Res 2020; 48:e31. [PMID: 32009147 PMCID: PMC7102997 DOI: 10.1093/nar/gkaa030] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 12/19/2019] [Accepted: 01/22/2020] [Indexed: 12/21/2022] Open
Abstract
Chromatin interaction data from protocols such as ChIA-PET, HiChIP and Hi-C provide valuable insights into genome organization and gene regulation, but can include spurious interactions that do not reflect underlying genome biology. We introduce an extension of the Irreproducible Discovery Rate (IDR) method called IDR2D that identifies replicable interactions shared by chromatin interaction experiments. IDR2D provides a principled set of interactions and eliminates artifacts from single experiments. The method is available as a Bioconductor package for the R community, as well as an online service at https://idr2d.mit.edu.
Collapse
Affiliation(s)
- Konstantin Krismer
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Yuchun Guo
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA 02139, USA
| | - David K Gifford
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA 02139, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.,Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| |
Collapse
|
10
|
Guo Y, Krismer K, Closser M, Wichterle H, Gifford DK. High resolution discovery of chromatin interactions. Nucleic Acids Res 2019; 47:e35. [PMID: 30953075 PMCID: PMC6451139 DOI: 10.1093/nar/gkz051] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2018] [Revised: 01/17/2019] [Accepted: 02/11/2019] [Indexed: 12/03/2022] Open
Abstract
Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) is a method for the genome-wide de novo discovery of chromatin interactions. Existing computational methods typically fail to detect weak or dynamic interactions because they use a peak-calling step that ignores paired-end linkage information. We have developed a novel computational method called Chromatin Interaction Discovery (CID) to overcome this limitation with an unbiased clustering approach for interaction discovery. CID outperforms existing chromatin interaction detection methods with improved sensitivity, replicate consistency, and concordance with other chromatin interaction datasets. In addition, CID also outperforms other methods in discovering chromatin interactions from HiChIP data. We expect that the CID method will be valuable in characterizing 3D chromatin interactions and in understanding the functional consequences of disease-associated distal genetic variations.
Collapse
Affiliation(s)
- Yuchun Guo
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Konstantin Krismer
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Michael Closser
- Departments of Pathology and Cell Biology, Neurology, and Neuroscience, Center for Motor Neuron Biology and Disease, Columbia University Medical Center, New York, NY, USA
| | - Hynek Wichterle
- Departments of Pathology and Cell Biology, Neurology, and Neuroscience, Center for Motor Neuron Biology and Disease, Columbia University Medical Center, New York, NY, USA
| | - David K Gifford
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.,Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.,Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
11
|
Hernández-Lemus E, Reyes-Gopar H, Espinal-Enríquez J, Ochoa S. The Many Faces of Gene Regulation in Cancer: A Computational Oncogenomics Outlook. Genes (Basel) 2019; 10:E865. [PMID: 31671657 PMCID: PMC6896122 DOI: 10.3390/genes10110865] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 10/16/2019] [Accepted: 10/24/2019] [Indexed: 12/16/2022] Open
Abstract
Cancer is a complex disease at many different levels. The molecular phenomenology of cancer is also quite rich. The mutational and genomic origins of cancer and their downstream effects on processes such as the reprogramming of the gene regulatory control and the molecular pathways depending on such control have been recognized as central to the characterization of the disease. More important though is the understanding of their causes, prognosis, and therapeutics. There is a multitude of factors associated with anomalous control of gene expression in cancer. Many of these factors are now amenable to be studied comprehensively by means of experiments based on diverse omic technologies. However, characterizing each dimension of the phenomenon individually has proven to fall short in presenting a clear picture of expression regulation as a whole. In this review article, we discuss some of the more relevant factors affecting gene expression control both, under normal conditions and in tumor settings. We describe the different omic approaches that we can use as well as the computational genomic analysis needed to track down these factors. Then we present theoretical and computational frameworks developed to integrate the amount of diverse information provided by such single-omic analyses. We contextualize this within a systems biology-based multi-omic regulation setting, aimed at better understanding the complex interplay of gene expression deregulation in cancer.
Collapse
Affiliation(s)
- Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City 14610, Mexico.
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City 04510, Mexico.
| | - Helena Reyes-Gopar
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City 14610, Mexico.
| | - Jesús Espinal-Enríquez
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City 14610, Mexico.
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City 04510, Mexico.
| | - Soledad Ochoa
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City 14610, Mexico.
| |
Collapse
|
12
|
Identification of significant chromatin contacts from HiChIP data by FitHiChIP. Nat Commun 2019; 10:4221. [PMID: 31530818 PMCID: PMC6748947 DOI: 10.1038/s41467-019-11950-y] [Citation(s) in RCA: 107] [Impact Index Per Article: 21.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 08/14/2019] [Indexed: 02/06/2023] Open
Abstract
HiChIP/PLAC-seq is increasingly becoming popular for profiling 3D chromatin contacts among regulatory elements and for annotating functions of genetic variants. Here we describe FitHiChIP, a computational method for loop calling from HiChIP/PLAC-seq data, which jointly models the non-uniform coverage and genomic distance scaling of contact counts to compute statistical significance estimates. We also develop a technique to filter putative bystander loops that can be explained by stronger adjacent loops. Compared to existing methods, FitHiChIP performs better in recovering contacts reported by Hi-C, promoter capture Hi-C and ChIA-PET experiments and in capturing previously validated promoter-enhancer interactions. FitHiChIP loop calls are reproducible among replicates and are consistent across different experimental settings. Our work also provides a framework for differential HiChIP analysis with an option to utilize ChIP-seq data for further characterizing differential loops. Even though designed for HiChIP, FitHiChIP is also applicable to other conformation capture assays. HiChIP/PLAC-seq assay is popular for profiling 3D genome interactions among regulatory elements at kilobase resolution. Here the authors describe FitHiChIP an empirical null-based, flexible computational method for statistical significance estimation and loop calling from HiChIP data.
Collapse
|
13
|
Chromatin Interaction Analysis with Updated ChIA-PET Tool (V3). Genes (Basel) 2019; 10:genes10070554. [PMID: 31336684 PMCID: PMC6678675 DOI: 10.3390/genes10070554] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2019] [Revised: 07/16/2019] [Accepted: 07/16/2019] [Indexed: 12/20/2022] Open
Abstract
Understanding chromatin interactions is important because they create chromosome conformation and link the cis- and trans- regulatory elements to their target genes for transcriptional regulation. Chromatin Interaction Analysis with Paired-End Tag (ChIA-PET) sequencing is a genome-wide high-throughput technology that detects chromatin interactions associated with a specific protein of interest. We developed ChIA-PET Tool for ChIA-PET data analysis in 2010. Here, we present the updated version of ChIA-PET Tool (V3) as a computational package to process the next-generation sequence data generated from ChIA-PET experiments. It processes short-read and long-read ChIA-PET data with multithreading and generates statistics of results in an HTML file. In this paper, we provide a detailed demonstration of the design of ChIA-PET Tool V3 and how to install it and analyze RNA polymerase II (RNAPII) ChIA-PET data from human K562 cells with it. We compared our tool with existing tools, including ChiaSig, MICC, Mango and ChIA-PET2, by using the same public data set in the same computer. Most peaks detected by the ChIA-PET Tool V3 overlap with those of other tools. There is higher enrichment for significant chromatin interactions from ChIA-PET Tool V3 in aggregate peak analysis (APA) plots. The ChIA-PET Tool V3 is publicly available at GitHub.
Collapse
|
14
|
Huang W, Medvedovic M, Zhang J, Niu L. ChIAPoP: a new tool for ChIA-PET data analysis. Nucleic Acids Res 2019; 47:e37. [PMID: 30753588 PMCID: PMC6468250 DOI: 10.1093/nar/gkz062] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 12/19/2018] [Accepted: 01/24/2019] [Indexed: 01/05/2023] Open
Abstract
Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) is a popular assay method for studying genome-wide chromatin interactions mediated by a protein of interest. The main goal of ChIA-PET data analysis is to detect interactions between DNA regions. Here, we propose a new method and the associated data analysis pipeline, ChIAPoP, to detect chromatin interactions from ChIA-PET data. We compared ChIAPoP with other popular methods, including a hypergeometric model (used in ChIA-PET tool), MICC (used in ChIA-PET2), ChiaSig and mango. The results showed that ChIA-PoP performed better than or at least as well as these top existing methods in detecting true chromatin interactions. ChIAPoP is freely available to the public at https://github.com/wh90999/ChIAPoP.
Collapse
Affiliation(s)
- Weichun Huang
- National Exposure Research Laboratory, Environmental Protection Agency, Research Triangle Park, NC 27709, USA
| | - Mario Medvedovic
- Division of Biostatistics and Bioinformatics, Department of Environmental Health, College of Medicine, University of Cincinnati, Cincinnati, OH 45267, USA
| | - Jingwen Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Liang Niu
- Division of Biostatistics and Bioinformatics, Department of Environmental Health, College of Medicine, University of Cincinnati, Cincinnati, OH 45267, USA
| |
Collapse
|
15
|
Kai Y, Andricovich J, Zeng Z, Zhu J, Tzatsos A, Peng W. Predicting CTCF-mediated chromatin interactions by integrating genomic and epigenomic features. Nat Commun 2018; 9:4221. [PMID: 30310060 PMCID: PMC6181989 DOI: 10.1038/s41467-018-06664-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Accepted: 09/17/2018] [Indexed: 01/27/2023] Open
Abstract
The CCCTC-binding zinc-finger protein (CTCF)-mediated network of long-range chromatin interactions is important for genome organization and function. Although this network has been considered largely invariant, we find that it exhibits extensive cell-type-specific interactions that contribute to cell identity. Here, we present Lollipop, a machine-learning framework, which predicts CTCF-mediated long-range interactions using genomic and epigenomic features. Using ChIA-PET data as benchmark, we demonstrate that Lollipop accurately predicts CTCF-mediated chromatin interactions both within and across cell types, and outperforms other methods based only on CTCF motif orientation. Predictions are confirmed computationally and experimentally by Chromatin Conformation Capture (3C). Moreover, our approach identifies other determinants of CTCF-mediated chromatin wiring, such as gene expression within the loops. Our study contributes to a better understanding about the underlying principles of CTCF-mediated chromatin interactions and their impact on gene expression. CTCF mediates long-range chromatin interactions which are important for genome organization and function. Here, the authors demonstrate that CTCF-mediated interactome exhibits extensive plasticity and present Lollipop, a machine-learning framework which predicts CTCF-mediated long-range interactions using genomic and epigenomic features.
Collapse
Affiliation(s)
- Yan Kai
- Department of Physics, George Washington University (GWU), Washington, DC, 20052, USA.,Department of Anatomy and Cell Biology, Cancer Epigenetics Laboratory, GWU, Washington, DC, 20052, USA.,GWU Cancer Center, GWU School of Medicine and Health Sciences, Washington, DC, 20052, USA
| | - Jaclyn Andricovich
- Department of Anatomy and Cell Biology, Cancer Epigenetics Laboratory, GWU, Washington, DC, 20052, USA.,GWU Cancer Center, GWU School of Medicine and Health Sciences, Washington, DC, 20052, USA
| | - Zhouhao Zeng
- Department of Physics, George Washington University (GWU), Washington, DC, 20052, USA
| | - Jun Zhu
- Systems Biology Center, National Heart Lung and Blood Institute, National Institute of Health, Bethesda, MD, 20892, USA
| | - Alexandros Tzatsos
- Department of Anatomy and Cell Biology, Cancer Epigenetics Laboratory, GWU, Washington, DC, 20052, USA. .,GWU Cancer Center, GWU School of Medicine and Health Sciences, Washington, DC, 20052, USA.
| | - Weiqun Peng
- Department of Physics, George Washington University (GWU), Washington, DC, 20052, USA.
| |
Collapse
|
16
|
Lyu X, Rowley MJ, Corces VG. Architectural Proteins and Pluripotency Factors Cooperate to Orchestrate the Transcriptional Response of hESCs to Temperature Stress. Mol Cell 2018; 71:940-955.e7. [PMID: 30122536 PMCID: PMC6214669 DOI: 10.1016/j.molcel.2018.07.012] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2018] [Revised: 06/11/2018] [Accepted: 07/12/2018] [Indexed: 11/23/2022]
Abstract
Cells respond to temperature stress via up- and downregulation of hundreds of genes. This process is thought to be regulated by the heat shock factor HSF1, which controls the release of RNAPII from promoter-proximal pausing. Here, we analyze the events taking place in hESCs upstream of RNAPII release. We find that temperature stress results in the activation or decommissioning of thousands of enhancers. This process involves alterations in the occupancy of transcription factors HSF1, AP-1, NANOG, KLF4, and OCT4 accompanied by nucleosome remodeling by BRG1 and changes in H3K27ac. Furthermore, redistribution of RAD21 and CTCF results in the formation and disassembly of interactions mediated by these two proteins. These alterations tether and untether enhancers to their cognate promoters or refashion insulated neighborhoods, thus transforming the landscape of enhancer-promoter interactions. Details of the 3D interactome remodeling process support loop extrusion initiating at random sites as a mechanism for the establishment of CTCF/cohesin loops.
Collapse
Affiliation(s)
- Xiaowen Lyu
- Department of Biology, Emory University, Atlanta, GA 30322, USA
| | - M Jordan Rowley
- Department of Biology, Emory University, Atlanta, GA 30322, USA
| | - Victor G Corces
- Department of Biology, Emory University, Atlanta, GA 30322, USA.
| |
Collapse
|
17
|
Ando-Kuri M, Rivera ISM, Rowley MJ, Corces VG. Analysis of Chromatin Interactions Mediated by Specific Architectural Proteins in Drosophila Cells. Methods Mol Biol 2018; 1766:239-256. [PMID: 29605857 PMCID: PMC6334841 DOI: 10.1007/978-1-4939-7768-0_14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Chromosome conformation capture assays have been established, modified, and enhanced for over a decade with the purpose of studying nuclear organization. A recently published method uses in situ Hi-C followed by chromatin immunoprecipitation (HiChIP) to enrich the overall yield of significant genome-wide interactions mediated by a specific protein. Here we applied a modified version of the HiChIP protocol to retrieve the significant contacts mediated by architectural protein CP190 in D. melanogaster cells.
Collapse
|
18
|
BL-Hi-C is an efficient and sensitive approach for capturing structural and regulatory chromatin interactions. Nat Commun 2017; 8:1622. [PMID: 29158486 PMCID: PMC5696378 DOI: 10.1038/s41467-017-01754-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 10/13/2017] [Indexed: 01/29/2023] Open
Abstract
In human cells, DNA is hierarchically organized and assembled with histones and DNA-binding proteins in three dimensions. Chromatin interactions play important roles in genome architecture and gene regulation, including robustness in the developmental stages and flexibility during the cell cycle. Here we propose in situ Hi-C method named Bridge Linker-Hi-C (BL-Hi-C) for capturing structural and regulatory chromatin interactions by restriction enzyme targeting and two-step proximity ligation. This method improves the sensitivity and specificity of active chromatin loop detection and can reveal the regulatory enhancer-promoter architecture better than conventional methods at a lower sequencing depth and with a simpler protocol. We demonstrate its utility with two well-studied developmental loci: the beta-globin and HOXC cluster regions. Chromatin interactions and genome architecture are key regulators of gene expression. Here the authors present Bridge-Linker-Hi-C to map active chromatin loops and enhancer-promoter interactions.
Collapse
|
19
|
Rowley MJ, Nichols MH, Lyu X, Ando-Kuri M, Rivera ISM, Hermetz K, Wang P, Ruan Y, Corces VG. Evolutionarily Conserved Principles Predict 3D Chromatin Organization. Mol Cell 2017; 67:837-852.e7. [PMID: 28826674 DOI: 10.1016/j.molcel.2017.07.022] [Citation(s) in RCA: 354] [Impact Index Per Article: 50.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Revised: 06/23/2017] [Accepted: 07/21/2017] [Indexed: 01/02/2023]
Abstract
Topologically associating domains (TADs), CTCF loop domains, and A/B compartments have been identified as important structural and functional components of 3D chromatin organization, yet the relationship between these features is not well understood. Using high-resolution Hi-C and HiChIP, we show that Drosophila chromatin is organized into domains we term compartmental domains that correspond precisely with A/B compartments at high resolution. We find that transcriptional state is a major predictor of Hi-C contact maps in several eukaryotes tested, including C. elegans and A. thaliana. Architectural proteins insulate compartmental domains by reducing interaction frequencies between neighboring regions in Drosophila, but CTCF loops do not play a distinct role in this organism. In mammals, compartmental domains exist alongside CTCF loop domains to form topological domains. The results suggest that compartmental domains are responsible for domain structure in all eukaryotes, with CTCF playing an important role in domain formation in mammals.
Collapse
Affiliation(s)
- M Jordan Rowley
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - Michael H Nichols
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - Xiaowen Lyu
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - Masami Ando-Kuri
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - I Sarahi M Rivera
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - Karen Hermetz
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA
| | - Ping Wang
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06030, USA
| | - Yijun Ruan
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06030, USA
| | - Victor G Corces
- Department of Biology, Emory University, 1510 Clifton Road Northeast, Atlanta, GA 30322, USA.
| |
Collapse
|
20
|
Szałaj P, Tang Z, Michalski P, Pietal MJ, Luo OJ, Sadowski M, Li X, Radew K, Ruan Y, Plewczynski D. An integrated 3-Dimensional Genome Modeling Engine for data-driven simulation of spatial genome organization. Genome Res 2016; 26:1697-1709. [PMID: 27789526 PMCID: PMC5131821 DOI: 10.1101/gr.205062.116] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 10/20/2016] [Indexed: 02/03/2023]
Abstract
ChIA-PET is a high-throughput mapping technology that reveals long-range chromatin interactions and provides insights into the basic principles of spatial genome organization and gene regulation mediated by specific protein factors. Recently, we showed that a single ChIA-PET experiment provides information at all genomic scales of interest, from the high-resolution locations of binding sites and enriched chromatin interactions mediated by specific protein factors, to the low resolution of nonenriched interactions that reflect topological neighborhoods of higher-order chromosome folding. This multilevel nature of ChIA-PET data offers an opportunity to use multiscale 3D models to study structural-functional relationships at multiple length scales, but doing so requires a structural modeling platform. Here, we report the development of 3D-GNOME (3-Dimensional Genome Modeling Engine), a complete computational pipeline for 3D simulation using ChIA-PET data. 3D-GNOME consists of three integrated components: a graph-distance-based heat map normalization tool, a 3D modeling platform, and an interactive 3D visualization tool. Using ChIA-PET and Hi-C data derived from human B-lymphocytes, we demonstrate the effectiveness of 3D-GNOME in building 3D genome models at multiple levels, including the entire genome, individual chromosomes, and specific segments at megabase (Mb) and kilobase (kb) resolutions of single average and ensemble structures. Further incorporation of CTCF-motif orientation and high-resolution looping patterns in 3D simulation provided additional reliability of potential biologically plausible topological structures.
Collapse
Affiliation(s)
- Przemysław Szałaj
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland.,Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland.,I-BioStat, Hasselt University, BE3590 Hasselt, Belgium
| | - Zhonghui Tang
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Paul Michalski
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Michal J Pietal
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Oscar J Luo
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Michał Sadowski
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Xingwang Li
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Kamen Radew
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Yijun Ruan
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA.,Department of Genetics and Genome Sciences, UConn Health, Farmington, Connecticut 06032, USA
| | - Dariusz Plewczynski
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland.,Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland.,Faculty of Pharmacy, Medical University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
21
|
Li G, Chen Y, Snyder MP, Zhang MQ. ChIA-PET2: a versatile and flexible pipeline for ChIA-PET data analysis. Nucleic Acids Res 2016; 45:e4. [PMID: 27625391 PMCID: PMC5224499 DOI: 10.1093/nar/gkw809] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2016] [Revised: 08/30/2016] [Accepted: 09/04/2016] [Indexed: 11/12/2022] Open
Abstract
ChIA-PET2 is a versatile and flexible pipeline for analyzing different types of ChIA-PET data from raw sequencing reads to chromatin loops. ChIA-PET2 integrates all steps required for ChIA-PET data analysis, including linker trimming, read alignment, duplicate removal, peak calling and chromatin loop calling. It supports different kinds of ChIA-PET data generated from different ChIA-PET protocols and also provides quality controls for different steps of ChIA-PET analysis. In addition, ChIA-PET2 can use phased genotype data to call allele-specific chromatin interactions. We applied ChIA-PET2 to different ChIA-PET datasets, demonstrating its significantly improved performance as well as its ability to easily process ChIA-PET raw data. ChIA-PET2 is available at https://github.com/GuipengLi/ChIA-PET2.
Collapse
Affiliation(s)
- Guipeng Li
- MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST; Department of Automation, Tsinghua University, Beijing 100084, China
| | - Yang Chen
- MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST; Department of Automation, Tsinghua University, Beijing 100084, China
| | - Michael P Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Michael Q Zhang
- MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST; Department of Automation, Tsinghua University, Beijing 100084, China .,Department of Biological Sciences, Center for Systems Biology, University of Texas, Dallas, 800 West Campbell Road, RL11, Richardson, TX 75080-3021, USA
| |
Collapse
|