1
|
Wang F, Lin J, Alinejad-Rokny H, Ma W, Meng L, Huang L, Yu J, Chen N, Wang Y, Yao Z, Xie W, Wong KC, Li X. Unveiling Multi-Scale Architectural Features in Single-Cell Hi-C Data Using scCAFE. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025:e2416432. [PMID: 40270467 DOI: 10.1002/advs.202416432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2024] [Revised: 03/12/2025] [Indexed: 04/25/2025]
Abstract
Single-cell Hi-C (scHi-C) has provided unprecedented insights into the heterogeneity of 3D genome organization. However, its sparse and noisy nature poses challenges for computational analyses, such as chromatin architectural feature identification. Here, scCAFE is introduced, which is a deep learning model for the multi-scale detection of architectural features at the single-cell level. scCAFE provides a unified framework for annotating chromatin loops, TAD-like domains (TLDs), and compartments across individual cells. This model outperforms previous scHi-C loop calling methods and delivers accurate predictions of TLDs and compartments that are biologically consistent with previous studies. The resulting single-cell annotations also offer a measure to characterize the heterogeneity of different levels of architectural features across cell types. This heterogeneity is then leveraged to identify a series of marker loop anchors, demontrating the potential of the 3D genome data to annotate cell identities without the aid of simultaneously sequenced omics data. Overall, scCAFE not only serves as a useful tool for analyzing single-cell genomic architecture, but also paves the way for precise cell-type annotations solely based on 3D genome features.
Collapse
Affiliation(s)
- Fuzhou Wang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Jiecong Lin
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, 000000, Hong Kong SAR
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Department of Pathology, Harvard Medical School, Boston, MA, 02129, USA
| | - Hamid Alinejad-Rokny
- BioMedical Machine Learning Lab, Graduate School of Biomedical Engineering, University of New South Wales, Sydney, 2052, Australia
| | - Wenjing Ma
- School of Artificial Intelligence, Jilin University, Changchun, 132000, China
| | - Lingkuan Meng
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Lei Huang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Jixiang Yu
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Nanjun Chen
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Yuchen Wang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Zhongyu Yao
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Weidun Xie
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Ka-Chun Wong
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
- Shenzhen Research Institute, City University of Hong Kong, Shenzhen, 518057, China
| | - Xiangtao Li
- School of Artificial Intelligence, Jilin University, Changchun, 132000, China
| |
Collapse
|
2
|
Vinayak V, Basir R, Golloshi R, Toth J, Sant'Anna L, Lakadamyali M, McCord RP, Shenoy VB. Polymer model integrates imaging and sequencing to reveal how nanoscale heterochromatin domains influence gene expression. Nat Commun 2025; 16:3816. [PMID: 40268925 PMCID: PMC12019571 DOI: 10.1038/s41467-025-59001-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 04/08/2025] [Indexed: 04/25/2025] Open
Abstract
Chromatin organization regulates gene expression, with nanoscale heterochromatin domains playing a fundamental role. Their size varies with microenvironmental stiffness and epigenetic interventions, but how these factors regulate their formation and influence transcription remains unclear. To address this, we developed a sequencing-informed copolymer model that simulates chromatin evolution through diffusion and active epigenetic reactions. Our model predicts the formation of nanoscale heterochromatin domains and quantifies how domain size scales with epigenetic reaction rates, showing that epigenetic and compaction changes primarily occur at domain boundaries. We validated these predictions via Hi-C and super-resolution imaging of hyperacetylated melanoma cells and identified differential expression of metastasis-related genes through RNA-seq. We validated our findings in hMSCs, where epigenetic reaction rates respond to microenvironmental stiffness. Conclusively, our simulations reveal that heterochromatin domain boundaries regulate gene expression and epigenetic memory. These findings demonstrate how external cues drive chromatin organization and transcriptional memory in development and disease.
Collapse
Affiliation(s)
- Vinayak Vinayak
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Materials Science and Engineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Ramin Basir
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Materials Science and Engineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Rosela Golloshi
- Departments of Cell Biology, Center for Cell Dynamics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Giovanis Institute for Translational Cell Biology, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Joshua Toth
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Materials Science and Engineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Lucas Sant'Anna
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Melike Lakadamyali
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Rachel Patton McCord
- Department of Biochemistry & Cellular and Molecular Biology, University of Tennessee, Knoxville, TN, USA
| | - Vivek B Shenoy
- Center for Engineering Mechanobiology, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Materials Science and Engineering, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
3
|
Álvarez-González L, Ruiz-Herrera A. Evolution of 3D Chromatin Folding. Annu Rev Anim Biosci 2025; 13:49-71. [PMID: 39531399 DOI: 10.1146/annurev-animal-111523-102233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
Studies examining the evolution of genomes have focused mainly on sequence conservation. However, the inner working of a cell implies tightly regulated crosstalk between complex gene networks controlled by small dispersed regulatory elements of physically contacting DNA regions. How these different levels of chromatin organization crosstalk in different species underpins the potential for genome evolutionary plasticity. We review the evolution of chromatin organization across the Animal Tree of Life. We introduce general aspects of the mode and tempo of genome evolution to later explore the multiple layers of genome organization. We argue that both genome and chromosome size modulate patterns of chromatin folding and that chromatin interactions facilitate the formation of lineage-specific chromosomal reorganizations, especially in germ cells. Overall, analyzing the mechanistic forces involved in the maintenance of chromatin structure and function of the germ line is critical for understanding genome evolution, maintenance, and inheritance.
Collapse
Affiliation(s)
- Lucía Álvarez-González
- Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina and Departament de Biologia Cel.lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain; ,
| | - Aurora Ruiz-Herrera
- Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina and Departament de Biologia Cel.lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain; ,
| |
Collapse
|
4
|
Tavallaee G, Orouji E. Mapping the 3D genome architecture. Comput Struct Biotechnol J 2024; 27:89-101. [PMID: 39816913 PMCID: PMC11732852 DOI: 10.1016/j.csbj.2024.12.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Revised: 12/17/2024] [Accepted: 12/20/2024] [Indexed: 01/18/2025] Open
Abstract
The spatial organization of the genome plays a critical role in regulating gene expression, cellular differentiation, and genome stability. This review provides an in-depth examination of the methodologies, computational tools, and frameworks developed to map the three-dimensional (3D) architecture of the genome, focusing on both ligation-based and ligation-free techniques. We also explore the limitations of these methods, including biases introduced by restriction enzyme digestion and ligation inefficiencies, and compare them to more recent ligation-free approaches such as Genome Architecture Mapping (GAM) and Split-Pool Recognition of Interactions by Tag Extension (SPRITE). These techniques offer unique insights into higher-order chromatin structures by bypassing ligation steps, thus enabling the capture of complex multi-way interactions that are often challenging to resolve with traditional methods. Furthermore, we discuss the integration of chromatin interaction data with other genomic layers through multimodal approaches, including recent advances in single-cell technologies like sci-HiC and scSPRITE, which help unravel the heterogeneity of chromatin architecture in development and disease.
Collapse
Affiliation(s)
- Ghazaleh Tavallaee
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Elias Orouji
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| |
Collapse
|
5
|
Banerjee A, Zhang S, Bahar I. Genome structural dynamics: insights from Gaussian network analysis of Hi-C data. Brief Funct Genomics 2024; 23:525-537. [PMID: 38654598 PMCID: PMC11428154 DOI: 10.1093/bfgp/elae014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 03/11/2024] [Accepted: 04/02/2024] [Indexed: 04/26/2024] Open
Abstract
Characterization of the spatiotemporal properties of the chromatin is essential to gaining insights into the physical bases of gene co-expression, transcriptional regulation and epigenetic modifications. The Gaussian network model (GNM) has proven in recent work to serve as a useful tool for modeling chromatin structural dynamics, using as input high-throughput chromosome conformation capture data. We focus here on the exploration of the collective dynamics of chromosomal structures at hierarchical levels of resolution, from single gene loci to topologically associating domains or entire chromosomes. The GNM permits us to identify long-range interactions between gene loci, shedding light on the role of cross-correlations between distal regions of the chromosomes in regulating gene expression. Notably, GNM analysis performed across diverse cell lines highlights the conservation of the global/cooperative movements of the chromatin across different types of cells. Variations driven by localized couplings between genomic loci, on the other hand, underlie cell differentiation, underscoring the significance of the four-dimensional properties of the genome in defining cellular identity. Finally, we demonstrate the close relation between the cell type-dependent mobility profiles of gene loci and their gene expression patterns, providing a clear demonstration of the role of chromosomal 4D features in defining cell-specific differential expression of genes.
Collapse
Affiliation(s)
- Anupam Banerjee
- Laufer Center for Physical & Quantitative Biology, Stony Brook University, NY 11794, USA
| | - She Zhang
- OpenEye, Cadence Molecular Sciences, Santa Fe, NM 87508, USA
| | - Ivet Bahar
- Laufer Center for Physical & Quantitative Biology, Stony Brook University, NY 11794, USA
- Department of Biochemistry and Cell Biology, Renaissance School of Medicine, Stony Brook University, NY 11794, USA
| |
Collapse
|
6
|
Lainscsek X, Taher L. ENT3C: an entropy-based similarity measure for Hi-C and micro-C derived contact matrices. NAR Genom Bioinform 2024; 6:lqae076. [PMID: 38962256 PMCID: PMC11217677 DOI: 10.1093/nargab/lqae076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 06/05/2024] [Accepted: 06/27/2024] [Indexed: 07/05/2024] Open
Abstract
Hi-C and micro-C sequencing have shed light on the profound importance of 3D genome organization in cellular function by probing 3D contact frequencies across the linear genome. The resulting contact matrices are extremely sparse and susceptible to technical- and sequence-based biases, making their comparison challenging. The development of reliable, robust and efficient methods for quantifying similarity between contact matrices is crucial for investigating variations in the 3D genome organization in different cell types or under different conditions, as well as evaluating experimental reproducibility. We present a novel method, ENT3C, which measures the change in pattern complexity in the vicinity of contact matrix diagonals to quantify their similarity. ENT3C provides a robust, user-friendly Hi-C or micro-C contact matrix similarity metric and a characteristic entropy signal that can be used to gain detailed biological insights into 3D genome organization.
Collapse
Affiliation(s)
- Xenia Lainscsek
- Institute of Biomedical Informatics, Graz University of Technology, Graz, Austria
| | - Leila Taher
- Institute of Biomedical Informatics, Graz University of Technology, Graz, Austria
| |
Collapse
|
7
|
Liang P, Li H, Long C, Liu M, Zhou J, Zuo Y. Chromatin region binning of gene expression for improving embryo cell subtype identification. Comput Biol Med 2024; 170:108049. [PMID: 38290319 DOI: 10.1016/j.compbiomed.2024.108049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 01/01/2024] [Accepted: 01/26/2024] [Indexed: 02/01/2024]
Abstract
Mammalian embryonic development is a complex process, characterized by intricate spatiotemporal dynamics and distinct chromatin preferences. However, the quick diversification in early embryogenesis leads to significant cellular diversity and the sparsity of scRNA-seq data, posing challenges in accurately determining cell fate decisions. In this study, we introduce a chromatin region binning method using scChrBin, designed to identify chromatin regions that elucidate the dynamics of embryonic development and lineage differentiation. This method transforms scRNA-seq data into a chromatin-based matrix, leveraging genomic annotations. Our results showed that the scChrBin method achieves high accuracy, with 98.0% and 89.2% on two single-cell embryonic datasets, demonstrating its effectiveness in analyzing complex developmental processes. We also systematically and comprehensively analysis of these key chromatin binning regions and their associated genes, focusing on their roles in lineage and stage development. The perspective of chromatin region binning method enables a comprehensive analysis of transcriptome data at the chromatin level, allowing us to unveil the dynamic expression of chromatin regions across temporal and spatial development. The tool is available as an application at https://github.com/liameihao/scChrBin.
Collapse
Affiliation(s)
- Pengfei Liang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Hanshuang Li
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Chunshen Long
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Mingzhu Liu
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Jian Zhou
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China.
| |
Collapse
|