1
|
Zhou S, Lin N, Yu L, Su X, Liu Z, Yu X, Gao H, Lin S, Zeng Y. Single-cell multi-omics in the study of digestive system cancers. Comput Struct Biotechnol J 2024; 23:431-445. [PMID: 38223343 PMCID: PMC10787224 DOI: 10.1016/j.csbj.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 12/07/2023] [Accepted: 12/07/2023] [Indexed: 01/16/2024] Open
Abstract
Digestive system cancers are prevalent diseases with a high mortality rate, posing a significant threat to public health and economic burden. The diagnosis and treatment of digestive system cancer confront conventional cancer problems, such as tumor heterogeneity and drug resistance. Single-cell sequencing (SCS) emerged at times required and has developed from single-cell RNA-seq (scRNA-seq) to the single-cell multi-omics era represented by single-cell spatial transcriptomics (ST). This article comprehensively reviews the advances of single-cell omics technology in the study of digestive system tumors. While analyzing and summarizing the research cases, vital details on the sequencing platform, sample information, sampling method, and key findings are provided. Meanwhile, we summarize the commonly used SCS platforms and their features, as well as the advantages of multi-omics technologies in combination. Finally, the development trends and prospects of the application of single-cell multi-omics technology in digestive system cancer research are prospected.
Collapse
Affiliation(s)
- Shuang Zhou
- The Second Clinical Medical School of Fujian Medical University, Quanzhou, Fujian Province, China
- The Clinical Center of Molecular Diagnosis and Therapy, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
| | - Nanfei Lin
- The Clinical Center of Molecular Diagnosis and Therapy, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
| | - Liying Yu
- The Clinical Center of Molecular Diagnosis and Therapy, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
| | - Xiaoshan Su
- Department of Pulmonary and Critical Care Medicine, The Second Affiliated Hospital of Fujian Medical University, Respirology Medicine Centre of Fujian Province, Quanzhou, China
| | - Zhenlong Liu
- Lady Davis Institute for Medical Research, Jewish General Hospital, & Division of Experimental Medicine, Department of Medicine, McGill University, Montreal, QC, Canada
| | - Xiaowan Yu
- Clinical Laboratory, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
| | - Hongzhi Gao
- The Clinical Center of Molecular Diagnosis and Therapy, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
| | - Shu Lin
- Centre of Neurological and Metabolic Research, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
- Diabetes and Metabolism Division, Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, Sydney, NSW 2010, Australia
| | - Yiming Zeng
- Department of Pulmonary and Critical Care Medicine, The Second Affiliated Hospital of Fujian Medical University, Respirology Medicine Centre of Fujian Province, Quanzhou, China
- Fujian Provincial Key Laboratory of Lung Stem Cells, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China
- Jinan Microecological Biomedicine Shandong Laboratory, Jinan, Shandong Province, China
| |
Collapse
|
2
|
Gong H, Zhang S, Zhang X, Chen Y. A method for chromatin domain partitioning based on hypergraph clustering. Comput Struct Biotechnol J 2024; 23:1584-1593. [PMID: 38655013 PMCID: PMC11035048 DOI: 10.1016/j.csbj.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2024] [Revised: 03/29/2024] [Accepted: 04/04/2024] [Indexed: 04/26/2024] Open
Abstract
For many years, multi-scale models of chromatin domains, such as A/B compartments, sub-compartments, topologically associated domains (TADs), sub-TADs, and loops have been popular. However, existing methods can only identify structures at a single scale and cannot partition multi-scale structures. In this paper, we proposed a method (TORNADOES) for chromatin domain partitioning based on hypergraph clustering. First, we use a density clustering algorithm to identify TADs at different scales based on Hi-C data with different resolutions. Then, by combining ChIP-seq data features and TAD results at different scales, we generate a hypergraph based on these TADs. Finally, we partition the chromatin domain structure at different scales, including A/B, A1, A2, B1, B2, and B3 based on the Laplacian matrix feature of the hypergraph. Similarity comparison experiments and ChIP-seq signal enrichment analysis are performed on the A/B region and sub-TAD levels, respectively, demonstrating that our method can identify chromatin domains with distinct features and provide a deeper understanding of the organizational patterns and functional differences in TADs at the genomic hierarchical structure. Comparative analysis of multiple cell line data shows that TORNADOES can better classify different numbers and types of compartments by changing the factors ChIP-seq data and clustering number used to characterize TAD compared to other methods. Source code for the TORNADOES method can be found at https://github.com/ghaiyan/TORNADOES.
Collapse
Affiliation(s)
- Haiyan Gong
- Beijing Advanced Innovation Center for Materials Genome Engineering, Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing, 100083, China
- Shunde Innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China
| | - Sichen Zhang
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
| | - Xiaotong Zhang
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Shunde Innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China
| | - Yang Chen
- The State Key Laboratory of Common Mechanism Research for Major Diseases, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100005, China
| |
Collapse
|
3
|
Oji A, Choubani L, Miura H, Hiratani I. Structure and dynamics of nuclear A/B compartments and subcompartments. Curr Opin Cell Biol 2024; 90:102406. [PMID: 39083950 DOI: 10.1016/j.ceb.2024.102406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 07/02/2024] [Accepted: 07/12/2024] [Indexed: 08/02/2024]
Abstract
Mammalian chromosomes form a hierarchical structure within the cell nucleus, from chromatin loops, megabase (Mb)-sized topologically associating domains (TADs) to larger-scale A/B compartments. The molecular basis of the structures of loops and TADs has been actively studied. However, the A and B compartments, which correspond to early-replicating euchromatin and late-replicating heterochromatin, respectively, are still relatively unexplored. In this review, we focus on the A/B compartments, discuss their close relationship to DNA replication timing (RT), and introduce recent findings on the features of subcompartments revealed by detailed classification of the A/B compartments. In doing so, we speculate on the structure, potential function, and developmental dynamics of A/B compartments and subcompartments in mammalian cells.
Collapse
Affiliation(s)
- Asami Oji
- Laboratory for Developmental Epigenetics, RIKEN Center for Biosystems Dynamics Research (BDR), 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe 650-0047 Japan
| | - Linda Choubani
- Laboratory for Developmental Epigenetics, RIKEN Center for Biosystems Dynamics Research (BDR), 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe 650-0047 Japan
| | - Hisashi Miura
- Laboratory for Developmental Epigenetics, RIKEN Center for Biosystems Dynamics Research (BDR), 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe 650-0047 Japan
| | - Ichiro Hiratani
- Laboratory for Developmental Epigenetics, RIKEN Center for Biosystems Dynamics Research (BDR), 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe 650-0047 Japan.
| |
Collapse
|
4
|
Sun F, Li H, Sun D, Fu S, Gu L, Shao X, Wang Q, Dong X, Duan B, Xing F, Wu J, Xiao M, Zhao F, Han JDJ, Liu Q, Fan X, Li C, Wang C, Shi T. Single-cell omics: experimental workflow, data analyses and applications. SCIENCE CHINA. LIFE SCIENCES 2024:10.1007/s11427-023-2561-0. [PMID: 39060615 DOI: 10.1007/s11427-023-2561-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/18/2024] [Indexed: 07/28/2024]
Abstract
Cells are the fundamental units of biological systems and exhibit unique development trajectories and molecular features. Our exploration of how the genomes orchestrate the formation and maintenance of each cell, and control the cellular phenotypes of various organismsis, is both captivating and intricate. Since the inception of the first single-cell RNA technology, technologies related to single-cell sequencing have experienced rapid advancements in recent years. These technologies have expanded horizontally to include single-cell genome, epigenome, proteome, and metabolome, while vertically, they have progressed to integrate multiple omics data and incorporate additional information such as spatial scRNA-seq and CRISPR screening. Single-cell omics represent a groundbreaking advancement in the biomedical field, offering profound insights into the understanding of complex diseases, including cancers. Here, we comprehensively summarize recent advances in single-cell omics technologies, with a specific focus on the methodology section. This overview aims to guide researchers in selecting appropriate methods for single-cell sequencing and related data analysis.
Collapse
Affiliation(s)
- Fengying Sun
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China
| | - Haoyan Li
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Dongqing Sun
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Shaliu Fu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Lei Gu
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Shao
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China
| | - Qinqin Wang
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Dong
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Bin Duan
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Feiyang Xing
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Jun Wu
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China
| | - Minmin Xiao
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
| | - Fangqing Zhao
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Jing-Dong J Han
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China.
| | - Qi Liu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China.
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China.
| | - Xiaohui Fan
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China.
- Zhejiang Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
| | - Chen Li
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| | - Tieliu Shi
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China.
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University, Shanghai, 200062, China.
| |
Collapse
|
5
|
Li H, Han Z, Sun Y, Wang F, Hu P, Gao Y, Bai X, Peng S, Ren C, Xu X, Liu Z, Chen H, Yang Y, Bo X. CGMega: explainable graph neural network framework with attention mechanisms for cancer gene module dissection. Nat Commun 2024; 15:5997. [PMID: 39013885 PMCID: PMC11252405 DOI: 10.1038/s41467-024-50426-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 07/09/2024] [Indexed: 07/18/2024] Open
Abstract
Cancer is rarely the straightforward consequence of an abnormality in a single gene, but rather reflects a complex interplay of many genes, represented as gene modules. Here, we leverage the recent advances of model-agnostic interpretation approach and develop CGMega, an explainable and graph attention-based deep learning framework to perform cancer gene module dissection. CGMega outperforms current approaches in cancer gene prediction, and it provides a promising approach to integrate multi-omics information. We apply CGMega to breast cancer cell line and acute myeloid leukemia (AML) patients, and we uncover the high-order gene module formed by ErbB family and tumor factors NRG1, PPM1A and DLG2. We identify 396 candidate AML genes, and observe the enrichment of either known AML genes or candidate AML genes in a single gene module. We also identify patient-specific AML genes and associated gene modules. Together, these results indicate that CGMega can be used to dissect cancer gene modules, and provide high-order mechanistic insights into cancer development and heterogeneity.
Collapse
Affiliation(s)
- Hao Li
- Academy of Military Medical Sciences, Beijing, China
| | - Zebei Han
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai, China
| | - Yu Sun
- Academy of Military Medical Sciences, Beijing, China
| | - Fu Wang
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai, China
| | - Pengzhen Hu
- School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
| | - Yuang Gao
- Department of Hematology, PLA General Hospital, the Fifth Medical Center, Beijing, China
| | - Xuemei Bai
- Academy of Military Medical Sciences, Beijing, China
| | - Shiyu Peng
- Academy of Military Medical Sciences, Beijing, China
| | - Chao Ren
- Academy of Military Medical Sciences, Beijing, China
| | - Xiang Xu
- Academy of Military Medical Sciences, Beijing, China
| | - Zeyu Liu
- Academy of Military Medical Sciences, Beijing, China
| | - Hebing Chen
- Academy of Military Medical Sciences, Beijing, China.
| | - Yang Yang
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai, China.
| | - Xiaochen Bo
- Academy of Military Medical Sciences, Beijing, China.
| |
Collapse
|
6
|
Yun S, Noh M, Yu J, Kim HJ, Hui CC, Lee H, Son JE. Unlocking biological mechanisms with integrative functional genomics approaches. Mol Cells 2024; 47:100092. [PMID: 39019219 DOI: 10.1016/j.mocell.2024.100092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/01/2024] [Accepted: 07/08/2024] [Indexed: 07/19/2024] Open
Abstract
Reverse genetics offers precise functional insights into genes through the targeted manipulation of gene expression followed by phenotypic assessment. While these approaches have proven effective in model organisms such as Saccharomyces cerevisiae, large-scale genetic manipulations in human cells were historically unfeasible due to methodological limitations. However, recent advancements in functional genomics, particularly clustered regularly interspaced short palindromic repeats (CRISPR)-based screening technologies and next-generation sequencing platforms, have enabled pooled screening technologies that allow massively parallel, unbiased assessments of biological phenomena in human cells. This review provides a comprehensive overview of cutting-edge functional genomic screening technologies applicable to human cells, ranging from short hairpin RNA screens to modern CRISPR screens. Additionally, we explore the integration of CRISPR platforms with single-cell approaches to monitor gene expression, chromatin accessibility, epigenetic regulation, and chromatin architecture following genetic perturbations at the omics level. By offering an in-depth understanding of these genomic screening methods, this review aims to provide insights into more targeted and effective strategies for genomic research and personalized medicine.
Collapse
Affiliation(s)
- Sehee Yun
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Minsoo Noh
- Department of Life Sciences, Korea University, Seoul 02841, Korea; Department of Internal Medicine and Laboratory of Genomics and Translational Medicine, Gachon University College of Medicine, Incheon 21565, Korea
| | - Jivin Yu
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Hyeon-Jai Kim
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Chi-Chung Hui
- Program in Developmental and Stem Cell Biology, The Hospital for Sick Children, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Hunsang Lee
- Department of Life Sciences, Korea University, Seoul 02841, Korea.
| | - Joe Eun Son
- School of Food Science and Biotechnology, Kyungpook National University, Daegu 41566, Korea.
| |
Collapse
|
7
|
Yu H, Wu D, Mishra S, Shen G, Sun H, Hu M, Li Y. SnapFISH-IMPUTE: an imputation method for multiplexed DNA FISH data. Commun Biol 2024; 7:834. [PMID: 38982263 PMCID: PMC11233503 DOI: 10.1038/s42003-024-06428-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 06/07/2024] [Indexed: 07/11/2024] Open
Abstract
Chromatin spatial organization plays a crucial role in gene regulation. Recently developed and prospering multiplexed DNA FISH technologies enable direct visualization of chromatin conformation in the nucleus. However, incomplete data caused by limited detection efficiency can substantially complicate and impair downstream analysis. Here, we present SnapFISH-IMPUTE that imputes missing values in multiplexed DNA FISH data. Analysis on multiple published datasets shows that the proposed method preserves the distribution of pairwise distances between imaging loci, and the imputed chromatin conformations are indistinguishable from the observed conformations. Additionally, imputation greatly improves downstream analyses such as identifying enhancer-promoter loops and clustering cells into distinct cell types. SnapFISH-IMPUTE is freely available at https://github.com/hyuyu104/SnapFISH-IMPUTE .
Collapse
Affiliation(s)
- Hongyu Yu
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Daiqing Wu
- Department of Statistics, University of Toronto, Ontario, Canada
| | - Shreya Mishra
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Guning Shen
- Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA
| | - Huaigu Sun
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA.
| | - Yun Li
- Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA.
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA.
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA.
| |
Collapse
|
8
|
Pang QY, Chiu YC, Huang RYJ. Regulating epithelial-mesenchymal plasticity from 3D genome organization. Commun Biol 2024; 7:750. [PMID: 38902393 PMCID: PMC11190238 DOI: 10.1038/s42003-024-06441-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 06/11/2024] [Indexed: 06/22/2024] Open
Abstract
Epithelial-mesenchymal transition (EMT) is a dynamic process enabling polarized epithelial cells to acquire mesenchymal features implicated in development and carcinoma progression. As our understanding evolves, it is clear the reversible execution of EMT arises from complex epigenomic regulation involving histone modifications and 3-dimensional (3D) genome structural changes, leading to a cascade of transcriptional events. This review summarizes current knowledge on chromatin organization in EMT, with a focus on hierarchical structures of the 3D genome and chromatin accessibility changes.
Collapse
Affiliation(s)
- Qing You Pang
- Neuro-Oncology Research Laboratory, National Neuroscience Institute, Singapore, 308433, Singapore
| | - Yi-Chia Chiu
- School of Medicine, College of Medicine, National Taiwan University, Taipei, 10051, Taiwan
| | - Ruby Yun-Ju Huang
- School of Medicine, College of Medicine, National Taiwan University, Taipei, 10051, Taiwan.
- Center for Advanced Computing and Imaging in Biomedicine, National Taiwan University, Taipei, 10051, Taiwan.
- Department of Obstetrics & Gynaecology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 119077, Singapore.
| |
Collapse
|
9
|
Zhang L, Bartosovic M. Single-cell mapping of cell-type specific chromatin architecture in the central nervous system. Curr Opin Struct Biol 2024; 86:102824. [PMID: 38723561 DOI: 10.1016/j.sbi.2024.102824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/22/2024] [Accepted: 04/08/2024] [Indexed: 05/19/2024]
Abstract
Determining how chromatin is structured in the nucleus is critical to studying its role in gene regulation. Recent advances in the analysis of single-cell chromatin architecture have considerably improved our understanding of cell-type-specific chromosome conformation and nuclear architecture. In this review, we discuss the methods used for analysis of 3D chromatin conformation, including sequencing-based methods, imaging-based techniques, and computational approaches. We further review the application of these methods in the study of the role of chromatin topology in neural development and disorders.
Collapse
Affiliation(s)
- Letian Zhang
- Department of Biochemistry and Biophysics, Svante Arrhenius väg 16C, 162 53, Stockholm, Sweden. https://twitter.com/LetianZHANG_
| | - Marek Bartosovic
- Department of Biochemistry and Biophysics, Svante Arrhenius väg 16C, 162 53, Stockholm, Sweden.
| |
Collapse
|
10
|
Lyons A, Brown J, Davenport KM. Single-Cell Sequencing Technology in Ruminant Livestock: Challenges and Opportunities. Curr Issues Mol Biol 2024; 46:5291-5306. [PMID: 38920988 PMCID: PMC11202421 DOI: 10.3390/cimb46060316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 05/20/2024] [Accepted: 05/25/2024] [Indexed: 06/27/2024] Open
Abstract
Advancements in single-cell sequencing have transformed the genomics field by allowing researchers to delve into the intricate cellular heterogeneity within tissues at greater resolution. While single-cell omics are more widely applied in model organisms and humans, their use in livestock species is just beginning. Studies in cattle, sheep, and goats have already leveraged single-cell and single-nuclei RNA-seq as well as single-cell and single-nuclei ATAC-seq to delineate cellular diversity in tissues, track changes in cell populations and gene expression over developmental stages, and characterize immune cell populations important for disease resistance and resilience. Although challenges exist for the use of this technology in ruminant livestock, such as the precise annotation of unique cell populations and spatial resolution of cells within a tissue, there is vast potential to enhance our understanding of the cellular and molecular mechanisms underpinning traits essential for healthy and productive livestock. This review intends to highlight the insights gained from published single-cell omics studies in cattle, sheep, and goats, particularly those with publicly accessible data. Further, this manuscript will discuss the challenges and opportunities of this technology in ruminant livestock and how it may contribute to enhanced profitability and sustainability of animal agriculture in the future.
Collapse
|
11
|
Wang Y, Cheng J. HiCDiff: single-cell Hi-C data denoising with diffusion models. Brief Bioinform 2024; 25:bbae279. [PMID: 38856167 PMCID: PMC11163381 DOI: 10.1093/bib/bbae279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 05/21/2024] [Accepted: 05/29/2024] [Indexed: 06/11/2024] Open
Abstract
The genome-wide single-cell chromosome conformation capture technique, i.e. single-cell Hi-C (ScHi-C), was recently developed to interrogate the conformation of the genome of individual cells. However, single-cell Hi-C data are much sparser than bulk Hi-C data of a population of cells, and noise in single-cell Hi-C makes it difficult to apply and analyze them in biological research. Here, we developed the first generative diffusion models (HiCDiff) to denoise single-cell Hi-C data in the form of chromosomal contact matrices. HiCDiff uses a deep residual network to remove the noise in the reverse process of diffusion and can be trained in both unsupervised and supervised learning modes. Benchmarked on several single-cell Hi-C test datasets, the diffusion models substantially remove the noise in single-cell Hi-C data. The unsupervised HiCDiff outperforms most supervised non-diffusion deep learning methods and achieves the performance comparable to the state-of-the-art supervised deep learning method in terms of multiple metrics, demonstrating that diffusion models are a useful approach to denoising single-cell Hi-C data. Moreover, its good performance holds on denoising bulk Hi-C data.
Collapse
Affiliation(s)
- Yanli Wang
- Department of Electrical Engineering and Computer Science, NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| |
Collapse
|
12
|
Xu J, Xu X, Huang D, Luo Y, Lin L, Bai X, Zheng Y, Yang Q, Cheng Y, Huang A, Shi J, Bo X, Gu J, Chen H. A comprehensive benchmarking with interpretation and operational guidance for the hierarchy of topologically associating domains. Nat Commun 2024; 15:4376. [PMID: 38782890 PMCID: PMC11116433 DOI: 10.1038/s41467-024-48593-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 05/03/2024] [Indexed: 05/25/2024] Open
Abstract
Topologically associating domains (TADs), megabase-scale features of chromatin spatial architecture, are organized in a domain-within-domain TAD hierarchy. Within TADs, the inner and smaller subTADs not only manifest cell-to-cell variability, but also precisely regulate transcription and differentiation. Although over 20 TAD callers are able to detect TAD, their usability in biomedicine is confined by a disagreement of outputs and a limit in understanding TAD hierarchy. We compare 13 computational tools across various conditions and develop a metric to evaluate the similarity of TAD hierarchy. Although outputs of TAD hierarchy at each level vary among callers, data resolutions, sequencing depths, and matrices normalization, they are more consistent when they have a higher similarity of larger TADs. We present comprehensive benchmarking of TAD hierarchy callers and operational guidance to researchers of life science researchers. Moreover, by simulating the mixing of different types of cells, we confirm that TAD hierarchy is generated not simply from stacking Hi-C heatmaps of heterogeneous cells. Finally, we propose an air conditioner model to decipher the role of TAD hierarchy in transcription.
Collapse
Affiliation(s)
- Jingxuan Xu
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing, 100142, China
| | - Xiang Xu
- Academy of Military Medical Science, Beijing, 100850, China
| | - Dandan Huang
- Department of Oncology, Peking University Shougang Hospital, Beijing, China
- Center for Precision Diagnosis and Treatment of Colorectal Cancer and Inflammatory Diseases, Peking University Health Science Center, Beijing, China
| | - Yawen Luo
- Academy of Military Medical Science, Beijing, 100850, China
| | - Lin Lin
- Academy of Military Medical Science, Beijing, 100850, China
- School of Computer Science and Information Technology& KLAS, Northeast Normal University, Changchun, China
| | - Xuemei Bai
- Academy of Military Medical Science, Beijing, 100850, China
| | - Yang Zheng
- Academy of Military Medical Science, Beijing, 100850, China
| | - Qian Yang
- Academy of Military Medical Science, Beijing, 100850, China
| | - Yu Cheng
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing, 100142, China
| | - An Huang
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing, 100142, China
| | - Jingyi Shi
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing, 100142, China
| | - Xiaochen Bo
- Academy of Military Medical Science, Beijing, 100850, China.
| | - Jin Gu
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing, 100142, China.
- Department of Oncology, Peking University Shougang Hospital, Beijing, China.
- Center for Precision Diagnosis and Treatment of Colorectal Cancer and Inflammatory Diseases, Peking University Health Science Center, Beijing, China.
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
- Peking University International Cancer Institute, Beijing, China.
| | - Hebing Chen
- Academy of Military Medical Science, Beijing, 100850, China.
| |
Collapse
|
13
|
Zhou T, Zhang R, Jia D, Doty RT, Munday AD, Gao D, Xin L, Abkowitz JL, Duan Z, Ma J. GAGE-seq concurrently profiles multiscale 3D genome organization and gene expression in single cells. Nat Genet 2024:10.1038/s41588-024-01745-3. [PMID: 38744973 DOI: 10.1038/s41588-024-01745-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 04/05/2024] [Indexed: 05/16/2024]
Abstract
The organization of mammalian genomes features a complex, multiscale three-dimensional (3D) architecture, whose functional significance remains elusive because of limited single-cell technologies that can concurrently profile genome organization and transcriptional activities. Here, we introduce genome architecture and gene expression by sequencing (GAGE-seq), a scalable, robust single-cell co-assay measuring 3D genome structure and transcriptome simultaneously within the same cell. Applied to mouse brain cortex and human bone marrow CD34+ cells, GAGE-seq characterized the intricate relationships between 3D genome and gene expression, showing that multiscale 3D genome features inform cell-type-specific gene expression and link regulatory elements to target genes. Integration with spatial transcriptomic data revealed in situ 3D genome variations in mouse cortex. Observations in human hematopoiesis unveiled discordant changes between 3D genome organization and gene expression, underscoring a complex, temporal interplay at the single-cell level. GAGE-seq provides a powerful, cost-effective approach for exploring genome structure and gene expression relationships at the single-cell level across diverse biological contexts.
Collapse
Affiliation(s)
- Tianming Zhou
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ruochi Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Deyong Jia
- Department of Urology, University of Washington, Seattle, WA, USA
| | - Raymond T Doty
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Adam D Munday
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Daniel Gao
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Department of Chemistry, Pomona College, Claremont, CA, USA
| | - Li Xin
- Department of Urology, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Janis L Abkowitz
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Zhijun Duan
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA.
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA.
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
14
|
Xiong K, Zhang R, Ma J. scGHOST: identifying single-cell 3D genome subcompartments. Nat Methods 2024; 21:814-822. [PMID: 38589516 PMCID: PMC11127718 DOI: 10.1038/s41592-024-02230-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 03/01/2024] [Indexed: 04/10/2024]
Abstract
Single-cell Hi-C (scHi-C) technologies allow for probing of genome-wide cell-to-cell variability in three-dimensional (3D) genome organization from individual cells. Computational methods have been developed to reveal single-cell 3D genome features based on scHi-C, including A/B compartments, topologically associating domains and chromatin loops. However, no method exists for annotating single-cell subcompartments, which is important for understanding chromosome spatial localization in single cells. Here we present scGHOST, a single-cell subcompartment annotation method using graph embedding with constrained random walk sampling. Applications of scGHOST to scHi-C data and contact maps derived from single-cell 3D genome imaging demonstrate reliable identification of single-cell subcompartments, offering insights into cell-to-cell variability of nuclear subcompartments. Using scHi-C data from complex tissues, scGHOST identifies cell-type-specific or allele-specific subcompartments linked to gene transcription across various cell types and developmental stages, suggesting functional implications of single-cell subcompartments. scGHOST is an effective method for annotating single-cell 3D genome subcompartments in a broad range of biological contexts.
Collapse
Affiliation(s)
- Kyle Xiong
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ruochi Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
15
|
Shi Z, Wu H. CTPredictor: A comprehensive and robust framework for predicting cell types by integrating multi-scale features from single-cell Hi-C data. Comput Biol Med 2024; 173:108336. [PMID: 38513390 DOI: 10.1016/j.compbiomed.2024.108336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 03/01/2024] [Accepted: 03/17/2024] [Indexed: 03/23/2024]
Abstract
Single-cell Hi-C (scHi-C) has emerged as a powerful technology for deciphering cell-to-cell variability in three-dimensional (3D) chromatin organization, providing insights into genome-wide chromatin interactions and their correlation with cellular functions. Nevertheless, the accurate identification of cell types across different datasets remains a formidable challenge, hindering comprehensive investigations into genome structure. In response, we introduce CTPredictor, an innovative computational method that integrates multi-scale features to accurately predict cell types in various datasets. CTPredictor strategically incorporates three distinct feature sets, namely, small intra-domain contact probability (SICP), smoothed small intra-domain contact probability (SSICP), and smoothed bin contact probability (SBCP). The resulting fusion classification model significantly enhances the accuracy of cell type prediction based on single-cell Hi-C data (scHi-C). Rigorous benchmarking against established methods and three conventional machine learning approaches demonstrates the robust performance of CTPredictor, positioning it as an advanced tool for cell type prediction within scHi-C data. Beyond its prediction capabilities, CTPredictor holds promise in illuminating 3D genome structures and their functional significance across a wide array of biological processes.
Collapse
Affiliation(s)
- Zhenqi Shi
- School of Software, Shandong University, 250100, Jinan, China
| | - Hao Wu
- School of Software, Shandong University, 250100, Jinan, China.
| |
Collapse
|
16
|
Chang L, Xie Y, Taylor B, Wang Z, Sun J, Tan TR, Bejar R, Chen CC, Furnari FB, Hu M, Ren B. Droplet Hi-C for Fast and Scalable Profiling of Chromatin Architecture in Single Cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.18.590148. [PMID: 38712075 PMCID: PMC11071305 DOI: 10.1101/2024.04.18.590148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Comprehensive analysis of chromatin architecture is crucial for understanding the gene regulatory programs during development and in disease pathogenesis, yet current methods often inadequately address the unique challenges presented by analysis of heterogeneous tissue samples. Here, we introduce Droplet Hi-C, which employs a commercial microfluidic device for high-throughput, single-cell chromatin conformation profiling in droplets. Using Droplet Hi-C, we mapped the chromatin architecture at single-cell resolution from the mouse cortex and analyzed gene regulatory programs in major cortical cell types. Additionally, we used this technique to detect copy number variation (CNV), structural variations (SVs) and extrachromosomal DNA (ecDNA) in cancer cells, revealing clonal dynamics and other oncogenic events during treatment. We further refined this technique to allow for joint profiling of chromatin architecture and transcriptome in single cells, facilitating a more comprehensive exploration of the links between chromatin architecture and gene expression in both normal tissues and tumors. Thus, Droplet Hi-C not only addresses critical gaps in chromatin analysis of heterogeneous tissues but also emerges as a versatile tool enhancing our understanding of gene regulation in health and disease.
Collapse
Affiliation(s)
- Lei Chang
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Yang Xie
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Biomedical Sciences Graduate Program, University of California, San Diego, La Jolla, CA, USA
| | - Brett Taylor
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Medical Scientist Training Program, University of California, San Diego, La Jolla, CA, USA
| | - Zhaoning Wang
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Jiachen Sun
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
- Department of Systems Biology and Bioinformatics, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Tuyet R. Tan
- Moores Cancer Center, UC San Diego, La Jolla, CA, USA
| | - Rafael Bejar
- Moores Cancer Center, UC San Diego, La Jolla, CA, USA
| | - Clark C. Chen
- Department of Neurosurgery, University of Minnesota, Minneapolis, MN, USA
| | - Frank B. Furnari
- Department of Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Center for Epigenomics, Institute for Genomic Medicine, Moores Cancer Center, University of California, San Diego, School of Medicine, La Jolla, CA, USA
| |
Collapse
|
17
|
Cao Y, Zhao X, Tang S, Jiang Q, Li S, Li S, Chen S. scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders. Nat Commun 2024; 15:2973. [PMID: 38582890 PMCID: PMC10998864 DOI: 10.1038/s41467-024-47418-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Accepted: 03/28/2024] [Indexed: 04/08/2024] Open
Abstract
Recent advancements for simultaneously profiling multi-omics modalities within individual cells have enabled the interrogation of cellular heterogeneity and molecular hierarchy. However, technical limitations lead to highly noisy multi-modal data and substantial costs. Although computational methods have been proposed to translate single-cell data across modalities, broad applications of the methods still remain impeded by formidable challenges. Here, we propose scButterfly, a versatile single-cell cross-modality translation method based on dual-aligned variational autoencoders and data augmentation schemes. With comprehensive experiments on multiple datasets, we provide compelling evidence of scButterfly's superiority over baseline methods in preserving cellular heterogeneity while translating datasets of various contexts and in revealing cell type-specific biological insights. Besides, we demonstrate the extensive applications of scButterfly for integrative multi-omics analysis of single-modality data, data enhancement of poor-quality single-cell multi-omics, and automatic cell type annotation of scATAC-seq data. Moreover, scButterfly can be generalized to unpaired data training, perturbation-response analysis, and consecutive translation.
Collapse
Affiliation(s)
- Yichuan Cao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China
| | - Xiamiao Zhao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China
| | - Songming Tang
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China
| | - Qun Jiang
- MOE Key Laboratory of Bioinformatics and Bioinformatics Division of BNRIST, Department of Automation, Tsinghua University, 100084, Beijing, China
| | - Sijie Li
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China
| | - Siyu Li
- School of Statistics and Data Science, Nankai University, Tianjin, 300071, China
| | - Shengquan Chen
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China.
| |
Collapse
|
18
|
Liu E, Lyu H, Liu Y, Fu L, Cheng X, Yin X. Identifying TAD-like domains on single-cell Hi-C data by graph embedding and changepoint detection. Bioinformatics 2024; 40:btae138. [PMID: 38449288 PMCID: PMC10960928 DOI: 10.1093/bioinformatics/btae138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 01/10/2024] [Accepted: 03/05/2024] [Indexed: 03/08/2024] Open
Abstract
MOTIVATION Topologically associating domains (TADs) are fundamental building blocks of 3D genome. TAD-like domains in single cells are regarded as the underlying genesis of TADs discovered in bulk cells. Understanding the organization of TAD-like domains helps to get deeper insights into their regulatory functions. Unfortunately, it remains a challenge to identify TAD-like domains on single-cell Hi-C data due to its ultra-sparsity. RESULTS We propose scKTLD, an in silico tool for the identification of TAD-like domains on single-cell Hi-C data. It takes Hi-C contact matrix as the adjacency matrix for a graph, embeds the graph structures into a low-dimensional space with the help of sparse matrix factorization followed by spectral propagation, and the TAD-like domains can be identified using a kernel-based changepoint detection in the embedding space. The results tell that our scKTLD is superior to the other methods on the sparse contact matrices, including downsampled bulk Hi-C data as well as simulated and experimental single-cell Hi-C data. Besides, we demonstrated the conservation of TAD-like domain boundaries at single-cell level apart from heterogeneity within and across cell types, and found that the boundaries with higher frequency across single cells are more enriched for architectural proteins and chromatin marks, and they preferentially occur at TAD boundaries in bulk cells, especially at those with higher hierarchical levels. AVAILABILITY AND IMPLEMENTATION scKTLD is freely available at https://github.com/lhqxinghun/scKTLD.
Collapse
Affiliation(s)
- Erhu Liu
- School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China
| | - Hongqiang Lyu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi'an 710049, China
| | - Yuan Liu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi'an 710049, China
| | - Laiyi Fu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi'an 710049, China
| | - Xiaoliang Cheng
- Department of Pharmacy, The First Affiliated Hospital of Xi’an Jiaotong University, Xi'an 710061, China
| | - Xiaoran Yin
- Department of Oncology, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi'an 710004, China
| |
Collapse
|
19
|
Li H, Playter C, Das P, McCord RP. Chromosome compartmentalization: causes, changes, consequences, and conundrums. Trends Cell Biol 2024:S0962-8924(24)00021-7. [PMID: 38395734 DOI: 10.1016/j.tcb.2024.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 01/12/2024] [Accepted: 01/22/2024] [Indexed: 02/25/2024]
Abstract
The spatial segregation of the genome into compartments is a major feature of 3D genome organization. New data on mammalian chromosome organization across different conditions reveal important information about how and why these compartments form and change. A combination of epigenetic state, nuclear body tethering, physical forces, gene expression, and replication timing (RT) can all influence the establishment and alteration of chromosome compartments. We review the causes and implications of genomic regions undergoing a 'compartment switch' that changes their physical associations and spatial location in the nucleus. About 20-30% of genomic regions change compartment during cell differentiation or cancer progression, whereas alterations in response to a stimulus within a cell type are usually much more limited. However, even a change in 1-2% of genomic bins may have biologically relevant implications. Finally, we review the effects of compartment changes on gene regulation, DNA damage repair, replication, and the physical state of the cell.
Collapse
Affiliation(s)
- Heng Li
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN, USA
| | - Christopher Playter
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN, USA
| | - Priyojit Das
- University of Tennessee-Oak Ridge National Laboratory (UT-ORNL) Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, TN, USA
| | - Rachel Patton McCord
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN, USA.
| |
Collapse
|
20
|
Zhang K, Zemke NR, Armand EJ, Ren B. A fast, scalable and versatile tool for analysis of single-cell omics data. Nat Methods 2024; 21:217-227. [PMID: 38191932 PMCID: PMC10864184 DOI: 10.1038/s41592-023-02139-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 11/23/2023] [Indexed: 01/10/2024]
Abstract
Single-cell omics technologies have revolutionized the study of gene regulation in complex tissues. A major computational challenge in analyzing these datasets is to project the large-scale and high-dimensional data into low-dimensional space while retaining the relative relationships between cells. This low dimension embedding is necessary to decompose cellular heterogeneity and reconstruct cell-type-specific gene regulatory programs. Traditional dimensionality reduction techniques, however, face challenges in computational efficiency and in comprehensively addressing cellular diversity across varied molecular modalities. Here we introduce a nonlinear dimensionality reduction algorithm, embodied in the Python package SnapATAC2, which not only achieves a more precise capture of single-cell omics data heterogeneities but also ensures efficient runtime and memory usage, scaling linearly with the number of cells. Our algorithm demonstrates exceptional performance, scalability and versatility across diverse single-cell omics datasets, including single-cell assay for transposase-accessible chromatin using sequencing, single-cell RNA sequencing, single-cell Hi-C and single-cell multi-omics datasets, underscoring its utility in advancing single-cell analysis.
Collapse
Affiliation(s)
- Kai Zhang
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Westlake Laboratory of Life Sciences and Biomedicine, School of Life Sciences, Westlake University, Hangzhou, China
| | - Nathan R Zemke
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Ethan J Armand
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Ludwig Institute for Cancer Research, La Jolla, CA, USA.
- Institute for Genomic Medicine, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
21
|
Zhang Y, Boninsegna L, Yang M, Misteli T, Alber F, Ma J. Computational methods for analysing multiscale 3D genome organization. Nat Rev Genet 2024; 25:123-141. [PMID: 37673975 PMCID: PMC11127719 DOI: 10.1038/s41576-023-00638-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/12/2023] [Indexed: 09/08/2023]
Abstract
Recent progress in whole-genome mapping and imaging technologies has enabled the characterization of the spatial organization and folding of the genome in the nucleus. In parallel, advanced computational methods have been developed to leverage these mapping data to reveal multiscale three-dimensional (3D) genome features and to provide a more complete view of genome structure and its connections to genome functions such as transcription. Here, we discuss how recently developed computational tools, including machine-learning-based methods and integrative structure-modelling frameworks, have led to a systematic, multiscale delineation of the connections among different scales of 3D genome organization, genomic and epigenomic features, functional nuclear components and genome function. However, approaches that more comprehensively integrate a wide variety of genomic and imaging datasets are still needed to uncover the functional role of 3D genome structure in defining cellular phenotypes in health and disease.
Collapse
Affiliation(s)
- Yang Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Lorenzo Boninsegna
- Department of Microbiology, Immunology and Molecular Genetics and Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA, USA
| | - Muyu Yang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Tom Misteli
- Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA.
| | - Frank Alber
- Department of Microbiology, Immunology and Molecular Genetics and Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA, USA.
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
22
|
Mihai IS, Chafle S, Henriksson J. Representing and extracting knowledge from single-cell data. Biophys Rev 2024; 16:29-56. [PMID: 38495441 PMCID: PMC10937862 DOI: 10.1007/s12551-023-01091-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 06/28/2023] [Indexed: 03/19/2024] Open
Abstract
Single-cell analysis is currently one of the most high-resolution techniques to study biology. The large complex datasets that have been generated have spurred numerous developments in computational biology, in particular the use of advanced statistics and machine learning. This review attempts to explain the deeper theoretical concepts that underpin current state-of-the-art analysis methods. Single-cell analysis is covered from cell, through instruments, to current and upcoming models. The aim of this review is to spread concepts which are not yet in common use, especially from topology and generative processes, and how new statistical models can be developed to capture more of biology. This opens epistemological questions regarding our ontology and models, and some pointers will be given to how natural language processing (NLP) may help overcome our cognitive limitations for understanding single-cell data.
Collapse
Affiliation(s)
- Ionut Sebastian Mihai
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå, Sweden
- Umeå Centre for Microbial Research (UCMR), Department of Molecular Biology, Umeå University, Umeå, Sweden
- Industrial Doctoral School, Umeå University, Umeå, Sweden
| | - Sarang Chafle
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå, Sweden
- Umeå Centre for Microbial Research (UCMR), Department of Molecular Biology, Umeå University, Umeå, Sweden
| | - Johan Henriksson
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå, Sweden
- Umeå Centre for Microbial Research (UCMR), Department of Molecular Biology, Umeå University, Umeå, Sweden
| |
Collapse
|
23
|
Yu H, Wu D, Shen G, Hu M, Li Y. SnapFISH-IMPUTE: an imputation method for multiplexed DNA FISH data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.12.575427. [PMID: 38293083 PMCID: PMC10827092 DOI: 10.1101/2024.01.12.575427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Chromatin spatial organization plays a crucial role in gene regulation. Recently developed and prospering multiplexed DNA FISH technologies enable direct visualization of chromatin conformation in nucleus. However, incomplete data caused by limited detection efficiency can substantially complicate and impair downstream analysis. Here, we present SnapFISH-IMPUTE that imputes missing values in multiplexed DNA FISH data. Analysis on multiple published datasets shows that the proposed method preserves the distribution of pairwise distances between imaging loci, and the imputed chromatin conformations are indistinguishable from the observed conformations. Additionally, imputation greatly improves downstream analyses such as identifying enhancer-promoter loops and clustering cells into distinct cell types. SnapFISH-IMPUTE is freely available at https://github.com/hyuyu104/SnapFISH-IMPUTE.
Collapse
Affiliation(s)
- Hongyu Yu
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Daiqing Wu
- Department of Mathematics, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Guning Shen
- Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Yun Li
- Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| |
Collapse
|
24
|
Hua D, Gu M, Zhang X, Du Y, Xie H, Qi L, Du X, Bai Z, Zhu X, Tian D. DiffDomain enables identification of structurally reorganized topologically associating domains. Nat Commun 2024; 15:502. [PMID: 38218905 PMCID: PMC10787792 DOI: 10.1038/s41467-024-44782-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 01/02/2024] [Indexed: 01/15/2024] Open
Abstract
Topologically associating domains (TADs) are critical structural units in three-dimensional genome organization of mammalian genome. Dynamic reorganizations of TADs between health and disease states are associated with essential genome functions. However, computational methods for identifying reorganized TADs are still in the early stages of development. Here, we present DiffDomain, an algorithm leveraging high-dimensional random matrix theory to identify structurally reorganized TADs using high-throughput chromosome conformation capture (Hi-C) contact maps. Method comparison using multiple real Hi-C datasets reveals that DiffDomain outperforms alternative methods for false positive rates, true positive rates, and identifying a new subtype of reorganized TADs. Applying DiffDomain to Hi-C data from different cell types and disease states demonstrates its biological relevance. Identified reorganized TADs are associated with structural variations and epigenomic changes such as changes in CTCF binding sites. By applying to a single-cell Hi-C data from mouse neuronal development, DiffDomain can identify reorganized TADs between cell types with reasonable reproducibility using pseudo-bulk Hi-C data from as few as 100 cells per condition. Moreover, DiffDomain reveals differential cell-to-population variability and heterogeneous cell-to-cell variability in TADs. Therefore, DiffDomain is a statistically sound method for better comparative analysis of TADs using both Hi-C and single-cell Hi-C data.
Collapse
Affiliation(s)
- Dunming Hua
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Ming Gu
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Xiao Zhang
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Yanyi Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Hangcheng Xie
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Li Qi
- Chongqing Municipal Center for Disease Control and Prevention, Chongqing, 400042, China
| | - Xiangjun Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Zhidong Bai
- KLASMOE & School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, 130024, China
| | - Xiaopeng Zhu
- MyCellome LLC., Allison Park, PA, 15101, USA
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Dechao Tian
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China.
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| |
Collapse
|
25
|
Rothörl J, Brems MA, Stevens TJ, Virnau P. Reconstructing diploid 3D chromatin structures from single cell Hi-C data with a polymer-based approach. FRONTIERS IN BIOINFORMATICS 2023; 3:1284484. [PMID: 38148761 PMCID: PMC10750380 DOI: 10.3389/fbinf.2023.1284484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 11/24/2023] [Indexed: 12/28/2023] Open
Abstract
Detailed understanding of the 3D structure of chromatin is a key ingredient to investigate a variety of processes inside the cell. Since direct methods to experimentally ascertain these structures lack the desired spatial fidelity, computational inference methods based on single cell Hi-C data have gained significant interest. Here, we develop a progressive simulation protocol to iteratively improve the resolution of predicted interphase structures by maximum-likelihood association of ambiguous Hi-C contacts using lower-resolution predictions. Compared to state-of-the-art methods, our procedure is not limited to haploid cell data and allows us to reach a resolution of up to 5,000 base pairs per bead. High resolution chromatin models grant access to a multitude of structural phenomena. Exemplarily, we verify the formation of chromosome territories and holes near aggregated chromocenters as well as the inversion of the CpG content for rod photoreceptor cells.
Collapse
Affiliation(s)
- Jan Rothörl
- Institute of Physics, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| | - Maarten A. Brems
- Institute of Physics, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| | - Tim J. Stevens
- MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Peter Virnau
- Institute of Physics, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| |
Collapse
|
26
|
Zheng J, Yang Y, Dai Z. Subgraph extraction and graph representation learning for single cell Hi-C imputation and clustering. Brief Bioinform 2023; 25:bbad379. [PMID: 38040494 PMCID: PMC10691963 DOI: 10.1093/bib/bbad379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 09/10/2023] [Accepted: 10/03/2023] [Indexed: 12/03/2023] Open
Abstract
Single-cell Hi-C (scHi-C) technology enables the investigation of 3D chromatin structure variability across individual cells. However, the analysis of scHi-C data is challenged by a large number of missing values. Here, we present a scHi-C data imputation model HiC-SGL, based on Subgraph extraction and graph representation learning. HiC-SGL can also learn informative low-dimensional embeddings of cells. We demonstrate that our method surpasses existing methods in terms of imputation accuracy and clustering performance by various metrics.
Collapse
Affiliation(s)
- Jiahao Zheng
- School of Computer Science and Engineering, Sun Yat-Sen University, 510006 Guangzhou, China
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-Sen University, 510006 Guangzhou, China
| | - Zhiming Dai
- School of Computer Science and Engineering, Sun Yat-Sen University, 510006 Guangzhou, China
| |
Collapse
|
27
|
Gunsalus LM, Keiser MJ, Pollard KS. ChromaFactor: deconvolution of single-molecule chromatin organization with non-negative matrix factorization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.22.568268. [PMID: 38045231 PMCID: PMC10690235 DOI: 10.1101/2023.11.22.568268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The investigation of chromatin organization in single cells holds great promise for identifying causal relationships between genome structure and function. However, analysis of single-molecule data is hampered by extreme yet inherent heterogeneity, making it challenging to determine the contributions of individual chromatin fibers to bulk trends. To address this challenge, we propose ChromaFactor, a novel computational approach based on non-negative matrix factorization that deconvolves single-molecule chromatin organization datasets into their most salient primary components. ChromaFactor provides the ability to identify trends accounting for the maximum variance in the dataset while simultaneously describing the contribution of individual molecules to each component. Applying our approach to two single-molecule imaging datasets across different genomic scales, we find that these primary components demonstrate significant correlation with key functional phenotypes, including active transcription, enhancer-promoter distance, and genomic compartment. ChromaFactor offers a robust tool for understanding the complex interplay between chromatin structure and function on individual DNA molecules, pinpointing which subpopulations drive functional changes and fostering new insights into cellular heterogeneity and its implications for bulk genomic phenomena.
Collapse
Affiliation(s)
- Laura M. Gunsalus
- Gladstone Institutes, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
| | - Michael J. Keiser
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA
- Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, San Francisco, CA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, San Francisco, CA
| | - Katherine S. Pollard
- Gladstone Institutes, San Francisco, CA
- Institute for Neurodegenerative Diseases, University of California, San Francisco, San Francisco, CA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA
- Chan Zuckerberg Biohub, San Francisco, CA
| |
Collapse
|
28
|
Wang F, Alinejad‐Rokny H, Lin J, Gao T, Chen X, Zheng Z, Meng L, Li X, Wong K. A Lightweight Framework For Chromatin Loop Detection at the Single-Cell Level. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2303502. [PMID: 37816141 PMCID: PMC10667817 DOI: 10.1002/advs.202303502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/10/2023] [Indexed: 10/12/2023]
Abstract
Single-cell Hi-C (scHi-C) has made it possible to analyze chromatin organization at the single-cell level. However, scHi-C experiments generate inherently sparse data, which poses a challenge for loop calling methods. The existing approach performs significance tests across the imputed dense contact maps, leading to substantial computational overhead and loss of information at the single-cell level. To overcome this limitation, a lightweight framework called scGSLoop is proposed, which sets a new paradigm for scHi-C loop calling by adapting the training and inferencing strategies of graph-based deep learning to leverage the sequence features and 1D positional information of genomic loci. With this framework, sparsity is no longer a challenge, but rather an advantage that the model leverages to achieve unprecedented computational efficiency. Compared to existing methods, scGSLoop makes more accurate predictions and is able to identify more loops that have the potential to play regulatory roles in genome functioning. Moreover, scGSLoop preserves single-cell information by identifying a distinct group of loops for each individual cell, which not only enables an understanding of the variability of chromatin looping states between cells, but also allows scGSLoop to be extended for the investigation of multi-connected hubs and their underlying mechanisms.
Collapse
Affiliation(s)
- Fuzhou Wang
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Hamid Alinejad‐Rokny
- BioMedical Machine Learning Lab, Graduate School of Biomedical EngineeringUniversity of New South WalesSydney2052Australia
| | - Jiecong Lin
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General HospitalDepartment of PathologyHarvard Medical SchoolBostonMA02129USA
- Department of Computer ScienceThe University of Hong KongPok Fu LamHong Kong SAR
| | - Tingxiao Gao
- Department of Medical Biophysics, Faculty of MedicineUniversity of TorontoTorontoOntarioM5G1L7Canada
| | - Xingjian Chen
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Zetian Zheng
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Lingkuan Meng
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| | - Xiangtao Li
- School of Artificial IntelligenceJilin UniversityChangchun130012China
| | - Ka‐Chun Wong
- Department of Computer ScienceCity University of Hong KongKowloon TongHong Kong SAR
| |
Collapse
|
29
|
Badia-I-Mompel P, Wessels L, Müller-Dott S, Trimbour R, Ramirez Flores RO, Argelaguet R, Saez-Rodriguez J. Gene regulatory network inference in the era of single-cell multi-omics. Nat Rev Genet 2023; 24:739-754. [PMID: 37365273 DOI: 10.1038/s41576-023-00618-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 06/28/2023]
Abstract
The interplay between chromatin, transcription factors and genes generates complex regulatory circuits that can be represented as gene regulatory networks (GRNs). The study of GRNs is useful to understand how cellular identity is established, maintained and disrupted in disease. GRNs can be inferred from experimental data - historically, bulk omics data - and/or from the literature. The advent of single-cell multi-omics technologies has led to the development of novel computational methods that leverage genomic, transcriptomic and chromatin accessibility information to infer GRNs at an unprecedented resolution. Here, we review the key principles of inferring GRNs that encompass transcription factor-gene interactions from transcriptomics and chromatin accessibility data. We focus on the comparison and classification of methods that use single-cell multimodal data. We highlight challenges in GRN inference, in particular with respect to benchmarking, and potential further developments using additional data modalities.
Collapse
Affiliation(s)
- Pau Badia-I-Mompel
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Lorna Wessels
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Department of Vascular Biology and Tumor Angiogenesis, European Center for Angioscience, Medical Faculty, MannHeim Heidelberg University, Mannheim, Germany
| | - Sophia Müller-Dott
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Rémi Trimbour
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Ricardo O Ramirez Flores
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | | | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany.
| |
Collapse
|
30
|
龚 海, 麻 付, 张 晓. [Advances in methods and applications of single-cell Hi-C data analysis]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2023; 40:1033-1039. [PMID: 37879935 PMCID: PMC10600426 DOI: 10.7507/1001-5515.202303046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 08/29/2023] [Indexed: 10/27/2023]
Abstract
Chromatin three-dimensional genome structure plays a key role in cell function and gene regulation. Single-cell Hi-C techniques can capture genomic structure information at the cellular level, which provides an opportunity to study changes in genomic structure between different cell types. Recently, some excellent computational methods have been developed for single-cell Hi-C data analysis. In this paper, the available methods for single-cell Hi-C data analysis were first reviewed, including preprocessing of single-cell Hi-C data, multi-scale structure recognition based on single-cell Hi-C data, bulk-like Hi-C contact matrix generation based on single-cell Hi-C data sets, pseudo-time series analysis, and cell classification. Then the application of single-cell Hi-C data in cell differentiation and structural variation was described. Finally, the future development direction of single-cell Hi-C data analysis was also prospected.
Collapse
Affiliation(s)
- 海燕 龚
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 付强 麻
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 晓彤 张
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| |
Collapse
|
31
|
Tian W, Zhou J, Bartlett A, Zeng Q, Liu H, Castanon RG, Kenworthy M, Altshul J, Valadon C, Aldridge A, Nery JR, Chen H, Xu J, Johnson ND, Lucero J, Osteen JK, Emerson N, Rink J, Lee J, Li Y, Siletti K, Liem M, Claffey N, O’Connor C, Yanny AM, Nyhus J, Dee N, Casper T, Shapovalova N, Hirschstein D, Ding SL, Hodge R, Levi BP, Keene CD, Linnarsson S, Lein E, Ren B, Behrens MM, Ecker JR. Single-cell DNA methylation and 3D genome architecture in the human brain. Science 2023; 382:eadf5357. [PMID: 37824674 PMCID: PMC10572106 DOI: 10.1126/science.adf5357] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 09/05/2023] [Indexed: 10/14/2023]
Abstract
Delineating the gene-regulatory programs underlying complex cell types is fundamental for understanding brain function in health and disease. Here, we comprehensively examined human brain cell epigenomes by probing DNA methylation and chromatin conformation at single-cell resolution in 517 thousand cells (399 thousand neurons and 118 thousand non-neurons) from 46 regions of three adult male brains. We identified 188 cell types and characterized their molecular signatures. Integrative analyses revealed concordant changes in DNA methylation, chromatin accessibility, chromatin organization, and gene expression across cell types, cortical areas, and basal ganglia structures. We further developed single-cell methylation barcodes that reliably predict brain cell types using the methylation status of select genomic sites. This multimodal epigenomic brain cell atlas provides new insights into the complexity of cell-type-specific gene regulation in adult human brains.
Collapse
Affiliation(s)
- Wei Tian
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jingtian Zhou
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Anna Bartlett
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Qiurui Zeng
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92037, USA
| | - Hanqing Liu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Rosa G. Castanon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Mia Kenworthy
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jordan Altshul
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Cynthia Valadon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Andrew Aldridge
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Joseph R. Nery
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Huaming Chen
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jiaying Xu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Nicholas D. Johnson
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jacinta Lucero
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Julia K. Osteen
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Nora Emerson
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jon Rink
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jasper Lee
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Yang Li
- Ludwig Institute for Cancer Research, La Jolla, CA 92037, USA
| | - Kimberly Siletti
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet; 171 77 Stockholm, Sweden
| | - Michelle Liem
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Naomi Claffey
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Caz O’Connor
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | | | - Julie Nyhus
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Nick Dee
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Tamara Casper
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | | | | | - Song-Lin Ding
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Rebecca Hodge
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Boaz P. Levi
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - C. Dirk Keene
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
| | - Sten Linnarsson
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet; 171 77 Stockholm, Sweden
| | - Ed Lein
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, La Jolla, CA 92037, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Institute of Genomic Medicine, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Moores Cancer Center, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
| | - M. Margarita Behrens
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Joseph R. Ecker
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| |
Collapse
|
32
|
Zhou Y, Li T, Choppavarapu L, Jin VX. Integration of scHi-C and scRNA-seq data defines distinct 3D-regulated and biological-context dependent cell subpopulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.29.560193. [PMID: 37873257 PMCID: PMC10592853 DOI: 10.1101/2023.09.29.560193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
An integration of 3D chromatin structure and gene expression at single-cell resolution has yet been demonstrated. Here, we develop a computational method, a multiomic data integration (MUDI) algorithm, which integrates scHi-C and scRNA-seq data to precisely define the 3D-regulated and biological-context dependent cell subpopulations or topologically integrated subpopulations (TISPs). We demonstrate its algorithmic utility on the publicly available and newly generated scHi-C and scRNA-seq data. We then test and apply MUDI in a breast cancer cell model system to demonstrate its biological-context dependent utility. We found the newly defined topologically conserved associating domain (CAD) is the characteristic single-cell 3D chromatin structure and better characterizes chromatin domains in single-cell resolution. We further identify 20 TISPs uniquely characterizing 3D-regulated breast cancer cellular states. We reveal two of TISPs are remarkably resemble to high cycling breast cancer persister cells and chromatin modifying enzymes might be functional regulators to drive the alteration of the 3D chromatin structures. Our comprehensive integration of scHi-C and scRNA-seq data in cancer cells at single-cell resolution provides mechanistic insights into 3D-regulated heterogeneity of developing drug-tolerant cancer cells.
Collapse
|
33
|
Tan L, Shi J, Moghadami S, Parasar B, Wright CP, Seo Y, Vallejo K, Cobos I, Duncan L, Chen R, Deisseroth K. Lifelong restructuring of 3D genome architecture in cerebellar granule cells. Science 2023; 381:1112-1119. [PMID: 37676945 PMCID: PMC11059189 DOI: 10.1126/science.adh3253] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Accepted: 08/03/2023] [Indexed: 09/09/2023]
Abstract
The cerebellum contains most of the neurons in the human brain and exhibits distinctive modes of development and aging. In this work, by developing our single-cell three-dimensional (3D) genome assay-diploid chromosome conformation capture, or Dip-C-into population-scale (Pop-C) and virus-enriched (vDip-C) modes, we resolved the first 3D genome structures of single cerebellar cells, created life-spanning 3D genome atlases for both humans and mice, and jointly measured transcriptome and chromatin accessibility during development. We found that although the transcriptome and chromatin accessibility of cerebellar granule neurons mature in early postnatal life, 3D genome architecture gradually remodels throughout life, establishing ultra-long-range intrachromosomal contacts and specific interchromosomal contacts that are rarely seen in neurons. These results reveal unexpected evolutionarily conserved molecular processes that underlie distinctive features of neural development and aging across the mammalian life span.
Collapse
Affiliation(s)
- Longzhi Tan
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
| | - Jenny Shi
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
- Department of Chemistry, Stanford University, Stanford, CA, 94305
| | - Siavash Moghadami
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA, 94305
| | - Bibudha Parasar
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
| | - Cydney P. Wright
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Biology, Stanford University, Stanford, CA, 94305
| | - Yunji Seo
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
| | - Kristen Vallejo
- Department of Pathology, Stanford University, Stanford, CA, 94305
| | - Inma Cobos
- Department of Pathology, Stanford University, Stanford, CA, 94305
| | - Laramie Duncan
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, 94305
| | - Ritchie Chen
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
| | - Karl Deisseroth
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, 94305
- Howard Hughes Medical Institute, Stanford, CA, 94305
| |
Collapse
|
34
|
Qu J, Sun J, Zhao C, Liu X, Zhang X, Jiang S, Wei C, Yu H, Zeng X, Fan L, Ding J. Simultaneous profiling of chromatin architecture and transcription in single cells. Nat Struct Mol Biol 2023; 30:1393-1402. [PMID: 37580628 DOI: 10.1038/s41594-023-01066-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 07/12/2023] [Indexed: 08/16/2023]
Abstract
The three-dimensional structure of chromatin plays a crucial role in development and disease, both of which are associated with transcriptional changes. However, given the heterogeneity in single-cell chromatin architecture and transcription, the regulatory relationship between the three-dimensional chromatin structure and gene expression is difficult to explain based on bulk cell populations. Here we develop a single-cell, multimodal, omics method allowing the simultaneous detection of chromatin architecture and messenger RNA expression by sequencing (single-cell transcriptome sequencing (scCARE-seq)). Applying scCARE-seq to examine chromatin architecture and transcription from 2i to serum single mouse embryonic stem cells, we observe improved separation of cell clusters compared with single-cell chromatin conformation capture. In addition, after defining the cell-cycle phase of each cell through chromatin architecture extracted by scCARE-seq, we find that periodic changes in chromatin architecture occur in parallel with transcription during the cell cycle. These findings highlight the potential of scCARE-seq to facilitate comprehensive analyses that may boost our understanding of chromatin architecture and transcription in the same single cell.
Collapse
Affiliation(s)
- Jiale Qu
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Jun Sun
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Cai Zhao
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Xinyi Liu
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Xinyao Zhang
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Shaoshuai Jiang
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Chao Wei
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| | - Haopeng Yu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
| | - Xiaoxi Zeng
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.
| | - Lili Fan
- Guangzhou Key Laboratory of Formula-Pattern of Traditional Chinese Medicine, School of Traditional Chinese Medicine, Jinan University, Guangzhou, China.
| | - Junjun Ding
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.
- Department of Histology and Embryology, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China.
| |
Collapse
|
35
|
Dekker J, Alber F, Aufmkolk S, Beliveau BJ, Bruneau BG, Belmont AS, Bintu L, Boettiger A, Calandrelli R, Disteche CM, Gilbert DM, Gregor T, Hansen AS, Huang B, Huangfu D, Kalhor R, Leslie CS, Li W, Li Y, Ma J, Noble WS, Park PJ, Phillips-Cremins JE, Pollard KS, Rafelski SM, Ren B, Ruan Y, Shav-Tal Y, Shen Y, Shendure J, Shu X, Strambio-De-Castillia C, Vertii A, Zhang H, Zhong S. Spatial and temporal organization of the genome: Current state and future aims of the 4D nucleome project. Mol Cell 2023; 83:2624-2640. [PMID: 37419111 PMCID: PMC10528254 DOI: 10.1016/j.molcel.2023.06.018] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/10/2023] [Accepted: 06/12/2023] [Indexed: 07/09/2023]
Abstract
The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time. We summarize progress by the consortium and highlight the development of technologies for (1) mapping genome folding and identifying roles of nuclear components and bodies, proteins, and RNA, (2) characterizing nuclear organization with time or single-cell resolution, and (3) imaging of nuclear organization. With these tools, the consortium has provided over 2,000 public datasets. Integrative computational models based on these data are starting to reveal connections between genome structure and function. We then present a forward-looking perspective and outline current aims to (1) delineate dynamics of nuclear architecture at different timescales, from minutes to weeks as cells differentiate, in populations and in single cells, (2) characterize cis-determinants and trans-modulators of genome organization, (3) test functional consequences of changes in cis- and trans-regulators, and (4) develop predictive models of genome structure and function.
Collapse
Affiliation(s)
- Job Dekker
- University of Massachusetts Chan Medical School, Boston, MA, USA; Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Frank Alber
- University of California, Los Angeles, Los Angeles, CA, USA
| | | | | | - Benoit G Bruneau
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | - Bo Huang
- University of California, San Francisco, San Francisco, CA, USA
| | - Danwei Huangfu
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Reza Kalhor
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Wenbo Li
- University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Yun Li
- University of North Carolina, Gillings School of Global Public Health, Chapel Hill, NC, USA
| | - Jian Ma
- Carnegie Mellon University, Pittsburgh, PA, USA
| | | | | | | | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA; Chan Zuckerberg Biohub, San Francisco, San Francisco, CA, USA
| | | | - Bing Ren
- University of California, San Diego, La Jolla, CA, USA
| | - Yijun Ruan
- Zhejiang University, Hangzhou, Zhejiang, China
| | | | - Yin Shen
- University of California, San Francisco, San Francisco, CA, USA
| | | | - Xiaokun Shu
- University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | - Sheng Zhong
- University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
36
|
Wang H, Fu T, Du Y, Gao W, Huang K, Liu Z, Chandak P, Liu S, Van Katwyk P, Deac A, Anandkumar A, Bergen K, Gomes CP, Ho S, Kohli P, Lasenby J, Leskovec J, Liu TY, Manrai A, Marks D, Ramsundar B, Song L, Sun J, Tang J, Veličković P, Welling M, Zhang L, Coley CW, Bengio Y, Zitnik M. Scientific discovery in the age of artificial intelligence. Nature 2023; 620:47-60. [PMID: 37532811 DOI: 10.1038/s41586-023-06221-2] [Citation(s) in RCA: 89] [Impact Index Per Article: 89.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 05/16/2023] [Indexed: 08/04/2023]
Abstract
Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment and accelerate research, helping scientists to generate hypotheses, design experiments, collect and interpret large datasets, and gain insights that might not have been possible using traditional scientific methods alone. Here we examine breakthroughs over the past decade that include self-supervised learning, which allows models to be trained on vast amounts of unlabelled data, and geometric deep learning, which leverages knowledge about the structure of scientific data to enhance model accuracy and efficiency. Generative AI methods can create designs, such as small-molecule drugs and proteins, by analysing diverse data modalities, including images and sequences. We discuss how these methods can help scientists throughout the scientific process and the central issues that remain despite such advances. Both developers and users of AI toolsneed a better understanding of when such approaches need improvement, and challenges posed by poor data quality and stewardship remain. These issues cut across scientific disciplines and require developing foundational algorithmic approaches that can contribute to scientific understanding or acquire it autonomously, making them critical areas of focus for AI innovation.
Collapse
Affiliation(s)
- Hanchen Wang
- Department of Engineering, University of Cambridge, Cambridge, UK
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
- Department of Research and Early Development, Genentech Inc, South San Francisco, CA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Tianfan Fu
- Department of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Yuanqi Du
- Department of Computer Science, Cornell University, Ithaca, NY, USA
| | - Wenhao Gao
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kexin Huang
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Ziming Liu
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Payal Chandak
- Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA
| | - Shengchao Liu
- Mila - Quebec AI Institute, Montreal, Quebec, Canada
- Université de Montréal, Montreal, Quebec, Canada
| | - Peter Van Katwyk
- Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA
- Data Science Institute, Brown University, Providence, RI, USA
| | - Andreea Deac
- Mila - Quebec AI Institute, Montreal, Quebec, Canada
- Université de Montréal, Montreal, Quebec, Canada
| | - Anima Anandkumar
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
- NVIDIA, Santa Clara, CA, USA
| | - Karianne Bergen
- Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA
- Data Science Institute, Brown University, Providence, RI, USA
| | - Carla P Gomes
- Department of Computer Science, Cornell University, Ithaca, NY, USA
| | - Shirley Ho
- Center for Computational Astrophysics, Flatiron Institute, New York, NY, USA
- Department of Astrophysical Sciences, Princeton University, Princeton, NJ, USA
- Department of Physics, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Physics and Center for Data Science, New York University, New York, NY, USA
| | | | - Joan Lasenby
- Department of Engineering, University of Cambridge, Cambridge, UK
| | - Jure Leskovec
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | | | - Arjun Manrai
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Debora Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Le Song
- BioMap, Beijing, China
- Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
| | - Jimeng Sun
- University of Illinois at Urbana-Champaign, Champaign, IL, USA
| | - Jian Tang
- Mila - Quebec AI Institute, Montreal, Quebec, Canada
- HEC Montréal, Montreal, Quebec, Canada
- CIFAR AI Chair, Toronto, Ontario, Canada
| | - Petar Veličković
- Google DeepMind, London, UK
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Max Welling
- University of Amsterdam, Amsterdam, Netherlands
- Microsoft Research Amsterdam, Amsterdam, Netherlands
| | - Linfeng Zhang
- DP Technology, Beijing, China
- AI for Science Institute, Beijing, China
| | - Connor W Coley
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yoshua Bengio
- Mila - Quebec AI Institute, Montreal, Quebec, Canada
- Université de Montréal, Montreal, Quebec, Canada
| | - Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Harvard Data Science Initiative, Cambridge, MA, USA.
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
37
|
Wang Y, Guo Z, Cheng J. Single-cell Hi-C data enhancement with deep residual and generative adversarial networks. Bioinformatics 2023; 39:btad458. [PMID: 37498561 PMCID: PMC10403428 DOI: 10.1093/bioinformatics/btad458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 07/19/2023] [Accepted: 07/25/2023] [Indexed: 07/28/2023] Open
Abstract
MOTIVATION The spatial genome organization of a eukaryotic cell is important for its function. The development of single-cell technologies for probing the 3D genome conformation, especially single-cell chromosome conformation capture techniques, has enabled us to understand genome function better than before. However, due to extreme sparsity and high noise associated with single-cell Hi-C data, it is still difficult to study genome structure and function using the HiC-data of one single cell. RESULTS In this work, we developed a deep learning method ScHiCEDRN based on deep residual networks and generative adversarial networks for the imputation and enhancement of Hi-C data of a single cell. In terms of both image evaluation and Hi-C reproducibility metrics, ScHiCEDRN outperforms the four deep learning methods (DeepHiC, HiCPlus, HiCSR, and Loopenhance) on enhancing the raw single-cell Hi-C data of human and Drosophila. The experiments also show that it can generate single-cell Hi-C data more suitable for identifying topologically associating domain boundaries and reconstructing 3D chromosome structures than the existing methods. Moreover, ScHiCEDRN's performance generalizes well across different single cells and cell types, and it can be applied to improving population Hi-C data. AVAILABILITY AND IMPLEMENTATION The source code of ScHiCEDRN is available at the GitHub repository: https://github.com/BioinfoMachineLearning/ScHiCEDRN.
Collapse
Affiliation(s)
- Yanli Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, United States
- NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| | - Zhiye Guo
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, United States
- NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, United States
- NextGen Precision Health Institute, University of Missouri, Columbia, MO 65211, United States
| |
Collapse
|
38
|
Gao VR, Yang R, Das A, Luo R, Luo H, McNally DR, Karagiannidis I, Rivas MA, Wang ZM, Barisic D, Karbalayghareh A, Wong W, Zhan YA, Chin CR, Noble W, Bilmes JA, Apostolou E, Kharas MG, Béguelin W, Viny AD, Huangfu D, Rudensky AY, Melnick AM, Leslie CS. ChromaFold predicts the 3D contact map from single-cell chromatin accessibility. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.27.550836. [PMID: 37546906 PMCID: PMC10402156 DOI: 10.1101/2023.07.27.550836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
The identification of cell-type-specific 3D chromatin interactions between regulatory elements can help to decipher gene regulation and to interpret the function of disease-associated non-coding variants. However, current chromosome conformation capture (3C) technologies are unable to resolve interactions at this resolution when only small numbers of cells are available as input. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps and regulatory interactions from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility profiles across metacells, and predicted CTCF motif tracks as input features and employs a lightweight architecture to enable training on standard GPUs. Once trained on paired scATAC-seq and Hi-C data in human cell lines and tissues, ChromaFold can accurately predict both the 3D contact map and peak-level interactions across diverse human and mouse test cell types. In benchmarking against a recent deep learning method that uses bulk ATAC-seq, DNA sequence, and CTCF ChIP-seq to make cell-type-specific predictions, ChromaFold yields superior prediction performance when including CTCF ChIP-seq data as an input and comparable performance without. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations. ChromaFold thus achieves state-of-the-art prediction of 3D contact maps and regulatory interactions using scATAC-seq alone as input data, enabling accurate inference of cell-type-specific interactions in settings where 3C-based assays are infeasible.
Collapse
Affiliation(s)
- Vianne R. Gao
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional Program in Computational Biology and Medicine, New York, NY, USA
| | - Rui Yang
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional Program in Computational Biology and Medicine, New York, NY, USA
| | - Arnav Das
- University of Washington, Seattle, WA, USA
| | - Renhe Luo
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
| | - Hanzhi Luo
- Molecular Pharmacology Program, Experimental Therapeutics Center and Center for Stem Cell Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Dylan R. McNally
- Caryl and Israel Englander Institute for Precision Medicine, Institute for Computational Biomedicine, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Ioannis Karagiannidis
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Martin A. Rivas
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Zhong-Min Wang
- Howard Hughes Medical Institute and Immunology Program, Sloan Kettering Institute and Ludwig Center at Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Darko Barisic
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Alireza Karbalayghareh
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Wilfred Wong
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional Program in Computational Biology and Medicine, New York, NY, USA
| | - Yingqian A. Zhan
- Center for Epigenetics Research, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Christopher R. Chin
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | | | | | - Effie Apostolou
- Sanford I Weill department of Medicine, Sandra and Edward Meyer Cancer center, Weill Cornell Medicine, New York, NY, USA
| | - Michael G. Kharas
- Molecular Pharmacology Program, Experimental Therapeutics Center and Center for Stem Cell Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Wendy Béguelin
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Aaron D. Viny
- Departments of Medicine, Division of Hematology & Oncology, and of Genetics & Development, Columbia Stem Cell Initiative, Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Danwei Huangfu
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
| | - Alexander Y. Rudensky
- Howard Hughes Medical Institute and Immunology Program, Sloan Kettering Institute and Ludwig Center at Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ari M. Melnick
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Christina S. Leslie
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| |
Collapse
|
39
|
Zhou T, Zhang R, Jia D, Doty RT, Munday AD, Gao D, Xin L, Abkowitz JL, Duan Z, Ma J. Concurrent profiling of multiscale 3D genome organization and gene expression in single mammalian cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.20.549578. [PMID: 37546900 PMCID: PMC10401946 DOI: 10.1101/2023.07.20.549578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
The organization of mammalian genomes within the nucleus features a complex, multiscale three-dimensional (3D) architecture. The functional significance of these 3D genome features, however, remains largely elusive due to limited single-cell technologies that can concurrently profile genome organization and transcriptional activities. Here, we report GAGE-seq, a highly scalable, robust single-cell co-assay that simultaneously measures 3D genome structure and transcriptome within the same cell. Employing GAGE-seq on mouse brain cortex and human bone marrow CD34+ cells, we comprehensively characterized the intricate relationships between 3D genome and gene expression. We found that these multiscale 3D genome features collectively inform cell type-specific gene expressions, hence contributing to defining cell identity at the single-cell level. Integration of GAGE-seq data with spatial transcriptomic data revealed in situ variations of the 3D genome in mouse cortex. Moreover, our observations of lineage commitment in normal human hematopoiesis unveiled notable discordant changes between 3D genome organization and gene expression, underscoring a complex, temporal interplay at the single-cell level that is more nuanced than previously appreciated. Together, GAGE-seq provides a powerful, cost-effective approach for interrogating genome structure and gene expression relationships at the single-cell level across diverse biological contexts.
Collapse
Affiliation(s)
- Tianming Zhou
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ruochi Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Present address: Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Deyong Jia
- Department of Urology, University of Washington, Seattle, WA 98195, USA
| | - Raymond T. Doty
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Adam D. Munday
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Daniel Gao
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
- Present address: Department of Chemistry, Pomona College, Claremont, CA 91711, USA
| | - Li Xin
- Department of Urology, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Janis L. Abkowitz
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Zhijun Duan
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
40
|
Littman R, Cheng M, Wang N, Peng C, Yang X. SCING: Inference of robust, interpretable gene regulatory networks from single cell and spatial transcriptomics. iScience 2023; 26:107124. [PMID: 37434694 PMCID: PMC10331489 DOI: 10.1016/j.isci.2023.107124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 03/31/2023] [Accepted: 06/09/2023] [Indexed: 07/13/2023] Open
Abstract
Gene regulatory network (GRN) inference is an integral part of understanding physiology and disease. Single cell/nuclei RNA-seq (scRNA-seq/snRNA-seq) data has been used to elucidate cell-type GRNs; however, the accuracy and speed of current scRNAseq-based GRN approaches are suboptimal. Here, we present Single Cell INtegrative Gene regulatory network inference (SCING), a gradient boosting and mutual information-based approach for identifying robust GRNs from scRNA-seq, snRNA-seq, and spatial transcriptomics data. Performance evaluation using Perturb-seq datasets, held-out data, and the mouse cell atlas combined with the DisGeNET database demonstrates the improved accuracy and biological interpretability of SCING compared to existing methods. We applied SCING to the entire mouse single cell atlas, human Alzheimer's disease (AD), and mouse AD spatial transcriptomics. SCING GRNs reveal unique disease subnetwork modeling capabilities, have intrinsic capacity to correct for batch effects, retrieve disease relevant genes and pathways, and are informative on spatial specificity of disease pathogenesis.
Collapse
Affiliation(s)
- Russell Littman
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Michael Cheng
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Ning Wang
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
| | - Chao Peng
- Department of Neurology, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
| | - Xia Yang
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biosciences (QCBio), Los Angeles, CA, USA
- Molecular Biology Institute (MBI), Los Angeles, CA, USA
- Brain Research Institute (BRI), Los Angeles, CA, USA
| |
Collapse
|
41
|
Prakash A, Banerjee M. An interpretable block-attention network for identifying regulatory feature interactions. Brief Bioinform 2023; 24:bbad250. [PMID: 37401370 DOI: 10.1093/bib/bbad250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 05/15/2023] [Accepted: 06/16/2023] [Indexed: 07/05/2023] Open
Abstract
The importance of regulatory features in health and disease is increasing, making it crucial to identify the hallmarks of these features. Self-attention networks (SAN) have given rise to numerous models for the prediction of complex phenomena. But the potential of SANs in biological models was limited because of high memory requirement proportional to input token length and lack of interpretability of self-attention scores. To overcome these constraints, we propose a deep learning model named Interpretable Self-Attention Network for REGulatory interactions (ISANREG) that combines both block self-attention and attention-attribution mechanisms. This model predicts transcription factor-bound motif instances and DNA-mediated TF-TF interactions using self-attention attribution scores derived from the network, overcoming the limitations of previous deep learning models. ISANREG will serve as a framework for other biological models in interpreting the contribution of the input with single-nucleotide resolution.
Collapse
Affiliation(s)
- Anil Prakash
- Human Molecular Genetics Lab, Neurobiology and Genetics Division, Rajiv Gandhi Centre for Biotechnology, Thiruvananthapuram, Kerala, 695014, India
- Department of Biotechnology, University of Kerala, Kariavattom, Thiruvananthapuram, Kerala, India
| | - Moinak Banerjee
- Human Molecular Genetics Lab, Neurobiology and Genetics Division, Rajiv Gandhi Centre for Biotechnology, Thiruvananthapuram, Kerala, 695014, India
| |
Collapse
|
42
|
Flores V, Farabella I, Nir G. Genome-wide tracing to decipher nuclear organization. Curr Opin Cell Biol 2023; 82:102175. [PMID: 37263058 DOI: 10.1016/j.ceb.2023.102175] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 05/10/2023] [Accepted: 05/11/2023] [Indexed: 06/03/2023]
Abstract
Nuclear organization impacts gene expression activity and cell phenotype. Our current understanding is mainly derived from ensemble-level sequencing studies that reflect the 3D genome structure of millions of cells. These approaches have provided invaluable details on the 3D organizations of the genome and their relation to other nuclear landmarks. However, they mostly lack the ability to provide multimodal information simultaneously at the single-cell level. In recent years, cutting-edge imaging technologies have risen to the challenge of simultaneously describing multiple components of the nuclear space at the single-cell level, paving the way for a deeper understanding of the genome structure-function relationship. This review will focus on the development and utilization of such technologies to gain a multi-component view of the nucleus at single-cell resolution, dissecting the complexity and heterogeneity of nuclear organization.
Collapse
Affiliation(s)
- Victoria Flores
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA
| | - Irene Farabella
- Integrative Nuclear Architecture Laboratory, Center for Human Technologies, Istituto Italiano di Tecnologia, Genova, Italy.
| | - Guy Nir
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, USA.
| |
Collapse
|
43
|
Van de Sande B, Lee JS, Mutasa-Gottgens E, Naughton B, Bacon W, Manning J, Wang Y, Pollard J, Mendez M, Hill J, Kumar N, Cao X, Chen X, Khaladkar M, Wen J, Leach A, Ferran E. Applications of single-cell RNA sequencing in drug discovery and development. Nat Rev Drug Discov 2023; 22:496-520. [PMID: 37117846 PMCID: PMC10141847 DOI: 10.1038/s41573-023-00688-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/10/2023] [Indexed: 04/30/2023]
Abstract
Single-cell technologies, particularly single-cell RNA sequencing (scRNA-seq) methods, together with associated computational tools and the growing availability of public data resources, are transforming drug discovery and development. New opportunities are emerging in target identification owing to improved disease understanding through cell subtyping, and highly multiplexed functional genomics screens incorporating scRNA-seq are enhancing target credentialling and prioritization. ScRNA-seq is also aiding the selection of relevant preclinical disease models and providing new insights into drug mechanisms of action. In clinical development, scRNA-seq can inform decision-making via improved biomarker identification for patient stratification and more precise monitoring of drug response and disease progression. Here, we illustrate how scRNA-seq methods are being applied in key steps in drug discovery and development, and discuss ongoing challenges for their implementation in the pharmaceutical industry.
Collapse
Affiliation(s)
| | | | | | - Bart Naughton
- Computational Neurobiology, Eisai, Cambridge, MA, USA
| | - Wendi Bacon
- EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
- The Open University, Milton Keynes, UK
| | | | - Yong Wang
- Precision Bioinformatics, Prometheus Biosciences, San Diego, CA, USA
| | | | - Melissa Mendez
- Genomic Sciences, GlaxoSmithKline, Collegeville, PA, USA
| | - Jon Hill
- Global Computational Biology and Digital Sciences, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, USA
| | - Namit Kumar
- Informatics & Predictive Sciences, Bristol Myers Squibb, San Diego, CA, USA
| | - Xiaohong Cao
- Genomic Research Center, AbbVie Inc., Cambridge, MA, USA
| | - Xiao Chen
- Magnet Biomedicine, Cambridge, MA, USA
| | - Mugdha Khaladkar
- Human Genetics and Computational Biology, GlaxoSmithKline, Collegeville, PA, USA
| | - Ji Wen
- Oncology Research and Development Unit, Pfizer, La Jolla, CA, USA
| | | | | |
Collapse
|
44
|
Ziyani C, Delaneau O, Ribeiro DM. Multimodal single cell analysis infers widespread enhancer co-activity in a lymphoblastoid cell line. Commun Biol 2023; 6:563. [PMID: 37237005 DOI: 10.1038/s42003-023-04954-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 05/18/2023] [Indexed: 05/28/2023] Open
Abstract
Non-coding regulatory elements such as enhancers are key in controlling the cell-type specificity and spatio-temporal expression of genes. To drive stable and precise gene transcription robust to genetic variation and environmental stress, genes are often targeted by multiple enhancers with redundant action. However, it is unknown whether enhancers targeting the same gene display simultaneous activity or whether some enhancer combinations are more often co-active than others. Here, we take advantage of recent developments in single cell technology that permit assessing chromatin status (scATAC-seq) and gene expression (scRNA-seq) in the same single cells to correlate gene expression to the activity of multiple enhancers. Measuring activity patterns across 24,844 human lymphoblastoid single cells, we find that the majority of enhancers associated with the same gene display significant correlation in their chromatin profiles. For 6944 expressed genes associated with enhancers, we predict 89,885 significant enhancer-enhancer associations between nearby enhancers. We find that associated enhancers share similar transcription factor binding profiles and that gene essentiality is linked with higher enhancer co-activity. We provide a set of predicted enhancer-enhancer associations based on correlation derived from a single cell line, which can be further investigated for functional relevance.
Collapse
Affiliation(s)
- Chaymae Ziyani
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Olivier Delaneau
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Diogo M Ribeiro
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
| |
Collapse
|
45
|
Xiong K, Zhang R, Ma J. scGHOST: Identifying single-cell 3D genome subcompartments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.24.542032. [PMID: 37292994 PMCID: PMC10245874 DOI: 10.1101/2023.05.24.542032] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
New single-cell Hi-C (scHi-C) technologies enable probing of the genome-wide cell-to-cell variability in 3D genome organization from individual cells. Several computational methods have been developed to reveal single-cell 3D genome features based on scHi-C data, including A/B compartments, topologically-associating domains, and chromatin loops. However, no scHi-C analysis method currently exists for annotating single-cell subcompartments, which are crucial for providing a more refined view of large-scale chromosome spatial localization in single cells. Here, we present scGhost, a single-cell subcompartment annotation method based on graph embedding with constrained random walk sampling. Applications of scGhost to scHi-C data and single-cell 3D genome imaging data demonstrate the reliable identification of single-cell subcompartments and offer new insights into cell-to-cell variability of nuclear subcompartments. Using scHi-C data from the human prefrontal cortex, scGhost identifies cell type-specific subcompartments that are strongly connected to cell type-specific gene expression, suggesting the functional implications of single-cell subcompartments. Overall, scGhost is an effective new method for single-cell 3D genome subcompartment annotation based on scHi-C data for a broad range of biological contexts.
Collapse
Affiliation(s)
- Kyle Xiong
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ruochi Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
46
|
Gunsalus LM, McArthur E, Gjoni K, Kuang S, Pittman M, Capra JA, Pollard KS. Comparing chromatin contact maps at scale: methods and insights. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.04.535480. [PMID: 37066196 PMCID: PMC10104037 DOI: 10.1101/2023.04.04.535480] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.
Collapse
Affiliation(s)
- Laura M. Gunsalus
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
| | - Evonne McArthur
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN
| | - Ketrin Gjoni
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
| | - Shuzhen Kuang
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
| | - Maureen Pittman
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
| | - John A. Capra
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
| | - Katherine S. Pollard
- Gladstone Institute of Data Science and Biotechnology, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
47
|
Zhan Y, Yildirim A, Boninsegna L, Alber F. Conformational analysis of chromosome structures reveals vital role of chromosome morphology in gene function. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.18.528138. [PMID: 36824908 PMCID: PMC9949133 DOI: 10.1101/2023.02.18.528138] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
The 3D conformations of chromosomes are highly variant and stochastic between single cells. Recent progress in multiplexed 3D FISH imaging, single cell Hi-C and genome structure modeling allows a closer analysis of the structural variations of chromosomes between cells to infer the functional implications of structural heterogeneity. Here, we introduce a two-step dimensionality reduction method to classify a population of single cell 3D chromosome structures, either from simulation or imaging experiment, into dominant conformational clusters with distinct chromosome morphologies. We found that almost half of all structures for each chromosome can be described by 5-10 dominant chromosome morphologies, which play a fundamental role in establishing conformational variation of chromosomes. These morphologies are conserved in different cell types, but vary in their relative proportion of structures. Chromosome morphologies are distinguished by the presence or absence of characteristic chromosome territory domains, which expose some chromosomal regions to varying nuclear environments in different morphologies, such as nuclear positions and associations to nuclear speckles, lamina, and nucleoli. These observations point to distinct functional variations for the same chromosomal region in different chromosome morphologies. We validated chromosome conformational clusters and their associated subnuclear locations with data from DNA-MERFISH imaging and single cell sci-HiC data. Our method provides an important approach to assess the variation of chromosome structures between cells and link differences in conformational states with distinct gene functions.
Collapse
|
48
|
van Mierlo G, Pushkarev O, Kribelbauer JF, Deplancke B. Chromatin modules and their implication in genomic organization and gene regulation. Trends Genet 2023; 39:140-153. [PMID: 36549923 DOI: 10.1016/j.tig.2022.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/04/2022] [Accepted: 11/27/2022] [Indexed: 12/24/2022]
Abstract
Regulation of gene expression is a complex but highly guided process. While genomic technologies and computational approaches have allowed high-throughput mapping of cis-regulatory elements (CREs) and their interactions in 3D, their precise role in regulating gene expression remains obscure. Recent complementary observations revealed that interactions between CREs frequently result in the formation of small-scale functional modules within topologically associating domains. Such chromatin modules likely emerge from a complex interplay between regulatory machineries assembled at CREs, including site-specific binding of transcription factors. Here, we review the methods that allow identifying chromatin modules, summarize possible mechanisms that steer CRE interactions within these modules, and discuss outstanding challenges to uncover how chromatin modules fit in our current understanding of the functional 3D genome.
Collapse
Affiliation(s)
- Guido van Mierlo
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Olga Pushkarev
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Judith F Kribelbauer
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
| |
Collapse
|
49
|
Park K, Keleş S. Joint tensor modeling of single cell 3D genome and epigenetic data with Muscle. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.27.525871. [PMID: 36747701 PMCID: PMC9900892 DOI: 10.1101/2023.01.27.525871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Emerging single cell technologies that simultaneously capture long-range interactions of genomic loci together with their DNA methylation levels are advancing our understanding of three-dimensional genome structure and its interplay with the epigenome at the single cell level. While methods to analyze data from single cell high throughput chromatin conformation capture (scHi-C) experiments are maturing, methods that can jointly analyze multiple single cell modalities with scHi-C data are lacking. Here, we introduce Muscle, a semi-nonnegative joint decomposition of Multiple single cell tensors, to jointly analyze 3D conformation and DNA methylation data at the single cell level. Muscle takes advantage of the inherent tensor structure of the scHi-C data, and integrates this modality with DNA methylation. We developed an alternating least squares algorithm for estimating Muscle parameters and established its optimality properties. Parameters estimated by Muscle directly align with the key components of the downstream analysis of scHi-C data in a cell type specific manner. Evaluations with data-driven experiments and simulations demonstrate the advantages of the joint modeling framework of Muscle over single modality modeling or a baseline multi modality modeling for cell type delineation and elucidating associations between modalities. Muscle is publicly available at https://github.com/keleslab/muscle.
Collapse
Affiliation(s)
- Kwangmoon Park
- Department of Statistics, University of Wisconsin, Madison, WI, USA, 53706
| | - Sündüz Keleş
- Department of Statistics, University of Wisconsin, Madison, WI, USA, 53706
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA, 53726
| |
Collapse
|
50
|
Liu Q, Zeng W, Zhang W, Wang S, Chen H, Jiang R, Zhou M, Zhang S. Deep generative modeling and clustering of single cell Hi-C data. Brief Bioinform 2023; 24:6858951. [PMID: 36458445 DOI: 10.1093/bib/bbac494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 09/28/2022] [Accepted: 10/18/2022] [Indexed: 12/05/2022] Open
Abstract
Deciphering 3D genome conformation is important for understanding gene regulation and cellular function at a spatial level. The recent advances of single cell Hi-C technologies have enabled the profiling of the 3D architecture of DNA within individual cell, which allows us to study the cell-to-cell variability of 3D chromatin organization. Computational approaches are in urgent need to comprehensively analyze the sparse and heterogeneous single cell Hi-C data. Here, we proposed scDEC-Hi-C, a new framework for single cell Hi-C analysis with deep generative neural networks. scDEC-Hi-C outperforms existing methods in terms of single cell Hi-C data clustering and imputation. Moreover, the generative power of scDEC-Hi-C could help unveil the differences of chromatin architecture across cell types. We expect that scDEC-Hi-C could shed light on deepening our understanding of the complex mechanism underlying the formation of chromatin contacts.
Collapse
Affiliation(s)
- Qiao Liu
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Wanwen Zeng
- College of Software, Nankai University, Tianjin 300071, China
| | - Wei Zhang
- Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| | - Sicheng Wang
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Hongyang Chen
- The Research Center for Intelligent Network, Zhejiang Lab, Hangzhou 311121, China
| | - Rui Jiang
- Ministry of Education Key Laboratory of Bioinformatics, Research Department of Bioinformatics at the Beijing National Research Center for Information Science and Technology, Center for Synthetic and Systems Biology, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Mu Zhou
- SenseBrain Research, San Jose, CA 95131, USA
| | - Shaoting Zhang
- Shanghai Artificial Intelligence Laboratory, Shanghai 200240, China
| |
Collapse
|