1
|
Banecki K, Korsak S, Plewczynski D. Advancements and future directions in single-cell Hi-C based 3D chromatin modeling. Comput Struct Biotechnol J 2024; 23:3549-3558. [PMID: 39963420 PMCID: PMC11832020 DOI: 10.1016/j.csbj.2024.09.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 09/27/2024] [Accepted: 09/29/2024] [Indexed: 02/20/2025] Open
Abstract
Single-cell Hi-C data provides valuable insights into the three-dimensional organization of chromatin within individual cells, yet modeling this data poses significant challenges due to its inherent sparsity and variability. This review comprehensively explores the predominant approaches to reconstructing 3D chromatin structures from single-cell Hi-C data, positioning these methods within the broader contexts of single-cell Hi-C research and bulk Hi-C data modeling. We categorize the modeling strategies based on their objective functions, which are framed in terms of force fields, potentials, cost functions, or likelihood probabilities. Despite their diverse methodologies, these approaches exhibit deep underlying similarities. We further dissect the basic components of these models, such as attractive restraint forces and repulsive forces, and discuss additional terms like fluid viscosity and variation penalties. The review also critically evaluates the current state of model validation, highlighting the inconsistencies across various studies and emphasizing the need for a comprehensive validation framework. We detail common validation techniques, including the comparison of distance matrices and the assessment of contact violations. We argue that the future of single-cell Hi-C modeling lies in integrating multiple data modalities and incorporating cell cycle trajectory information. Such integration could significantly advance our understanding of chromatin conformation dynamics during cell cycle progression and cell differentiation. We also foresee the continued growth of optimization-based and molecular dynamics approaches, supported by general molecular dynamics toolkits.
Collapse
Affiliation(s)
- Krzysztof Banecki
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Sevastianos Korsak
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Dariusz Plewczynski
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| |
Collapse
|
2
|
Jiang S, Cai Z, Wang Y, Zeng C, Zhang J, Yu W, Su C, Zhao S, Chen Y, Shen Y, Ma Y, Cai Y, Dai J. High plasticity of ribosomal DNA organization in budding yeast. Cell Rep 2024; 43:113742. [PMID: 38324449 DOI: 10.1016/j.celrep.2024.113742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 12/12/2023] [Accepted: 01/19/2024] [Indexed: 02/09/2024] Open
Abstract
In eukaryotic genomes, rDNA generally resides as a highly repetitive and dynamic structure, making it difficult to study. Here, a synthetic rDNA array on chromosome III in budding yeast was constructed to serve as the sole source of rRNA. Utilizing the loxPsym site within each rDNA repeat and the Cre recombinase, we were able to reduce the copy number to as few as eight copies. Additionally, we constructed strains with two or three rDNA arrays and found that the presence of multiple arrays did not affect the formation of a single nucleolus. Although alteration of the position and number of rDNA arrays did impact the three-dimensional genome structure, the additional rDNA arrays had no deleterious influence on cell growth or transcriptomes. Overall, this study sheds light on the high plasticity of rDNA organization and opens up opportunities for future rDNA engineering.
Collapse
Affiliation(s)
- Shuangying Jiang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| | - Zelin Cai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yun Wang
- BGI Research, BGI, Shenzhen 518083, China; Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Cheng Zeng
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Jiaying Zhang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; School of Mechanical Engineering and Automation, Beihang University, Beijing 100191, China
| | - Wenfei Yu
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chenghao Su
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Shijun Zhao
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ying Chen
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; BGI Research, BGI, Shenzhen 518083, China
| | - Yue Shen
- BGI Research, BGI, Shenzhen 518083, China; Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Yingxin Ma
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Yizhi Cai
- Manchester Institute of Biotechnology, University of Manchester, Manchester M1 7DN, UK.
| | - Junbiao Dai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China; College of Life Sciences and Oceanography, Shenzhen University, 1066 Xueyuan Road, Shenzhen 518055, China.
| |
Collapse
|
3
|
Poinsignon T, Gallopin M, Grognet P, Malagnac F, Lelandais G, Poulain P. 3D models of fungal chromosomes to enhance visual integration of omics data. NAR Genom Bioinform 2023; 5:lqad104. [PMID: 38058589 PMCID: PMC10696920 DOI: 10.1093/nargab/lqad104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 10/11/2023] [Accepted: 11/17/2023] [Indexed: 12/08/2023] Open
Abstract
The functions of eukaryotic chromosomes and their spatial architecture in the nucleus are reciprocally dependent. Hi-C experiments are routinely used to study chromosome 3D organization by probing chromatin interactions. Standard representation of the data has relied on contact maps that show the frequency of interactions between parts of the genome. In parallel, it has become easier to build 3D models of the entire genome based on the same Hi-C data, and thus benefit from the methodology and visualization tools developed for structural biology. 3D modeling of entire genomes leverages the understanding of their spatial organization. However, this opportunity for original and insightful modeling is underexploited. In this paper, we show how seeing the spatial organization of chromosomes can bring new perspectives to omics data integration. We assembled state-of-the-art tools into a workflow that goes from Hi-C raw data to fully annotated 3D models and we re-analysed public omics datasets available for three fungal species. Besides the well-described properties of the spatial organization of their chromosomes (Rabl conformation, hypercoiling and chromosome territories), our results highlighted (i) in Saccharomyces cerevisiae, the backbones of the cohesin anchor regions, which were aligned all along the chromosomes, (ii) in Schizosaccharomyces pombe, the oscillations of the coiling of chromosome arms throughout the cell cycle and (iii) in Neurospora crassa, the massive relocalization of histone marks in mutants of heterochromatin regulators. 3D modeling of the chromosomes brings new opportunities for visual integration of omics data. This holistic perspective supports intuition and lays the foundation for building new concepts.
Collapse
Affiliation(s)
- Thibault Poinsignon
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Mélina Gallopin
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Pierre Grognet
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Fabienne Malagnac
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Gaëlle Lelandais
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Pierre Poulain
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| |
Collapse
|
4
|
Shen Y, Gao F, Wang Y, Wang Y, Zheng J, Gong J, Zhang J, Luo Z, Schindler D, Deng Y, Ding W, Lin T, Swidah R, Zhao H, Jiang S, Zeng C, Chen S, Chen T, Wang Y, Luo Y, Mitchell L, Bader JS, Zhang G, Shen X, Wang J, Fu X, Dai J, Boeke JD, Yang H, Xu X, Cai Y. Dissecting aneuploidy phenotypes by constructing Sc2.0 chromosome VII and SCRaMbLEing synthetic disomic yeast. CELL GENOMICS 2023; 3:100364. [PMID: 38020968 PMCID: PMC10667312 DOI: 10.1016/j.xgen.2023.100364] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 04/03/2023] [Accepted: 07/06/2023] [Indexed: 12/01/2023]
Abstract
Aneuploidy compromises genomic stability, often leading to embryo inviability, and is frequently associated with tumorigenesis and aging. Different aneuploid chromosome stoichiometries lead to distinct transcriptomic and phenotypic changes, making it helpful to study aneuploidy in tightly controlled genetic backgrounds. By deploying the engineered SCRaMbLE (synthetic chromosome rearrangement and modification by loxP-mediated evolution) system to the newly synthesized megabase Sc2.0 chromosome VII (synVII), we constructed a synthetic disomic yeast and screened hundreds of SCRaMbLEd derivatives with diverse chromosomal rearrangements. Phenotypic characterization and multi-omics analysis revealed that fitness defects associated with aneuploidy could be restored by (1) removing most of the chromosome content or (2) modifying specific regions in the duplicated chromosome. These findings indicate that both chromosome copy number and specific chromosomal regions contribute to the aneuploidy-related phenotypes, and the synthetic chromosome resource opens new paradigms in studying aneuploidy.
Collapse
Affiliation(s)
- Yue Shen
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Feng Gao
- BGI Research, Shenzhen 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Yun Wang
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
- University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Yuerong Wang
- BGI Research, Shenzhen 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ju Zheng
- BGI Research, Shenzhen 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | | | | | - Zhouqing Luo
- Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Key Laboratory of Synthetic Genomics, Center for Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen 361102, China
| | - Daniel Schindler
- Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Strasse 10, 35043 Marburg, Germany
| | - Yang Deng
- BGI Research, Shenzhen 518083, China
| | - Weichao Ding
- BGI Research, Shenzhen 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tao Lin
- BGI Research, Shenzhen 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Reem Swidah
- Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
| | - Hongcui Zhao
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Shuangying Jiang
- Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Key Laboratory of Synthetic Genomics, Center for Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China
| | - Cheng Zeng
- Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Key Laboratory of Synthetic Genomics, Center for Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China
| | | | - Tai Chen
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Yong Wang
- BGI Research, Shenzhen 518083, China
| | - Yisha Luo
- Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
| | - Leslie Mitchell
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - Joel S. Bader
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Guojie Zhang
- University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Xia Shen
- Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou, China
- Center for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Jian Wang
- BGI Research, Shenzhen 518083, China
| | - Xian Fu
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Junbiao Dai
- Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Key Laboratory of Synthetic Genomics, Center for Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China
| | - Jef D. Boeke
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY 11201, USA
| | | | - Xun Xu
- BGI Research, Shenzhen 518083, China
- BGI Research, Changzhou 213299, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China
| | - Yizhi Cai
- Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
| |
Collapse
|
5
|
Li FZ, Zhang XF, Cai HY, Ran LQ, Zhou HY, Liu ZE. Chromosome Three-Dimensional Structure Reconstruction: An Iterative ShRec3D Algorithm. J Comput Biol 2023; 30:575-587. [PMID: 36847350 DOI: 10.1089/cmb.2022.0179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023] Open
Abstract
The three-dimensional (3D) structure of chromosomes is of great significance to ensure that the genome performs various functions (e.g., gene expression) correctly and replicates and separates correctly in mitosis. Since the emergence of Hi-C in 2009, a new experimental technique in molecular biology, researchers have been paying more and more attention to the reconstruction of chromosome 3D structure. To reconstruct the 3D structure of chromosomes based on Hi-C experimental data, many algorithms have been proposed, among which ShRec3D is one of the most outstanding. In this article, an iterative ShRec3D algorithm is presented to greatly improve the native ShRec3D algorithm. Experimental results show that our algorithm can significantly promote the performance of ShRec3D, and this improvement is applicable to almost all data noise range and signal coverage range, so it is universal.
Collapse
Affiliation(s)
- Fang-Zhen Li
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China
| | - Xue-Fen Zhang
- College of Smart City, Beijing Union University, Beijing, China
| | - Hui-Ying Cai
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China
| | - Ling-Qiang Ran
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China
| | - Hai-Yan Zhou
- Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, China
| | - Zhi-E Liu
- College of Physics and Electronic Engineering, Qilu Normal University, Jinan, China
| |
Collapse
|
6
|
Varoquaux N, Noble WS, Vert JP. Inference of 3D genome architecture by modeling overdispersion of Hi-C data. Bioinformatics 2023; 39:btac838. [PMID: 36594573 PMCID: PMC9857972 DOI: 10.1093/bioinformatics/btac838] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 11/16/2022] [Indexed: 01/04/2023] Open
Abstract
MOTIVATION We address the challenge of inferring a consensus 3D model of genome architecture from Hi-C data. Existing approaches most often rely on a two-step algorithm: first, convert the contact counts into distances, then optimize an objective function akin to multidimensional scaling (MDS) to infer a 3D model. Other approaches use a maximum likelihood approach, modeling the contact counts between two loci as a Poisson random variable whose intensity is a decreasing function of the distance between them. However, a Poisson model of contact counts implies that the variance of the data is equal to the mean, a relationship that is often too restrictive to properly model count data. RESULTS We first confirm the presence of overdispersion in several real Hi-C datasets, and we show that the overdispersion arises even in simulated datasets. We then propose a new model, called Pastis-NB, where we replace the Poisson model of contact counts by a negative binomial one, which is parametrized by a mean and a separate dispersion parameter. The dispersion parameter allows the variance to be adjusted independently from the mean, thus better modeling overdispersed data. We compare the results of Pastis-NB to those of several previously published algorithms, both MDS-based and statistical methods. We show that the negative binomial inference yields more accurate structures on simulated data, and more robust structures than other models across real Hi-C replicates and across different resolutions. AVAILABILITY AND IMPLEMENTATION A Python implementation of Pastis-NB is available at https://github.com/hiclib/pastis under the BSD license. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nelle Varoquaux
- TIMC, Université Grenoble Alpes, CNRS, Grenoble INP, Grenoble 38000, France
| | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98195, USA
| | - Jean-Philippe Vert
- Brain Team, Google Research, Paris 75009, France
- Centre for Computational Biology , MINES ParisTech, PSL University, Paris 75006, France
| |
Collapse
|
7
|
Zhang H, Fu X, Gong X, Wang Y, Zhang H, Zhao Y, Shen Y. Systematic dissection of key factors governing recombination outcomes by GCE-SCRaMbLE. Nat Commun 2022; 13:5836. [PMID: 36192484 PMCID: PMC9530153 DOI: 10.1038/s41467-022-33606-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 09/26/2022] [Indexed: 11/08/2022] Open
Abstract
With the completion of Sc2.0 chromosomes, synthetic chromosome rearrangement and modification by loxP-mediated evolution (SCRaMbLE) becomes more critical for in-depth investigation of fundamental biological questions and screening of industrially valuable characteristics. Further applications, however, are hindered due to the lack of facile and tight regulation of the SCRaMbLE process, and limited understanding of key factors that may affect the rearrangement outcomes. Here we propose an approach to precisely regulate SCRaMbLE recombination in a dose-dependent manner using genetic code expansion (GCE) technology with low basal activity. By systematically analyzing 1380 derived strains and six yeast pools subjected to GCE-SCRaMbLE, we find that Cre enzyme abundance, genome ploidy and chromosome conformation play key roles in recombination frequencies and determine the SCRaMbLE outcomes. With these insights, the GCE-SCRaMbLE system will serve as a powerful tool in the future exploitation and optimization of the Sc2.0-related technologies.
Collapse
Affiliation(s)
- Huiming Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China
| | - Xian Fu
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China.
- BGI Research-Changzhou, BGI, Changzhou, 213000, China.
| | - Xuemei Gong
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China
| | - Yun Wang
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China
- BGI Research-Changzhou, BGI, Changzhou, 213000, China
| | - Haolin Zhang
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China
| | - Yu Zhao
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY, 10016, USA
| | - Yue Shen
- BGI Research-Shenzhen, BGI, Shenzhen, 518083, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, 518120, China.
- BGI Research-Changzhou, BGI, Changzhou, 213000, China.
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| |
Collapse
|
8
|
MacKay K, Kusalik A. Computational methods for predicting 3D genomic organization from high-resolution chromosome conformation capture data. Brief Funct Genomics 2021; 19:292-308. [PMID: 32353112 PMCID: PMC7388788 DOI: 10.1093/bfgp/elaa004] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 01/30/2020] [Accepted: 02/07/2020] [Indexed: 12/19/2022] Open
Abstract
The advent of high-resolution chromosome conformation capture assays (such as 5C, Hi-C and Pore-C) has allowed for unprecedented sequence-level investigations into the structure-function relationship of the genome. In order to comprehensively understand this relationship, computational tools are required that utilize data generated from these assays to predict 3D genome organization (the 3D genome reconstruction problem). Many computational tools have been developed that answer this need, but a comprehensive comparison of their underlying algorithmic approaches has not been conducted. This manuscript provides a comprehensive review of the existing computational tools (from November 2006 to September 2019, inclusive) that can be used to predict 3D genome organizations from high-resolution chromosome conformation capture data. Overall, existing tools were found to use a relatively small set of algorithms from one or more of the following categories: dimensionality reduction, graph/network theory, maximum likelihood estimation (MLE) and statistical modeling. Solutions in each category are far from maturity, and the breadth and depth of various algorithmic categories have not been fully explored. While the tools for predicting 3D structure for a genomic region or single chromosome are diverse, there is a general lack of algorithmic diversity among computational tools for predicting the complete 3D genome organization from high-resolution chromosome conformation capture data.
Collapse
|
9
|
Zha M, Wang N, Zhang C, Wang Z. Inferring Single-Cell 3D Chromosomal Structures Based on the Lennard-Jones Potential. Int J Mol Sci 2021; 22:ijms22115914. [PMID: 34072879 PMCID: PMC8199262 DOI: 10.3390/ijms22115914] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 05/23/2021] [Accepted: 05/28/2021] [Indexed: 11/16/2022] Open
Abstract
Reconstructing three-dimensional (3D) chromosomal structures based on single-cell Hi-C data is a challenging scientific problem due to the extreme sparseness of the single-cell Hi-C data. In this research, we used the Lennard-Jones potential to reconstruct both 500 kb and high-resolution 50 kb chromosomal structures based on single-cell Hi-C data. A chromosome was represented by a string of 500 kb or 50 kb DNA beads and put into a 3D cubic lattice for simulations. A 2D Gaussian function was used to impute the sparse single-cell Hi-C contact matrices. We designed a novel loss function based on the Lennard-Jones potential, in which the ε value, i.e., the well depth, was used to indicate how stable the binding of every pair of beads is. For the bead pairs that have single-cell Hi-C contacts and their neighboring bead pairs, the loss function assigns them stronger binding stability. The Metropolis-Hastings algorithm was used to try different locations for the DNA beads, and simulated annealing was used to optimize the loss function. We proved the correctness and validness of the reconstructed 3D structures by evaluating the models according to multiple criteria and comparing the models with 3D-FISH data.
Collapse
Affiliation(s)
- Mengsheng Zha
- School of Computing Sciences and Computer Engineering, University of Southern Mississippi, 118 College Dr, Hattiesburg, MS 39406, USA; (M.Z.); (C.Z.)
| | - Nan Wang
- Department of Computer Science, New Jersey City University, 2039 Kennedy Blvd, Jersey City, NJ 07305, USA;
| | - Chaoyang Zhang
- School of Computing Sciences and Computer Engineering, University of Southern Mississippi, 118 College Dr, Hattiesburg, MS 39406, USA; (M.Z.); (C.Z.)
| | - Zheng Wang
- Department of Computer Science, University of Miami, 1364 Memorial Drive, Coral Gables, FL 33124, USA
- Correspondence:
| |
Collapse
|
10
|
Gong H, Yang Y, Zhang S, Li M, Zhang X. Application of Hi-C and other omics data analysis in human cancer and cell differentiation research. Comput Struct Biotechnol J 2021; 19:2070-2083. [PMID: 33995903 PMCID: PMC8086027 DOI: 10.1016/j.csbj.2021.04.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 04/04/2021] [Accepted: 04/04/2021] [Indexed: 02/07/2023] Open
Abstract
With the development of 3C (chromosome conformation capture) and its derivative technology Hi-C (High-throughput chromosome conformation capture) research, the study of the spatial structure of the genomic sequence in the nucleus helps researchers understand the functions of biological processes such as gene transcription, replication, repair, and regulation. In this paper, we first introduce the research background and purpose of Hi-C data visualization analysis. After that, we discuss the Hi-C data analysis methods from genome 3D structure, A/B compartment, TADs (topologically associated domain), and loop detection. We also discuss how to apply genome visualization technologies to the identification of chromosome feature structures. We continue with a review of correlation analysis differences among multi-omics data, and how to apply Hi-C and other omics data analysis into cancer and cell differentiation research. Finally, we summarize the various problems in joint analyses based on Hi-C and other multi-omics data. We believe this review can help researchers better understand the progress and applications of 3D genome technology.
Collapse
Affiliation(s)
- Haiyan Gong
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, China
- Shunde Graduate School of University of Science and Technology Beijing, Foshan 528000, China
| | - Yi Yang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Sichen Zhang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Minghong Li
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Xiaotong Zhang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, China
- Shunde Graduate School of University of Science and Technology Beijing, Foshan 528000, China
| |
Collapse
|
11
|
Guarnera E, Tan ZW, Berezovsky IN. Three-dimensional chromatin ensemble reconstruction via stochastic embedding. Structure 2021; 29:622-634.e3. [PMID: 33567266 DOI: 10.1016/j.str.2021.01.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 11/17/2020] [Accepted: 01/13/2021] [Indexed: 01/04/2023]
Abstract
We propose a comprehensive method for reconstructing the whole-genome chromatin ensemble from the Hi-C data. The procedure starts from Markov state modeling (MSM), delineating the structural hierarchy of chromatin organization with partitioning and effective interactions archetypal for corresponding levels of hierarchy. The stochastic embedding procedure introduced in this work provides the 3D ensemble reconstruction, using effective interactions obtained by the MSM as the input. As a result, we obtain the structural ensemble of a genome, allowing one to model the functional and the cell-type variability in the chromatin structure. The whole-genome reconstructions performed on the human B lymphoblastoid (GM12878) and lung fibroblast (IMR90) Hi-C data unravel distinctions in their morphologies and in the spatial arrangement of intermingling chromosomal territories, paving the way to studies of chromatin dynamics, developmental changes, and conformational transitions taking place in normal cells and during potential pathological developments.
Collapse
Affiliation(s)
- Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A(∗)STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671, Singapore
| | - Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A(∗)STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671, Singapore
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A(∗)STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, Singapore 117597, Singapore.
| |
Collapse
|
12
|
Meluzzi D, Arya G. Computational approaches for inferring 3D conformations of chromatin from chromosome conformation capture data. Methods 2020; 181-182:24-34. [PMID: 31470090 PMCID: PMC7044057 DOI: 10.1016/j.ymeth.2019.08.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 06/24/2019] [Accepted: 08/23/2019] [Indexed: 02/08/2023] Open
Abstract
Chromosome conformation capture (3C) and its variants are powerful experimental techniques for probing intra- and inter-chromosomal interactions within cell nuclei at high resolution and in a high-throughput, quantitative manner. The contact maps derived from such experiments provide an avenue for inferring the 3D spatial organization of the genome. This review provides an overview of the various computational methods developed in the past decade for addressing the very important but challenging problem of deducing the detailed 3D structure or structure population of chromosomal domains, chromosomes, and even entire genomes from 3C contact maps.
Collapse
Affiliation(s)
- Dario Meluzzi
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, United States
| | - Gaurav Arya
- Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, United States.
| |
Collapse
|
13
|
Li FZ, Liu ZE, Li XY, Bu LM, Bu HX, Liu H, Zhang CM. Chromatin 3D structure reconstruction with consideration of adjacency relationship among genomic loci. BMC Bioinformatics 2020; 21:272. [PMID: 32611376 PMCID: PMC7329537 DOI: 10.1186/s12859-020-03612-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 06/18/2020] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Chromatin 3D conformation plays important roles in regulating gene or protein functions. High-throughout chromosome conformation capture (3C)-based technologies, such as Hi-C, have been exploited to acquire the contact frequencies among genomic loci at genome-scale. Various computational tools have been proposed to recover the underlying chromatin 3D structures from in situ Hi-C contact map data. As connected residuals in a polymer, neighboring genomic loci have intrinsic mutual dependencies in building a 3D conformation. However, current methods seldom take this feature into account. RESULTS We present a method called ShNeigh, which combines the classical MDS technique with local dependence of neighboring loci modeled by a Gaussian formula, to infer the best 3D structure from noisy and incomplete contact frequency matrices. We validated ShNeigh by comparing it to two typical distance-based algorithms, ShRec3D and ChromSDE. The comparison results on simulated Hi-C dataset showed that, while keeping the high-speed nature of classical MDS, ShNeigh can recover the true structure better than ShRec3D and ChromSDE. Meanwhile, ShNeigh is more robust to data noise. On the publicly available human GM06990 Hi-C data, we demonstrated that the structures reconstructed by ShNeigh are more reproducible between different restriction enzymes than by ShRec3D and ChromSDE, especially at high resolutions manifested by sparse contact maps, which means ShNeigh is more robust to signal coverage. CONCLUSIONS Our method can recover stable structures in high noise and sparse signal settings. It can also reconstruct similar structures from Hi-C data obtained using different restriction enzymes. Therefore, our method provides a new direction for enhancing the reconstruction quality of chromatin 3D structures.
Collapse
Affiliation(s)
- Fang-Zhen Li
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China. .,Key Laboratory of Machine Learning and Financial Data Mining in Universities of Shandong, Jinan, China.
| | - Zhi-E Liu
- College of Physics and Electronic Engineering, Qilu Normal University, Jinan, China
| | - Xiu-Yuan Li
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China.,Key Laboratory of Machine Learning and Financial Data Mining in Universities of Shandong, Jinan, China
| | - Li-Mei Bu
- Department of Gastroenterology, Shanghai Pudong Hospital, Fudan University Pudong Medical Center, Shanghai, China
| | - Hong-Xia Bu
- Key Laboratory of Machine Learning and Financial Data Mining in Universities of Shandong, Jinan, China
| | - Hui Liu
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China.,Digital Media Technology Key Lab of Shandong Province, Jinan, China
| | - Cai-Ming Zhang
- School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China.,Digital Media Technology Key Lab of Shandong Province, Jinan, China
| |
Collapse
|
14
|
Qian M, Cheng Y, Wang X. The methodology study of three-dimensional (3D) genome research. Semin Cell Dev Biol 2019; 90:12-18. [DOI: 10.1016/j.semcdb.2018.07.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 07/03/2018] [Indexed: 12/12/2022]
|
15
|
Oluwadare O, Highsmith M, Cheng J. An Overview of Methods for Reconstructing 3-D Chromosome and Genome Structures from Hi-C Data. Biol Proced Online 2019; 21:7. [PMID: 31049033 PMCID: PMC6482566 DOI: 10.1186/s12575-019-0094-0] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 04/01/2019] [Indexed: 01/08/2023] Open
Abstract
Over the past decade, methods for predicting three-dimensional (3-D) chromosome and genome structures have proliferated. This has been primarily due to the development of high-throughput, next-generation chromosome conformation capture (3C) technologies, which have provided next-generation sequencing data about chromosome conformations in order to map the 3-D genome structure. The introduction of the Hi-C technique-a variant of the 3C method-has allowed researchers to extract the interaction frequency (IF) for all loci of a genome at high-throughput and at a genome-wide scale. In this review we describe, categorize, and compare the various methods developed to map chromosome and genome structures from 3C data-particularly Hi-C data. We summarize the improvements introduced by these methods, describe the approach used for method evaluation, and discuss how these advancements shape the future of genome structure construction.
Collapse
Affiliation(s)
- Oluwatosin Oluwadare
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Max Highsmith
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
- Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| |
Collapse
|