1
Junier I, Ghobadpour E, Espeli O, Everaers R. DNA supercoiling in bacteria: state of play and challenges from a viewpoint of physics based modeling. Front Microbiol 2023; 14:1192831. [PMID: 37965550 PMCID: PMC10642903 DOI: 10.3389/fmicb.2023.1192831] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 03/23/2023] [Accepted: 09/25/2023] [Indexed: 11/16/2023]
Abstract
DNA supercoiling is central to many fundamental processes of living organisms. Its average level along the chromosome and over time reflects the dynamic equilibrium of opposite activities of topoisomerases, which are required to relax mechanical stresses that are inevitably produced during DNA replication and gene transcription. Supercoiling affects all scales of the spatio-temporal organization of bacterial DNA, from the base pair to the large scale chromosome conformation. Supercoiling was highlighted in vitro and in vivo in the 1960s and 1970s, respectively, and the first physical models were proposed concomitantly in order to predict the deformation properties of the double helix. About fifteen years later, polymer physics models demonstrated on larger scales the plectonemic nature and the tree-like organization of supercoiled DNA. Since then, many works have tried to establish a better understanding of the multiple structuring and physiological properties of bacterial DNA in thermodynamic equilibrium and far from equilibrium. The purpose of this essay is to address upcoming challenges by thoroughly exploring the relevance, predictive capacity, and limitations of current physical models, with a specific focus on structural properties beyond the scale of the double helix. We discuss more particularly the problem of DNA conformations, the interplay of DNA supercoiling with gene transcription and DNA replication, its role in nucleoid formation and, finally, the problem of scaling up models. Our primary objective is to foster increased collaboration between physicists and biologists. To achieve this, we have reduced the respective jargon to a minimum and we provide some explanatory background material for the two communities.
Affiliation(s)
- Ivan Junier
  - CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, Université Grenoble Alpes, Grenoble, France
- Elham Ghobadpour
  - CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, Université Grenoble Alpes, Grenoble, France
  - École Normale Supérieure (ENS) de Lyon, CNRS, Laboratoire de Physique and Centre Blaise Pascal de l'ENS de Lyon, Lyon, France
- Olivier Espeli
  - Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, Université PSL, Paris, France
- Ralf Everaers
  - École Normale Supérieure (ENS) de Lyon, CNRS, Laboratoire de Physique and Centre Blaise Pascal de l'ENS de Lyon, Lyon, France
2
Gilbert BR, Thornburg ZR, Brier TA, Stevens JA, Grünewald F, Stone JE, Marrink SJ, Luthey-Schulten Z. Dynamics of chromosome organization in a minimal bacterial cell. Front Cell Dev Biol 2023; 11:1214962. [PMID: 37621774 PMCID: PMC10445541 DOI: 10.3389/fcell.2023.1214962] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 04/30/2023] [Accepted: 07/10/2023] [Indexed: 08/26/2023]
Abstract
Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication and inheritance of genetic material. By creating a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics, we investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell-cycle. To achieve cell-scale chromosome structures that are realistic, we model the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. In addition, the conformations of the circular DNA must avoid overlapping with ribosomes identified in cryo-electron tomograms. While Syn3A lacks the complex regulatory systems known to orchestrate chromosome segregation in other bacteria, its minimized genome retains essential loop-extruding structural maintenance of chromosomes (SMC) protein complexes (SMC-scpAB) and topoisomerases. Through implementing the effects of these proteins in our simulations of replicating chromosomes, we find that they alone are sufficient for simultaneous chromosome segregation across all generations within nested theta structures. This supports previous studies suggesting loop-extrusion serves as a near-universal mechanism for chromosome organization within bacterial and eukaryotic cells. Furthermore, we analyze ribosome diffusion under the influence of the chromosome and calculate in silico chromosome contact maps that capture inter-daughter interactions. Finally, we present a methodology to map the polymer model of the chromosome to a Martini coarse-grained representation to prepare molecular dynamics models of entire Syn3A cells, which serves as an ultimate means of validation for cell states predicted by the WCM.
Affiliation(s)
- Benjamin R. Gilbert
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Zane R. Thornburg
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Troy A. Brier
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Jan A. Stevens
  - Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
- Fabian Grünewald
  - Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
- John E. Stone
  - NVIDIA Corporation, Santa Clara, CA, United States
  - NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Siewert J. Marrink
  - Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
- Zaida Luthey-Schulten
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
  - NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, United States
  - NSF Center for the Physics of Living Cells, Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, United States
3
Rowland B, Huh R, Hou Z, Crowley C, Wen J, Shen Y, Hu M, Giusti-Rodríguez P, Sullivan PF, Li Y. THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data. PLoS Genet 2022; 18:e1010102. [PMID: 35259165 PMCID: PMC8932604 DOI: 10.1371/journal.pgen.1010102] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 09/01/2021] [Revised: 03/18/2022] [Accepted: 02/14/2022] [Indexed: 11/30/2022]
Abstract
Hi-C data provide population-averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. Effective analysis of Hi-C data entails controlling for the potential confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive simulations to test THUNDER based on combining two published single-cell Hi-C (scHi-C) datasets. THUNDER more accurately estimates the underlying cell type proportions compared to reference-free methods (e.g., TOAST and NMF) and is more robust than reference-dependent methods (e.g., MuSiC). We further demonstrate the practical utility of THUNDER to estimate cell type proportions and identify cell-type-specific interactions in Hi-C data from adult human cortex tissue samples. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER-estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data are still rare.
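To make the reference-free idea concrete, here is a minimal sketch of plain NMF deconvolution, one of the baselines named in the abstract. It is not THUNDER itself, whose two steps are not described here. The function name `nmf_proportions`, the toy data, and the row rescaling used to read proportions off the factor matrix are all assumptions made for illustration.

```python
import numpy as np

def nmf_proportions(V, k, n_iter=500, seed=0, eps=1e-12):
    """Factorize V (samples x contact features) as W @ H with W, H >= 0.

    Rows of W are then rescaled to sum to 1 and read as cell type
    proportions. This rescaling is a heuristic (assumption), so the
    returned proportions times H no longer reproduce V exactly.
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        # Lee-Seung multiplicative updates for the Frobenius loss
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W / W.sum(axis=1, keepdims=True), H

# Toy bulk data: 8 samples mixing 2 hypothetical cell type profiles
rng = np.random.default_rng(1)
true_profiles = rng.random((2, 50))
true_props = rng.dirichlet([1.0, 1.0], size=8)
bulk = true_props @ true_profiles

props, profiles = nmf_proportions(bulk, k=2)
```

Each row of `props` is a nonnegative vector summing to one. Matching the inferred components to actual cell types (they come out in arbitrary order) is a separate step not shown here.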
Affiliation(s)
- Bryce Rowland
  - Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Ruth Huh
  - Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Zoey Hou
  - Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois, United States of America
- Cheynna Crowley
  - Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Jia Wen
  - Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Yin Shen
  - Institute for Human Genetics, University of California San Francisco, San Francisco, California, United States of America
  - Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
- Ming Hu
  - Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America
- Paola Giusti-Rodríguez
  - Department of Psychiatry, University of Florida College of Medicine, Gainesville, Florida, United States of America
- Patrick F. Sullivan
  - Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
  - Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
  - Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Yun Li
  - Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
  - Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
  - Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
4
Varoquaux N, Lioy VS, Boccard F, Junier I. Computational Tools for the Multiscale Analysis of Hi-C Data in Bacterial Chromosomes. Methods Mol Biol 2022; 2301:197-207. [PMID: 34415537 DOI: 10.1007/978-1-0716-1390-0_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/13/2023]
Abstract
Just as in eukaryotes, high-throughput chromosome conformation capture (Hi-C) data have revealed nested organizations of bacterial chromosomes into overlapping interaction domains. In this chapter, we present a multiscale analysis framework that aims to capture and quantify these properties. The framework includes both standard tools (e.g., contact laws) and novel ones, such as an index that identifies loci involved in domain formation independently of the structuring scale at play. Our objective is twofold. On the one hand, we aim to provide full, understandable Python/Jupyter-based code that can be used by both computer scientists and biologists with no advanced computational background. On the other hand, we discuss statistical issues inherent to Hi-C data analysis, focusing more particularly on how to properly assess the statistical significance of results. As a pedagogical example, we analyze data produced in Pseudomonas aeruginosa, a model pathogenic bacterium. All files (codes and input data) can be found on a GitHub repository. We have also embedded the files into a Binder package so that the full analysis can be run on any machine over the Internet.
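The "contact law" mentioned among the standard tools is the average contact frequency P(s) as a function of genomic separation s. Below is a minimal sketch for a circular chromosome. It is not taken from the chapter's repository, and the function name `contact_law` and the toy matrix are assumptions made for illustration.

```python
import numpy as np

def contact_law(C):
    """Mean contact frequency P(s) versus genomic separation s (in bins).

    C is a symmetric n x n contact matrix of a circular chromosome, so the
    separation between bins i and j is the shorter way around the circle.
    """
    n = C.shape[0]
    idx = np.arange(n)
    sep = np.abs(idx[:, None] - idx[None, :])
    sep = np.minimum(sep, n - sep)          # circular distance
    s_values = np.arange(1, n // 2 + 1)
    P = np.array([C[sep == s].mean() for s in s_values])
    return s_values, P

# Toy matrix whose entries depend only on circular separation: 1 / (1 + s)
n = 20
i = np.arange(n)
d = np.abs(i[:, None] - i[None, :])
d = np.minimum(d, n - d)
C = 1.0 / (1.0 + d)

s, P = contact_law(C)
```

On this toy input the recovered law is exactly `1 / (1 + s)`. On real data one would typically plot `P` against `s` on log-log axes.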
Affiliation(s)
- Virginia S Lioy
  - Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, Gif-sur-Yvette, France
- Frédéric Boccard
  - Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, Gif-sur-Yvette, France
- Ivan Junier
  - TIMC-IMAG, CNRS, Univ. Grenoble Alpes, Grenoble, France
5
Perspectives for the reconstruction of 3D chromatin conformation using single cell Hi-C data. PLoS Comput Biol 2021; 17:e1009546. [PMID: 34793453 PMCID: PMC8601426 DOI: 10.1371/journal.pcbi.1009546] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/22/2021] [Accepted: 10/08/2021] [Indexed: 11/19/2022]
Abstract
Construction of 3D chromosome models based on single cell Hi-C data constitutes an important challenge. We present a reconstruction approach, DPDchrom, that incorporates basic knowledge of whether the reconstructed conformation should be coil-like or globular, as well as spring relaxation at contact sites. In contrast to previously published protocols, DPDchrom can naturally form globular conformations due to the presence of explicit solvent. Benchmarking of this and several other methods on artificial polymer models reveals similar reconstruction accuracy at high contact density and an advantage for DPDchrom at low contact density. To compare 3D structures insensitively to spatial orientation and scale, we propose the Modified Jaccard Index. We analyzed two sources of contact dropout: contact radius change and random contact sampling. We found that the reconstruction accuracy depends exponentially on the number of contacts per genomic bin, which allows the reconstruction accuracy to be estimated in advance. We applied DPDchrom to model chromosome configurations based on single-cell Hi-C data of mouse oocytes and found that these configurations differ significantly from a random one, which is consistent with other studies.
6
Gilbert BR, Thornburg ZR, Lam V, Rashid FZM, Glass JI, Villa E, Dame RT, Luthey-Schulten Z. Generating Chromosome Geometries in a Minimal Cell From Cryo-Electron Tomograms and Chromosome Conformation Capture Maps. Front Mol Biosci 2021; 8:644133. [PMID: 34368224 PMCID: PMC8339304 DOI: 10.3389/fmolb.2021.644133] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 12/20/2020] [Accepted: 05/14/2021] [Indexed: 12/31/2022]
Abstract
JCVI-syn3A is a genetically minimal bacterial cell, consisting of 493 genes and only a single 543 kbp circular chromosome. Syn3A’s genome and physical size are approximately one-tenth those of the model bacterial organism Escherichia coli’s, and the corresponding reduction in complexity and scale provides a unique opportunity for whole-cell modeling. Previous work established genome-scale gene essentiality and proteomics data along with its essential metabolic network and a kinetic model of genetic information processing. In addition to that information, whole-cell, spatially-resolved kinetic models require cellular architecture, including spatial distributions of ribosomes and the circular chromosome’s configuration. We reconstruct cellular architectures of Syn3A cells at the single-cell level directly from cryo-electron tomograms, including the ribosome distributions. We present a method of generating self-avoiding circular chromosome configurations in a lattice model with a resolution of 11.8 bp per monomer on a 4 nm cubic lattice. Realizations of the chromosome configurations are constrained by the ribosomes and geometry reconstructed from the tomograms and include DNA loops suggested by experimental chromosome conformation capture (3C) maps. Using ensembles of simulated chromosome configurations we predict chromosome contact maps for Syn3A cells at resolutions of 250 bp and greater and compare them to the experimental maps. Additionally, the spatial distributions of ribosomes and the DNA-crowding resulting from the individual chromosome configurations can be used to identify macromolecular structures formed from ribosomes and DNA, such as polysomes and expressomes.
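The lattice numbers quoted in the abstract can be checked with a short calculation. The sketch below assumes the textbook B-DNA rise of roughly 0.34 nm per base pair, which the abstract does not state, and rounds the genome to 543,000 bp. The variable names are illustrative.

```python
RISE_NM_PER_BP = 0.34    # approximate B-DNA rise per base pair (assumption)
LATTICE_NM = 4.0         # lattice spacing quoted in the abstract
BP_PER_MONOMER = 11.8    # resolution quoted in the abstract
GENOME_BP = 543_000      # "543 kbp" from the abstract, rounded

# DNA length that fits along one 4 nm lattice edge, in base pairs
bp_per_lattice_edge = LATTICE_NM / RISE_NM_PER_BP

# Number of monomers needed to represent the whole chromosome
n_monomers = round(GENOME_BP / BP_PER_MONOMER)

# Contour length of the chromosome in micrometres
contour_length_um = GENOME_BP * RISE_NM_PER_BP / 1000.0

print(f"{bp_per_lattice_edge:.1f} bp per lattice edge")
print(f"{n_monomers} monomers, contour length {contour_length_um:.2f} um")
```

Under these assumptions a 4 nm edge holds about 11.8 bp, consistent with the quoted resolution, and the chromosome maps to roughly 46,000 monomers.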
Affiliation(s)
- Benjamin R Gilbert
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Zane R Thornburg
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- Vinson Lam
  - Division of Biological Sciences, University of California San Diego, San Diego, CA, United States
- Fatema-Zahra M Rashid
  - Leiden Institute of Chemistry, Leiden University, Leiden, Netherlands
  - Center for Microbial Cell Biology, Leiden University, Leiden, Netherlands
- John I Glass
  - Synthetic Biology Group, J. Craig Venter Institute, La Jolla, CA, United States
- Elizabeth Villa
  - Division of Biological Sciences, University of California San Diego, San Diego, CA, United States
- Remus T Dame
  - Leiden Institute of Chemistry, Leiden University, Leiden, Netherlands
  - Center for Microbial Cell Biology, Leiden University, Leiden, Netherlands
- Zaida Luthey-Schulten
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
7
Di Stefano M, Stadhouders R, Farabella I, Castillo D, Serra F, Graf T, Marti-Renom MA. Transcriptional activation during cell reprogramming correlates with the formation of 3D open chromatin hubs. Nat Commun 2020; 11:2564. [PMID: 32444798 PMCID: PMC7244774 DOI: 10.1038/s41467-020-16396-1] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Received: 12/26/2019] [Accepted: 05/01/2020] [Indexed: 12/22/2022]
Abstract
Chromosome structure is a crucial regulatory factor for a wide range of nuclear processes. Chromosome conformation capture (3C)-based experiments combined with computational modelling are pivotal for unveiling 3D chromosome structure. Here, we introduce TADdyn, a tool that integrates time-course 3C data, restraint-based modelling, and molecular dynamics to simulate the structural rearrangements of genomic loci in a completely data-driven way. We apply TADdyn on in situ Hi-C time-course experiments studying the reprogramming of murine B cells to pluripotent cells, and characterize the structural rearrangements that take place upon changes in the transcriptional state of 21 genomic loci of diverse expression dynamics. By measuring various structural and dynamical properties, we find that during gene activation, the transcription starting site contacts with open and active regions in 3D chromatin domains. We propose that these 3D hubs of open and active chromatin may constitute a general feature to trigger and maintain gene transcription.
Affiliation(s)
- Marco Di Stefano
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
- Ralph Stadhouders
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
  - Department of Pulmonary Medicine and Department of Cell Biology, Erasmus MC, Rotterdam, the Netherlands
- Irene Farabella
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
- David Castillo
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
- François Serra
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
  - Computational Biology Group-Barcelona Supercomputing Center (BSC), 08034, Barcelona, Spain
- Thomas Graf
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
- Marc A Marti-Renom
  - CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain
  - Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
  - Universitat Pompeu Fabra (UPF), 08002, Barcelona, Spain
  - ICREA, Pg. Lluís Companys 23, 08010, Barcelona, Spain
8
Bayesian inference of chromatin structure ensembles from population-averaged contact data. Proc Natl Acad Sci U S A 2020; 117:7824-7830. [PMID: 32193349 DOI: 10.1073/pnas.1910364117] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Indexed: 12/13/2022]
Abstract
Mounting experimental evidence suggests a role for the spatial organization of chromatin in crucial processes of the cell nucleus such as transcription regulation. Chromosome conformation capture techniques allow us to characterize chromatin structure by mapping contacts between chromosomal loci on a genome-wide scale. The most widespread modality is to measure contact frequencies averaged over a population of cells. Single-cell variants exist, but suffer from low contact numbers and have not yet gained the same resolution as population methods. While intriguing biological insights have already been garnered from ensemble-averaged data, information about three-dimensional (3D) genome organization in the underlying individual cells remains largely obscured because the contact maps show only an average over a huge population of cells. Moreover, computational methods for structure modeling of chromatin have mostly focused on fitting a single consensus structure, thereby ignoring any cell-to-cell variability in the model itself. Here, we propose a fully Bayesian method to infer ensembles of chromatin structures and to determine the optimal number of states in a principled, objective way. We illustrate our approach on simulated data and compute multistate models of chromatin from chromosome conformation capture carbon copy (5C) data. Comparison with independent data suggests that the inferred ensembles represent the underlying sample population faithfully. Harnessing the rich information contained in multistate models, we investigate cell-to-cell variability of chromatin organization into topologically associating domains, thus highlighting the ability of our approach to deliver insights into chromatin organization of great biological relevance.
9
Caudai C, Salerno E, Zoppe M, Tonazzini A. Estimation of the Spatial Chromatin Structure Based on a Multiresolution Bead-Chain Model. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2019; 16:550-559. [PMID: 29994172 DOI: 10.1109/tcbb.2018.2791439] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 06/08/2023]
Abstract
We present a method to infer 3D chromatin configurations from Chromosome Conformation Capture data. Quite a few methods have been proposed to estimate the structure of the nuclear DNA in homogeneous populations of cells from this kind of data. Many of them transform contact frequencies into Euclidean distances between pairs of chromatin fragments, and then reconstruct the structure by solving a distance-to-geometry problem. To avoid inconsistencies, our method is based on a score function that does not require any frequency-to-distance translation. We propose a multiscale chromatin model where the chromatin fiber is suitably partitioned at each scale. The partial structures are estimated independently, and connected to rebuild the whole fiber. Our score function consists of a data-fit part and a penalty part, balanced automatically at each scale and each subchain. The penalty part enforces soft geometric constraints. As many different structures can fit the data, our sampling strategy produces a set of solutions with similar scores. The procedure contains a few parameters, independent of both the scale and the genomic segment treated. The partition of the fiber, along with intrinsically parallel parts, makes this method computationally efficient. Results from human genome data support the biological plausibility of our solutions.
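The frequency-to-distance translation that this method avoids is often taken to be a power law. The sketch below shows one way such a translation can become geometrically inconsistent. The law d = f^(-alpha), the function names, and the toy matrix are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np
from itertools import permutations

def freq_to_dist(F, alpha=1.0):
    """Map contact frequencies to target distances d = f**(-alpha).

    Pairs with zero frequency get no target distance (NaN).
    """
    F = np.asarray(F, dtype=float)
    D = np.full(F.shape, np.nan)
    mask = F > 0
    D[mask] = F[mask] ** (-alpha)
    np.fill_diagonal(D, 0.0)
    return D

def triangle_violations(D):
    """Return triples (i, j, k), i < k, with D[i, k] > D[i, j] + D[j, k]."""
    n = D.shape[0]
    return [(i, j, k) for i, j, k in permutations(range(n), 3)
            if i < k and D[i, k] > D[i, j] + D[j, k]]

# Fragments 0-1 and 1-2 touch often, but 0-2 only rarely
F = np.array([[0, 10, 1],
              [10, 0, 10],
              [1, 10, 0]])
D = freq_to_dist(F)
violations = triangle_violations(D)
print(violations)   # [(0, 1, 2)]
```

In this toy case the translated distances (0.1, 0.1 and 1.0) violate the triangle inequality, so no arrangement of three points can satisfy all of them at once.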
10
Gürsoy G, Xu Y, Kenter AL, Liang J. Computational construction of 3D chromatin ensembles and prediction of functional interactions of alpha-globin locus from 5C data. Nucleic Acids Res 2017; 45:11547-11558. [PMID: 28981716 PMCID: PMC5714131 DOI: 10.1093/nar/gkx784] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 12/07/2016] [Accepted: 08/30/2017] [Indexed: 01/23/2023]
Abstract
Conformation capture technologies measure frequencies of interactions between chromatin regions. However, understanding gene regulation requires knowledge of detailed spatial structures of heterogeneous chromatin in cells. Here we describe the nC-SAC (n-Constrained-Self Avoiding Chromatin) method that transforms experimental interaction frequencies into 3D ensembles of chromatin chains. nC-SAC first distinguishes specific from non-specific interaction frequencies, then generates 3D chromatin ensembles using identified specific interactions as spatial constraints. Application to the α-globin locus shows that these constraints (∼20%) drive the formation of ∼99% of all experimentally captured interactions, in which ∼30% additional to the imposed constraints is found to be specific. Many novel specific spatial contacts not captured by experiments are also predicted. A subset, for which independent ChIA-PET data are available, is validated to be RNAPII-, CTCF-, and RAD21-mediated. Their positioning in the architectural context of imposed specific interactions from nC-SAC is highly important. Our results also suggest the presence of a many-body structural unit involving the α-globin gene, its enhancers, and the POL3RK gene for regulating the expression of α-globin in silent cells.
Affiliation(s)
- Gamze Gürsoy
  - Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA
- Yun Xu
  - Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA
- Amy L Kenter
  - Department of Microbiology and Immunology, University of Illinois College of Medicine, Chicago, IL 60612, USA
- Jie Liang
  - Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA
11
Haddad N, Vaillant C, Jost D. IC-Finder: inferring robustly the hierarchical organization of chromatin folding. Nucleic Acids Res 2017; 45:e81. [PMID: 28130423 PMCID: PMC5449546 DOI: 10.1093/nar/gkx036] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Received: 05/27/2016] [Accepted: 01/13/2017] [Indexed: 11/22/2022]
Abstract
The spatial organization of the genome plays a crucial role in the regulation of gene expression. Recent experimental techniques like Hi-C have emphasized the segmentation of genomes into interaction compartments that constitute conserved functional domains participating in the maintenance of a proper cell identity. Here, we propose a novel method, IC-Finder, to identify interaction compartments (IC) from experimental Hi-C maps. IC-Finder is based on a hierarchical clustering approach that we adapted to account for the polymeric nature of chromatin. Based on a benchmark of realistic in silico Hi-C maps, we show that IC-Finder is one of the best methods in terms of reliability and is the most efficient numerically. IC-Finder proposes two original options: a probabilistic description of the inferred compartments and the possibility to explore the various hierarchies of chromatin organization. Applying the method to experimental data in fly and human, we show how the predicted segmentation may depend on the normalization scheme and how 3D compartmentalization is tightly associated with epigenomic information. IC-Finder provides a robust and generic ‘all-in-one’ tool to uncover the general principles of 3D chromatin folding and their influence on gene regulation. The software is available at http://membres-timc.imag.fr/Daniel.Jost/DJ-TIMC/Software.html.
Affiliation(s)
- Noelle Haddad
  - Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS, Laboratoire de Physique, F-69007 Lyon, France
- Cédric Vaillant
  - Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS, Laboratoire de Physique, F-69007 Lyon, France
- Daniel Jost
  - Univ. Grenoble Alpes, CNRS, TIMC-IMAG, F-38000 Grenoble, France
12
Zhan Y, Mariani L, Barozzi I, Schulz EG, Blüthgen N, Stadler M, Tiana G, Giorgetti L. Reciprocal insulation analysis of Hi-C data shows that TADs represent a functionally but not structurally privileged scale in the hierarchical folding of chromosomes. Genome Res 2017; 27:479-490. [PMID: 28057745 PMCID: PMC5340975 DOI: 10.1101/gr.212803.116] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Received: 07/15/2016] [Accepted: 01/04/2017] [Indexed: 12/02/2022]
Abstract
Understanding how regulatory sequences interact in the context of chromosomal architecture is a central challenge in biology. Chromosome conformation capture revealed that mammalian chromosomes possess a rich hierarchy of structural layers, from multi-megabase compartments to sub-megabase topologically associating domains (TADs) and sub-TAD contact domains. TADs appear to act as regulatory microenvironments by constraining and segregating regulatory interactions across discrete chromosomal regions. However, it is unclear whether other (or all) folding layers share similar properties, or rather TADs constitute a privileged folding scale with maximal impact on the organization of regulatory interactions. Here, we present a novel algorithm named CaTCH that identifies hierarchical trees of chromosomal domains in Hi-C maps, stratified through their reciprocal physical insulation, which is a single and biologically relevant parameter. By applying CaTCH to published Hi-C data sets, we show that previously reported folding layers appear at different insulation levels. We demonstrate that although no structurally privileged folding level exists, TADs emerge as a functionally privileged scale defined by maximal boundary enrichment in CTCF and maximal cell-type conservation. By measuring transcriptional output in embryonic stem cells and neural precursor cells, we show that the likelihood that genes in a domain are coregulated during differentiation is also maximized at the scale of TADs. Finally, we observe that regulatory sequences occur at genomic locations corresponding to optimized mutual interactions at the same scale. Our analysis suggests that the architectural functionality of TADs arises from the interplay between their ability to partition interactions and the specific genomic position of regulatory sequences.
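The notion of physical insulation between neighbouring domains can be illustrated with a generic insulation profile. This is a simplified stand-in, not CaTCH's reciprocal insulation measure, whose exact definition is not given in the abstract. The function name `insulation_profile`, the window size, and the toy matrix are assumptions.

```python
import numpy as np

def insulation_profile(C, w):
    """Mean contact frequency across each candidate boundary.

    For a boundary b (between bins b-1 and b), average the contacts between
    the w bins upstream and the w bins downstream. Low values mean the two
    sides rarely touch, i.e. the boundary is strongly insulating.
    Positions too close to the matrix edges are left as NaN.
    """
    n = C.shape[0]
    prof = np.full(n, np.nan)
    for b in range(w, n - w + 1):
        prof[b] = C[b - w:b, b:b + w].mean()
    return prof

# Toy map: two 5-bin domains with strong internal and weak external contacts
C = np.full((10, 10), 0.1)
C[:5, :5] = 1.0
C[5:, 5:] = 1.0

prof = insulation_profile(C, w=2)
boundary = int(np.nanargmin(prof))
print(boundary)   # 5
```

The profile dips to its minimum at position 5, the border between the two toy domains. Scanning such a measure over a range of thresholds or window sizes is one generic way to expose nested domain hierarchies.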
Affiliation(s)
- Yinxiu Zhan: Friedrich Miescher Institute for Biomedical Research, Basel, CH-4058, Switzerland; University of Basel, CH-4003 Basel, Switzerland
- Luca Mariani: Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, 75248 Paris Cedex 05, France
- Iros Barozzi: Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- Edda G Schulz: Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, 75248 Paris Cedex 05, France
- Nils Blüthgen: Institute of Pathology, Charité - Universitätsmedizin Berlin, 10117 Berlin, Germany; Interdisciplinary Research Institute for the Life Sciences, Humboldt University, 10115 Berlin, Germany
- Michael Stadler: Friedrich Miescher Institute for Biomedical Research, Basel, CH-4058, Switzerland; Swiss Institute of Bioinformatics, CH-4058 Basel, Switzerland
- Guido Tiana: Department of Physics and Center for Complexity and Biosystems, University of Milano and Istituto Nazionale di Fisica Nucleare, 20133, Milano, Italy
- Luca Giorgetti: Friedrich Miescher Institute for Biomedical Research, Basel, CH-4058, Switzerland
13
Szałaj P, Tang Z, Michalski P, Pietal MJ, Luo OJ, Sadowski M, Li X, Radew K, Ruan Y, Plewczynski D. An integrated 3-Dimensional Genome Modeling Engine for data-driven simulation of spatial genome organization. Genome Res 2016; 26:1697-1709. [PMID: 27789526 PMCID: PMC5131821 DOI: 10.1101/gr.205062.116] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Received: 02/04/2016] [Accepted: 10/20/2016] [Indexed: 02/03/2023]
Abstract
ChIA-PET is a high-throughput mapping technology that reveals long-range chromatin interactions and provides insights into the basic principles of spatial genome organization and gene regulation mediated by specific protein factors. Recently, we showed that a single ChIA-PET experiment provides information at all genomic scales of interest, from the high-resolution locations of binding sites and enriched chromatin interactions mediated by specific protein factors, to the low resolution of nonenriched interactions that reflect topological neighborhoods of higher-order chromosome folding. This multilevel nature of ChIA-PET data offers an opportunity to use multiscale 3D models to study structural-functional relationships at multiple length scales, but doing so requires a structural modeling platform. Here, we report the development of 3D-GNOME (3-Dimensional Genome Modeling Engine), a complete computational pipeline for 3D simulation using ChIA-PET data. 3D-GNOME consists of three integrated components: a graph-distance-based heat map normalization tool, a 3D modeling platform, and an interactive 3D visualization tool. Using ChIA-PET and Hi-C data derived from human B-lymphocytes, we demonstrate the effectiveness of 3D-GNOME in building 3D genome models at multiple levels, including the entire genome, individual chromosomes, and specific segments at megabase (Mb) and kilobase (kb) resolutions of single average and ensemble structures. Further incorporation of CTCF-motif orientation and high-resolution looping patterns in 3D simulation provided additional reliability of potential biologically plausible topological structures.
Affiliation(s)
- Przemysław Szałaj: Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland; Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland; I-BioStat, Hasselt University, BE3590 Hasselt, Belgium
- Zhonghui Tang: The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
- Paul Michalski: The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
- Michal J Pietal: Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
- Oscar J Luo: The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
- Michał Sadowski: Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
- Xingwang Li: The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
- Kamen Radew: Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
- Yijun Ruan: The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA; Department of Genetics and Genome Sciences, UConn Health, Farmington, Connecticut 06032, USA
- Dariusz Plewczynski: Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland; Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland; Faculty of Pharmacy, Medical University of Warsaw, 02-097 Warsaw, Poland
14
Sefer E, Duggal G, Kingsford C. Deconvolution of Ensemble Chromatin Interaction Data Reveals the Latent Mixing Structures in Cell Subpopulations. J Comput Biol 2016; 23:425-38. [PMID: 27267775 PMCID: PMC4904159 DOI: 10.1089/cmb.2015.0210] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Indexed: 01/14/2023] Open
Abstract
Chromosome conformation capture (3C) experiments provide a window into the spatial packing of a genome in three dimensions within the cell. This structure has been shown to be correlated with gene regulation, cancer mutations, and other genomic functions. However, 3C provides mixed measurements on a population of typically millions of cells, each with a different genome structure due to the fluidity of the genome and differing cell states. Here, we present several algorithms to deconvolve these measured 3C matrices into estimations of the contact matrices for each subpopulation of cells and relative densities of each subpopulation. We formulate the problem as that of choosing matrices and densities that minimize the Frobenius distance between the observed 3C matrix and the weighted sum of the estimated subpopulation matrices. Results on HeLa 5C and mouse and bacteria Hi-C data demonstrate the methods' effectiveness. We also show that domain boundaries from deconvolved matrices are often more enriched or depleted for regulatory chromatin markers when compared to boundaries from convolved matrices.
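The objective described in this abstract can be illustrated with a minimal sketch. It only evaluates the Frobenius distance between an observed matrix and a density-weighted sum of candidate subpopulation matrices; it is not the authors' algorithm, and the function names and the 2x2 toy matrices are hypothetical choices made for illustration.

```python
import math


def weighted_sum(matrices, densities):
    """Return sum_k densities[k] * matrices[k] as a nested list."""
    n_rows = len(matrices[0])
    n_cols = len(matrices[0][0])
    return [
        [sum(d * m[i][j] for m, d in zip(matrices, densities)) for j in range(n_cols)]
        for i in range(n_rows)
    ]


def frobenius_distance(a, b):
    """Square root of the sum of squared entry-wise differences."""
    return math.sqrt(
        sum((x - y) ** 2 for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))
    )


def deconvolution_objective(observed, matrices, densities):
    """Distance between the observed matrix and the weighted mixture (assumed form)."""
    return frobenius_distance(observed, weighted_sum(matrices, densities))


# Hypothetical toy data: two subpopulation contact matrices and their mixture.
sub_a = [[1.0, 0.0], [0.0, 1.0]]
sub_b = [[0.0, 1.0], [1.0, 0.0]]
observed = [[0.75, 0.25], [0.25, 0.75]]

good = deconvolution_objective(observed, [sub_a, sub_b], [0.75, 0.25])
bad = deconvolution_objective(observed, [sub_a, sub_b], [0.5, 0.5])
print(good, bad)
```

In this toy case the densities that generated the mixture give a lower objective than an equal split. A full method would also have to search over the subpopulation matrices themselves, which this sketch does not attempt.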
Affiliation(s)
- Emre Sefer: School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania
- Geet Duggal: School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania
- Carl Kingsford: School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania
15
Brackley CA, Johnson J, Kelly S, Cook PR, Marenduzzo D. Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domains. Nucleic Acids Res 2016; 44:3503-12. [PMID: 27060145 PMCID: PMC4856988 DOI: 10.1093/nar/gkw135] [Citation(s) in RCA: 110] [Impact Index Per Article: 13.8] [Received: 11/17/2015] [Revised: 02/22/2016] [Accepted: 02/24/2016] [Indexed: 01/12/2023] Open
Abstract
Biophysicists are modeling conformations of interphase chromosomes, often basing the strengths of interactions between segments distant on the genetic map on contact frequencies determined experimentally. Here, instead, we develop a fitting-free, minimal model: bivalent or multivalent red and green 'transcription factors' bind to cognate sites in strings of beads ('chromatin') to form molecular bridges stabilizing loops. In the absence of additional explicit forces, molecular dynamics simulations reveal that bound factors spontaneously cluster (red with red, green with green, but rarely red with green) to give structures reminiscent of transcription factories. Binding of just two transcription factors (or proteins) to active and inactive regions of human chromosomes yields rosettes, topological domains and contact maps much like those seen experimentally. This emergent 'bridging-induced attraction' proves to be a robust, simple and generic force able to organize interphase chromosomes at all scales.
Affiliation(s)
- Chris A Brackley: SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
- James Johnson: SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
- Steven Kelly: Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, UK
- Peter R Cook: Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford, OX1 3RE, UK
- Davide Marenduzzo: SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
16
Sekelja M, Paulsen J, Collas P. 4D nucleomes in single cells: what can computational modeling reveal about spatial chromatin conformation? Genome Biol 2016; 17:54. [PMID: 27052789 PMCID: PMC4823877 DOI: 10.1186/s13059-016-0923-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Indexed: 01/05/2023] Open
Abstract
Genome-wide sequencing technologies enable investigations of the structural properties of the genome in various spatial dimensions. Here, we review computational techniques developed to model the three-dimensional genome in single cells versus ensembles of cells and assess their underlying assumptions. We further address approaches to study the spatio-temporal aspects of genome organization from single-cell data.
Affiliation(s)
- Monika Sekelja: Department of Molecular Medicine, Faculty of Medicine, University of Oslo, PO Box 1112, Blindern, 0317, Oslo, Norway
- Jonas Paulsen: Department of Molecular Medicine, Faculty of Medicine, University of Oslo, PO Box 1112, Blindern, 0317, Oslo, Norway
- Philippe Collas: Department of Molecular Medicine, Faculty of Medicine, University of Oslo, PO Box 1112, Blindern, 0317, Oslo, Norway
17
Ay F, Noble WS. Analysis methods for studying the 3D architecture of the genome. Genome Biol 2015; 16:183. [PMID: 26328929 PMCID: PMC4556012 DOI: 10.1186/s13059-015-0745-7] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Received: 05/11/2015] [Accepted: 08/10/2015] [Indexed: 11/10/2022] Open
Abstract
The rapidly increasing quantity of genome-wide chromosome conformation capture data presents great opportunities and challenges in the computational modeling and interpretation of the three-dimensional genome. In particular, with recent trends towards higher-resolution high-throughput chromosome conformation capture (Hi-C) data, the diversity and complexity of biological hypotheses that can be tested necessitates rigorous computational and statistical methods as well as scalable pipelines to interpret these datasets. Here we review computational tools to interpret Hi-C data, including pipelines for mapping, filtering, and normalization, and methods for confidence estimation, domain calling, visualization, and three-dimensional modeling.
Affiliation(s)
- Ferhat Ay: Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA; Feinberg School of Medicine, Northwestern University, Chicago, 60661, IL, USA
- William S Noble: Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA; Department of Computer Science and Engineering, University of Washington, Seattle, 98195, WA, USA