1
|
Conte M, Abraham A, Esposito A, Yang L, Gibcus JH, Parsi KM, Vercellone F, Fontana A, Di Pierno F, Dekker J, Nicodemi M. Polymer Physics Models Reveal Structural Folding Features of Single-Molecule Gene Chromatin Conformations. Int J Mol Sci 2024; 25:10215. [PMID: 39337699 PMCID: PMC11432541 DOI: 10.3390/ijms251810215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 09/17/2024] [Accepted: 09/22/2024] [Indexed: 09/30/2024] Open
Abstract
Here, we employ polymer physics models of chromatin to investigate the 3D folding of a 2 Mb wide genomic region encompassing the human LTN1 gene, a crucial DNA locus involved in key cellular functions. Through extensive Molecular Dynamics simulations, we reconstruct in silico the ensemble of single-molecule LTN1 3D structures, which we benchmark against recent in situ Hi-C 2.0 data. The model-derived single molecules are then used to predict structural folding features at the single-cell level, providing testable predictions for super-resolution microscopy experiments.
Collapse
Affiliation(s)
- Mattia Conte
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Alex Abraham
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Andrea Esposito
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Liyan Yang
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Johan H. Gibcus
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Krishna M. Parsi
- Diabetes Center of Excellence and Program in Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
| | - Francesca Vercellone
- DIETI, Università di Napoli Federico II, Via Claudio 21, 80125 Naples, Italy
- INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Andrea Fontana
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Florinda Di Pierno
- DIETI, Università di Napoli Federico II, Via Claudio 21, 80125 Naples, Italy
- INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| | - Job Dekker
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Mario Nicodemi
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126 Naples, Italy
| |
Collapse
|
2
|
龚 海, 张 司, 张 晓. [An identification method of chromatin topological associated domains based on spatial density clustering]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2024; 41:552-559. [PMID: 38932542 PMCID: PMC11208664 DOI: 10.7507/1001-5515.202311059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 03/05/2024] [Indexed: 06/28/2024]
Abstract
The rapid development of high-throughput chromatin conformation capture (Hi-C) technology provides rich genomic interaction data between chromosomal loci for chromatin structure analysis. However, existing methods for identifying topologically associated domains (TADs) based on Hi-C data suffer from low accuracy and sensitivity to parameters. In this context, a TAD identification method based on spatial density clustering was designed and implemented in this paper. The method preprocessed the raw Hi-C data to obtain normalized Hi-C contact matrix data. Then, it computed the distance matrix between loci, generated a reachability graph based on the core distance and reachability distance of loci, and extracted clustering clusters. Finally, it extracted TAD boundaries based on clustering results. This method could identify TAD structures with higher coherence, and TAD boundaries were enriched with more ChIP-seq factors. Experimental results demonstrate that our method has advantages such as higher accuracy and practical significance in TAD identification.
Collapse
Affiliation(s)
- 海燕 龚
- 北京科技大学 新材料技术研究院(北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 司臣 张
- 北京科技大学 新材料技术研究院(北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 晓彤 张
- 北京科技大学 新材料技术研究院(北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| |
Collapse
|
3
|
Kocanova S, Raynal F, Goiffon I, Oksuz BA, Baú D, Kamgoué A, Cantaloube S, Zhan Y, Lajoie B, Marti-Renom MA, Dekker J, Bystricky K. Enhancer-driven 3D chromatin domain folding modulates transcription in human mammary tumor cells. Life Sci Alliance 2024; 7:e202302154. [PMID: 37989525 PMCID: PMC10663337 DOI: 10.26508/lsa.202302154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 11/03/2023] [Accepted: 11/06/2023] [Indexed: 11/23/2023] Open
Abstract
The genome is organized in functional compartments and structural domains at the sub-megabase scale. How within these domains interactions between numerous cis-acting enhancers and promoters regulate transcription remains an open question. Here, we determined chromatin folding and composition over several hundred kb around estrogen-responsive genes in human breast cancer cell lines after hormone stimulation. Modeling of 5C data at 1.8 kb resolution was combined with quantitative 3D analysis of multicolor FISH measurements at 100 nm resolution and integrated with ChIP-seq data on transcription factor binding and histone modifications. We found that rapid estradiol induction of the progesterone gene expression occurs in the context of preexisting, cell type-specific chromosomal architectures encompassing the 90 kb progesterone gene coding region and an enhancer-spiked 5' 300 kb upstream genomic region. In response to estradiol, interactions between estrogen receptor α (ERα) bound regulatory elements are reinforced. Whereas initial enhancer-gene contacts coincide with RNA Pol 2 binding and transcription initiation, sustained hormone stimulation promotes ERα accumulation creating a regulatory hub stimulating transcript synthesis. In addition to implications for estrogen receptor signaling, we uncover that preestablished chromatin architectures efficiently regulate gene expression upon stimulation without the need for de novo extensive rewiring of long-range chromatin interactions.
Collapse
Affiliation(s)
- Silvia Kocanova
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
| | - Flavien Raynal
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
| | - Isabelle Goiffon
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
| | - Betul Akgol Oksuz
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Davide Baú
- Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain
| | - Alain Kamgoué
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
| | - Sylvain Cantaloube
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
| | - Ye Zhan
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Bryan Lajoie
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Marc A Marti-Renom
- Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain
- Genome Biology Program, Centre de Regulació Genòmica (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Job Dekker
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Kerstin Bystricky
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France
- Institut Universitaire de France (IUF), Paris, France
| |
Collapse
|
4
|
Li Z, Schlick T. Hi-BDiSCO: folding 3D mesoscale genome structures from Hi-C data using brownian dynamics. Nucleic Acids Res 2024; 52:583-599. [PMID: 38015443 PMCID: PMC10810283 DOI: 10.1093/nar/gkad1121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 10/12/2023] [Accepted: 11/22/2023] [Indexed: 11/29/2023] Open
Abstract
The structure and dynamics of the eukaryotic genome are intimately linked to gene regulation and transcriptional activity. Many chromosome conformation capture experiments like Hi-C have been developed to detect genome-wide contact frequencies and quantify loop/compartment structures for different cellular contexts and time-dependent processes. However, a full understanding of these events requires explicit descriptions of representative chromatin and chromosome configurations. With the exponentially growing amount of data from Hi-C experiments, many methods for deriving 3D structures from contact frequency data have been developed. Yet, most reconstruction methods use polymer models with low resolution to predict overall genome structure. Here we present a Brownian Dynamics (BD) approach termed Hi-BDiSCO for producing 3D genome structures from Hi-C and Micro-C data using our mesoscale-resolution chromatin model based on the Discrete Surface Charge Optimization (DiSCO) model. Our approach integrates reconstruction with chromatin simulations at nucleosome resolution with appropriate biophysical parameters. Following a description of our protocol, we present applications to the NXN, HOXC, HOXA and Fbn2 mouse genes ranging in size from 50 to 100 kb. Such nucleosome-resolution genome structures pave the way for pursuing many biomedical applications related to the epigenomic regulation of chromatin and control of human disease.
Collapse
Affiliation(s)
- Zilong Li
- Department of Chemistry, 100 Washington Square East, Silver Building, New York University, New York, NY 10003, USA
- Simons Center for Computational Physical Chemistry, 24 Waverly Place, Silver Building, New York University, New York, NY 10003, USA
| | - Tamar Schlick
- Department of Chemistry, 100 Washington Square East, Silver Building, New York University, New York, NY 10003, USA
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, NY 10012, USA
- New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China
- Simons Center for Computational Physical Chemistry, 24 Waverly Place, Silver Building, New York University, New York, NY 10003, USA
| |
Collapse
|
5
|
Gong H, Zhang D, Zhang X. TOAST: A novel method for identifying topologically associated domains based on graph auto-encoders and clustering. Comput Struct Biotechnol J 2023; 21:4759-4768. [PMID: 37822562 PMCID: PMC10562672 DOI: 10.1016/j.csbj.2023.09.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 09/16/2023] [Accepted: 09/16/2023] [Indexed: 10/13/2023] Open
Abstract
Topologically associated domains (TADs) play a pivotal role in disease detection. This study introduces a novel TADs recognition approach named TOAST, leveraging graph auto-encoders and clustering techniques. TOAST conceptualizes each genomic bin as a node of a graph and employs the Hi-C contact matrix as the graph's adjacency matrix. By employing graph auto-encoders, TOAST generates informative embeddings as features. Subsequently, the unsupervised clustering algorithm HDBSCAN is utilized to assign labels to each genomic bin, facilitating the identification of contiguous regions with the same label as TADs. Our experimental analysis of several simulated Hi-C data sets shows that TOAST can quickly and accurately identify TADs from different types of simulated Hi-C contact matrices, outperforming existing algorithms. We also determined the anchoring ratio of TAD boundaries by analyzing different TAD recognition algorithms, and obtained an average ratio of anchoring CTCF, SMC3, RAD21, POLR2A, H3K36me3, H3K9me3, H3K4me3, H3K4me1, Enhancer, and Promoters of 0.66, 0.47, 0.54, 0.27, 0.24, 0.12, 0.32, 0.41, 0.26, and 0.13, respectively. In conclusion, TOAST is a method that can quickly identify TAD boundary parameters that are easy to understand and have important biological significance. The TOAST web server can be accessed via http://223.223.185.189:4005/. The code of TOAST is available online at https://github.com/ghaiyan/TOAST.
Collapse
Affiliation(s)
- Haiyan Gong
- Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing, 100083, China
- School of Computer and Communication Engineering, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Shunde innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China
| | - Dawei Zhang
- Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing, 100083, China
- Shunde innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China
| | - Xiaotong Zhang
- School of Computer and Communication Engineering, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Shunde innovation School, University of Science and Technology Beijing, Foshan, 528399, Guangdong, China
| |
Collapse
|
6
|
Li Z, Portillo-Ledesma S, Schlick T. Techniques for and challenges in reconstructing 3D genome structures from 2D chromosome conformation capture data. Curr Opin Cell Biol 2023; 83:102209. [PMID: 37506571 PMCID: PMC10529954 DOI: 10.1016/j.ceb.2023.102209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/07/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023]
Abstract
Chromosome conformation capture technologies that provide frequency information for contacts between genomic regions have been crucial for increasing our understanding of genome folding and regulation. However, such data do not provide direct evidence of the spatial 3D organization of chromatin. In this opinion article, we discuss the development and application of computational methods to reconstruct chromatin 3D structures from experimental 2D contact data, highlighting how such modeling provides biological insights and can suggest mechanisms anchored to experimental data. By applying different reconstruction methods to the same contact data, we illustrate some state-of-the-art of these techniques and discuss our gene resolution approach based on Brownian dynamics and Monte Carlo sampling.
Collapse
Affiliation(s)
- Zilong Li
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Stephanie Portillo-Ledesma
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, 10012, NY, USA; New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Room 340, Geography Building, 3663 North Zhongshan Road, Shanghai, 200122, China; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA.
| |
Collapse
|
7
|
Hovenga V, Kalita J, Oluwadare O. HiC-GNN: A generalizable model for 3D chromosome reconstruction using graph convolutional neural networks. Comput Struct Biotechnol J 2022; 21:812-836. [PMID: 36698967 PMCID: PMC9842867 DOI: 10.1016/j.csbj.2022.12.051] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 12/08/2022] [Accepted: 12/30/2022] [Indexed: 01/02/2023] Open
Abstract
Chromosome conformation capture (3 C) is a method of measuring chromosome topology in terms of loci interaction. The Hi-C method is a derivative of 3 C that allows for genome-wide quantification of chromosome interaction. From such interaction data, it is possible to infer the three-dimensional (3D) structure of the underlying chromosome. In this paper, we developed a novel method, HiC-GNN, for predicting the 3D structures of chromosomes from Hi-C data. HiC-GNN is unique from other methods for chromosome structure prediction in that the models learned by HiC-GNN can be generalized to data that is distinct from the training data. This aspect of HiC-GNN allows models that were trained on one Hi-C contact map to be used for inference on entirely different maps. To the authors' knowledge, this generalizing capability is not present in any existing methods. HiC-GNN uses a node embedding algorithm and a graph neural network to predict the 3D coordinates of each genomic loci from the corresponding Hi-C contact data. Unlike other methods, our algorithm allows for the storage of pre-trained parameters, thus enabling prediction on data that is entirely different from the training data. We show that our method can accurately generalize a single model across Hi-C resolutions, multiple restriction enzymes, and multiple cell populations while maintaining reconstruction accuracy across three Hi-C datasets. Our algorithm outperforms the state-of-the-art methods in accuracy of prediction and runtime and introduces a novel method for 3D structure prediction from Hi-C data. All our source codes and data are available at https://github.com/OluwadareLab/HiC-GNN.
Collapse
Affiliation(s)
- Van Hovenga
- Department of Mathematics, University of Colorado, Colorado Springs, CO, United States
| | - Jugal Kalita
- Department of Computer Science, University of Colorado, Colorado Springs, CO, United States
| | - Oluwatosin Oluwadare
- Department of Computer Science, University of Colorado, Colorado Springs, CO, United States,Corresponding author.
| |
Collapse
|
8
|
Gong H, Yang Y, Zhang X, Li M, Zhang S, Chen Y. CASPIAN: A method to identify chromatin topological associated domains based on spatial density cluster. Comput Struct Biotechnol J 2022; 20:4816-4824. [PMID: 36147659 PMCID: PMC9464881 DOI: 10.1016/j.csbj.2022.08.059] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 08/10/2022] [Accepted: 08/27/2022] [Indexed: 11/25/2022] Open
Abstract
With the development of Hi-C technology, the detection of topologically associated domains (TADs) boundaries plays an important role in exploring the relationship between gene structure and expression. However, a method that can identify accurate TAD boundaries from the Hi-C contact matrix with different resolutions is currently lacking. We proposed a method named CASPIAN that can identify chromatin TAD boundaries based on the spatial density clustering algorithm. CASPIAN requires few parameters to call TADs. This method is realized using the hierarchical density-based clustering method HDBSCAN, where the distance of pairwise bins is calculated based on three distance metrics (Euclidean, Manhattan, and Chebyshev distance metric) to adapt to the characteristics of the Hi-C contact matrix generated from simulation experiments or normalized methods. Our results show that, same as standard methods (e.g., Insulation Score, TopDom), CASPIAN can enrich factors related to promoting the gene expression, such as CTCF, H3K4me1, H3K4me3, RAD21, POLR2A, and SMC3. We also calculated the approximate proportion of various factors anchored at the TAD boundaries to observe the distribution of these factors surrounding the TAD boundaries. In conclusion, CASPIAN is an easy method to explore the relationship between transcription factors and TAD boundaries. CASPIAN is available online (https://gitee.com/ghaiyan/caspian).
Collapse
Affiliation(s)
- Haiyan Gong
- School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Yi Yang
- School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Xiaotong Zhang
- School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.,Shunde Graduate School, University of Science and Technology Beijing, Foshan 528399, Guangdong, China
| | - Minghong Li
- School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Sichen Zhang
- School of Computer and Communication Engineering, Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Yang Chen
- The State Key Laboratory of Medical Molecular Biology, Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, School of Basic Medicine, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing 100005, China
| |
Collapse
|
9
|
Abstract
Eukaryotic genomes are structurally organized via the formation of multiple loops that create gene expression regulatory units called topologically associating domains (TADs). Here we revealed the KSHV TAD structure at 500 bp resolution and constructed a 3D KSHV genomic structural model with 2 kb binning. The latent KSHV genome formed very similar genomic architectures in three different naturally infected PEL cell lines and in an experimentally infected epithelial cell line. The majority of the TAD boundaries were occupied by structural maintenance of chromosomes (SMC1) cohesin complex and CCCTC-binding factor (CTCF), and the KSHV transactivator was recruited to those sites during reactivation. Triggering KSHV gene expression decreased prewired genomic loops within the regulatory unit, while contacts extending outside of regulatory borders increased, leading to formation of a larger regulatory unit with a shift from repressive to active compartments (B to A). The 3D genomic structural model proposes that the immediate early promoter region is localized on the periphery of the 3D viral genome during latency, while highly inducible noncoding RNA regions moved toward the inner space of the structure, resembling the configuration of a "bird cage" during reactivation. The compartment-like properties of viral episomal chromatin structure and its reorganization during the transition from latency may help facilitate viral gene transcription. IMPORTANCE The 3D architecture of chromatin allows for efficient arrangement, expression, and replication of genetic material. The genomes of all organisms studied to date have been found to be organized through some form of tiered domain structures. However, the architectural framework of the genomes of large double-stranded DNA viruses such as the herpesvirus family has not been reported. Prior studies with Kaposi's sarcoma-associated herpesvirus (KSHV) have indicated that the viral chromatin shares many biological properties exhibited by the host cell genome, essentially behaving as a mini human chromosome. Thus, we hypothesized that the KSHV genome may be organized in a similar manner. In this report, we describe the domain structure of the latent and lytic KSHV genome at 500 bp resolution and present a 3D genomic structural model for KSHV under each condition. These results add new insights into the complex regulation of the viral life cycle.
Collapse
|
10
|
Hancock M, Peulen TO, Webb B, Poon B, Fraser JS, Adams P, Sali A. Integration of software tools for integrative modeling of biomolecular systems. J Struct Biol 2022; 214:107841. [PMID: 35149213 PMCID: PMC9278553 DOI: 10.1016/j.jsb.2022.107841] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 01/28/2022] [Accepted: 02/04/2022] [Indexed: 12/31/2022]
Abstract
Integrative modeling computes a model based on varied types of input information, be it from experiments or prior models. Often, a type of input information will be best handled by a specific modeling software package. In such a case, we desire to integrate our integrative modeling software package, Integrative Modeling Platform (IMP), with software specialized to the computational demands of the modeling problem at hand. After several attempts, however, we have concluded that even in collaboration with the software's developers, integration is either impractical or impossible. The reasons for the intractability of integration include software incompatibilities, differing modeling logic, the costs of collaboration, and academic incentives. In the integrative modeling software ecosystem, several large modeling packages exist with often redundant tools. We reason, therefore, that the other development groups have similarly concluded that the benefit of integration does not justify the cost. As a result, modelers are often restricted to the set of tools within a single software package. The inability to integrate tools from distinct software negatively impacts the quality of the models and the efficiency of the modeling. As the complexity of modeling problems grows, we seek to galvanize developers and modelers to consider the long-term benefit that software interoperability yields. In this article, we formulate a demonstrative set of software standards for implementing a model search using tools from independent software packages and discuss our efforts to integrate IMP and the crystallography suite Phenix within the Bayesian modeling framework.
Collapse
Affiliation(s)
- Matthew Hancock
- Biophysics Graduate Program, University of California, San Francisco, MC 2240 1600 16th St, San Francisco, CA 94143, United States; Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States.
| | - Thomas-Otavio Peulen
- Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States.
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States.
| | - Billy Poon
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Building 33 1 Cyclotron Rd, Berkeley, CA 94270, United States.
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States; Quantitative Biosciences Institute (QBI), University of California, San Francisco, 1700 4th St, San Francisco, CA, United States.
| | - Paul Adams
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Building 33 1 Cyclotron Rd, Berkeley, CA 94270, United States; Department of Bioengineering, University of California, Berkeley, MC 1762 306 Stanley Hall, Berkeley, CA 94720, United States.
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, UCSF Box 0775 1700 4th St, University of California, San Francisco, San Francisco, CA 94158, United States; Department of Pharmaceutical Chemistry, University of California, San Francisco, UCSF Box 2880 600 16th St, San Francisco, CA 94143, United States; Quantitative Biosciences Institute (QBI), University of California, San Francisco, 1700 4th St, San Francisco, CA, United States.
| |
Collapse
|
11
|
Collins B, Oluwadare O, Brown P. ChromeBat: A Bio-Inspired Approach to 3D Genome Reconstruction. Genes (Basel) 2021; 12:1757. [PMID: 34828363 PMCID: PMC8617892 DOI: 10.3390/genes12111757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 10/28/2021] [Accepted: 11/01/2021] [Indexed: 11/20/2022] Open
Abstract
With the advent of Next Generation Sequencing and the Hi-C experiment, high quality genome-wide contact data are becoming increasingly available. These data represents an empirical measure of how a genome interacts inside the nucleus. Genome conformation is of particular interest as it has been experimentally shown to be a driving force for many genomic functions from regulation to transcription. Thus, the Three Dimensional-Genome Reconstruction Problem (3D-GRP) seeks to take Hi-C data and produces a complete physical genome structure as it appears in the nucleus for genomic analysis. We propose and develop a novel method to solve the Chromosome and Genome Reconstruction problem based on the Bat Algorithm (BA) which we called ChromeBat. We demonstrate on real Hi-C data that ChromeBat is capable of state-of-the-art performance. Additionally, the domain of Genome Reconstruction has been criticized for lacking algorithmic diversity, and the bio-inspired nature of ChromeBat contributes algorithmic diversity to the problem domain. ChromeBat is an effective approach for solving the Genome Reconstruction Problem.
Collapse
Affiliation(s)
| | - Oluwatosin Oluwadare
- Department of Computer Science, University of Colorado, Colorado Springs, CO 80918, USA; (B.C.); (P.B.)
| | | |
Collapse
|
12
|
MacKay K, Kusalik A. Computational methods for predicting 3D genomic organization from high-resolution chromosome conformation capture data. Brief Funct Genomics 2021; 19:292-308. [PMID: 32353112 PMCID: PMC7388788 DOI: 10.1093/bfgp/elaa004] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 01/30/2020] [Accepted: 02/07/2020] [Indexed: 12/19/2022] Open
Abstract
The advent of high-resolution chromosome conformation capture assays (such as 5C, Hi-C and Pore-C) has allowed for unprecedented sequence-level investigations into the structure-function relationship of the genome. In order to comprehensively understand this relationship, computational tools are required that utilize data generated from these assays to predict 3D genome organization (the 3D genome reconstruction problem). Many computational tools have been developed that answer this need, but a comprehensive comparison of their underlying algorithmic approaches has not been conducted. This manuscript provides a comprehensive review of the existing computational tools (from November 2006 to September 2019, inclusive) that can be used to predict 3D genome organizations from high-resolution chromosome conformation capture data. Overall, existing tools were found to use a relatively small set of algorithms from one or more of the following categories: dimensionality reduction, graph/network theory, maximum likelihood estimation (MLE) and statistical modeling. Solutions in each category are far from maturity, and the breadth and depth of various algorithmic categories have not been fully explored. While the tools for predicting 3D structure for a genomic region or single chromosome are diverse, there is a general lack of algorithmic diversity among computational tools for predicting the complete 3D genome organization from high-resolution chromosome conformation capture data.
Collapse
|
13
|
Zha M, Wang N, Zhang C, Wang Z. Inferring Single-Cell 3D Chromosomal Structures Based on the Lennard-Jones Potential. Int J Mol Sci 2021; 22:ijms22115914. [PMID: 34072879 PMCID: PMC8199262 DOI: 10.3390/ijms22115914] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 05/23/2021] [Accepted: 05/28/2021] [Indexed: 11/16/2022] Open
Abstract
Reconstructing three-dimensional (3D) chromosomal structures based on single-cell Hi-C data is a challenging scientific problem due to the extreme sparseness of the single-cell Hi-C data. In this research, we used the Lennard-Jones potential to reconstruct both 500 kb and high-resolution 50 kb chromosomal structures based on single-cell Hi-C data. A chromosome was represented by a string of 500 kb or 50 kb DNA beads and put into a 3D cubic lattice for simulations. A 2D Gaussian function was used to impute the sparse single-cell Hi-C contact matrices. We designed a novel loss function based on the Lennard-Jones potential, in which the ε value, i.e., the well depth, was used to indicate how stable the binding of every pair of beads is. For the bead pairs that have single-cell Hi-C contacts and their neighboring bead pairs, the loss function assigns them stronger binding stability. The Metropolis-Hastings algorithm was used to try different locations for the DNA beads, and simulated annealing was used to optimize the loss function. We proved the correctness and validness of the reconstructed 3D structures by evaluating the models according to multiple criteria and comparing the models with 3D-FISH data.
Collapse
Affiliation(s)
- Mengsheng Zha
- School of Computing Sciences and Computer Engineering, University of Southern Mississippi, 118 College Dr, Hattiesburg, MS 39406, USA; (M.Z.); (C.Z.)
| | - Nan Wang
- Department of Computer Science, New Jersey City University, 2039 Kennedy Blvd, Jersey City, NJ 07305, USA;
| | - Chaoyang Zhang
- School of Computing Sciences and Computer Engineering, University of Southern Mississippi, 118 College Dr, Hattiesburg, MS 39406, USA; (M.Z.); (C.Z.)
| | - Zheng Wang
- Department of Computer Science, University of Miami, 1364 Memorial Drive, Coral Gables, FL 33124, USA
- Correspondence:
| |
Collapse
|
14
|
Mendieta-Esteban J, Di Stefano M, Castillo D, Farabella I, Marti-Renom MA. 3D reconstruction of genomic regions from sparse interaction data. NAR Genom Bioinform 2021; 3:lqab017. [PMID: 33778492 PMCID: PMC7985034 DOI: 10.1093/nargab/lqab017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Revised: 02/08/2021] [Accepted: 03/02/2021] [Indexed: 01/04/2023] Open
Abstract
Chromosome conformation capture (3C) technologies measure the interaction frequency between pairs of chromatin regions within the nucleus in a cell or a population of cells. Some of these 3C technologies retrieve interactions involving non-contiguous sets of loci, resulting in sparse interaction matrices. One of such 3C technologies is Promoter Capture Hi-C (pcHi-C) that is tailored to probe only interactions involving gene promoters. As such, pcHi-C provides sparse interaction matrices that are suitable to characterize short- and long-range enhancer-promoter interactions. Here, we introduce a new method to reconstruct the chromatin structural (3D) organization from sparse 3C-based datasets such as pcHi-C. Our method allows for data normalization, detection of significant interactions and reconstruction of the full 3D organization of the genomic region despite of the data sparseness. Specifically, it builds, with as low as the 2-3% of the data from the matrix, reliable 3D models of similar accuracy of those based on dense interaction matrices. Furthermore, the method is sensitive enough to detect cell-type-specific 3D organizational features such as the formation of different networks of active gene communities.
Collapse
Affiliation(s)
- Julen Mendieta-Esteban
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Marco Di Stefano
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - David Castillo
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Irene Farabella
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - Marc A Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| |
Collapse
|
15
|
Di Stefano M, Nützmann HW, Marti-Renom M, Jost D. Polymer modelling unveils the roles of heterochromatin and nucleolar organizing regions in shaping 3D genome organization in Arabidopsis thaliana. Nucleic Acids Res 2021; 49:1840-1858. [PMID: 33444439 PMCID: PMC7913674 DOI: 10.1093/nar/gkaa1275] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Revised: 11/16/2020] [Accepted: 01/13/2021] [Indexed: 01/10/2023] Open
Abstract
The 3D genome is characterized by a complex organization made of genomic and epigenomic layers with profound implications on gene regulation and cell function. However, the understanding of the fundamental mechanisms driving the crosstalk between nuclear architecture and (epi)genomic information is still lacking. The plant Arabidopsis thaliana is a powerful model organism to address these questions owing to its compact genome for which we have a rich collection of microscopy, chromosome conformation capture (Hi-C) and ChIP-seq experiments. Using polymer modelling, we investigate the roles of nucleolus formation and epigenomics-driven interactions in shaping the 3D genome of A. thaliana. By validation of several predictions with published data, we demonstrate that self-attracting nucleolar organizing regions and repulsive constitutive heterochromatin are major mechanisms to regulate the organization of chromosomes. Simulations also suggest that interphase chromosomes maintain a partial structural memory of the V-shapes, typical of (sub)metacentric chromosomes in anaphase. Additionally, self-attraction between facultative heterochromatin regions facilitates the formation of Polycomb bodies hosting H3K27me3-enriched gene-clusters. Since nucleolus and heterochromatin are highly-conserved in eukaryotic cells, our findings pave the way for a comprehensive characterization of the generic principles that are likely to shape and regulate the 3D genome in many species.
Collapse
Affiliation(s)
- Marco Di Stefano
- CNAG-CRG, The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Hans-Wilhelm Nützmann
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, UK
| | - Marc A Marti-Renom
- CNAG-CRG, The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
- CRG, The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- ICREA, Barcelona, Spain
| | - Daniel Jost
- Université de Lyon, ENS de Lyon, Univ Claude Bernard, CNRS, Laboratoire de Biologie et Modélisation de la Cellule, Lyon, France
| |
Collapse
|
16
|
Franzini S, Di Stefano M, Micheletti C. essHi-C: Essential component analysis of Hi-C matrices. Bioinformatics 2021; 37:2088-2094. [PMID: 33523102 DOI: 10.1093/bioinformatics/btab062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 01/18/2021] [Accepted: 01/28/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Hi-C matrices are cornerstones for qualitative and quantitative studies of genome folding, from its territorial organization to compartments and topological domains. The high dynamic range of genomic distances probed in Hi-C assays reflects in an inherent stochastic background of the interactions matrices, which inevitably convolve the features of interest with largely non-specific ones. RESULTS Here we introduce and discuss essHi-C, a method to isolate the specific, or essential component of Hi-C matrices from the non-specific portion of the spectrum that is compatible with random matrices. Systematic comparisons show that essHi-C improves the clarity of the interaction patterns, enhances the robustness against sequencing depth of topologically associating domains identification, allows the unsupervised clustering of experiments in different cell lines and recovers the cell-cycle phasing of single-cells based on Hi-C data. Thus, essHi-C provides means for isolating significant biological and physical features from Hi-C matrices. AVAILABILITY The essHi-C software package is available at: https://github.com/stefanofranzini/essHIC . SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Stefano Franzini
- SISSA - Scuola internazionale Superiore di Studi Avanzati, Via Bonomea 265, I-34077 Trieste, Italy
| | - Marco Di Stefano
- Structural genomics Group, CNAG-CRG Centre Nacional d'Análisi Genómica - Centre de Regulació Genómica, Barcelona, 08028, Spain
| | - Cristian Micheletti
- SISSA - Scuola internazionale Superiore di Studi Avanzati, Via Bonomea 265, I-34077 Trieste, Italy
| |
Collapse
|
17
|
Dynamics of genome architecture and chromatin function during human B cell differentiation and neoplastic transformation. Nat Commun 2021; 12:651. [PMID: 33510161 PMCID: PMC7844026 DOI: 10.1038/s41467-020-20849-y] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 12/21/2020] [Indexed: 02/06/2023] Open
Abstract
To investigate the three-dimensional (3D) genome architecture across normal B cell differentiation and in neoplastic cells from different subtypes of chronic lymphocytic leukemia and mantle cell lymphoma patients, here we integrate in situ Hi-C and nine additional omics layers. Beyond conventional active (A) and inactive (B) compartments, we uncover a highly-dynamic intermediate compartment enriched in poised and polycomb-repressed chromatin. During B cell development, 28% of the compartments change, mostly involving a widespread chromatin activation from naive to germinal center B cells and a reversal to the naive state upon further maturation into memory B cells. B cell neoplasms are characterized by both entity and subtype-specific alterations in 3D genome organization, including large chromatin blocks spanning key disease-specific genes. This study indicates that 3D genome interactions are extensively modulated during normal B cell differentiation and that the genome of B cell neoplasias acquires a tumor-specific 3D genome architecture.
Collapse
|
18
|
Sali A. From integrative structural biology to cell biology. J Biol Chem 2021; 296:100743. [PMID: 33957123 PMCID: PMC8203844 DOI: 10.1016/j.jbc.2021.100743] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/09/2021] [Accepted: 04/30/2021] [Indexed: 12/16/2022] Open
Abstract
Integrative modeling is an increasingly important tool in structural biology, providing structures by combining data from varied experimental methods and prior information. As a result, molecular architectures of large, heterogeneous, and dynamic systems, such as the ∼52-MDa Nuclear Pore Complex, can be mapped with useful accuracy, precision, and completeness. Key challenges in improving integrative modeling include expanding model representations, increasing the variety of input data and prior information, quantifying a match between input information and a model in a Bayesian fashion, inventing more efficient structural sampling, as well as developing better model validation, analysis, and visualization. In addition, two community-level challenges in integrative modeling are being addressed under the auspices of the Worldwide Protein Data Bank (wwPDB). First, the impact of integrative structures is maximized by PDB-Development, a prototype wwPDB repository for archiving, validating, visualizing, and disseminating integrative structures. Second, the scope of structural biology is expanded by linking the wwPDB resource for integrative structures with archives of data that have not been generally used for structure determination but are increasingly important for computing integrative structures, such as data from various types of mass spectrometry, spectroscopy, optical microscopy, proteomics, and genetics. To address the largest of modeling problems, a type of integrative modeling called metamodeling is being developed; metamodeling combines different types of input models as opposed to different types of data to compute an output model. Collectively, these developments will facilitate the structural biology mindset in cell biology and underpin spatiotemporal mapping of the entire cell.
Collapse
Affiliation(s)
- Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, the Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California, USA.
| |
Collapse
|
19
|
Meluzzi D, Arya G. Computational approaches for inferring 3D conformations of chromatin from chromosome conformation capture data. Methods 2020; 181-182:24-34. [PMID: 31470090 PMCID: PMC7044057 DOI: 10.1016/j.ymeth.2019.08.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 06/24/2019] [Accepted: 08/23/2019] [Indexed: 02/08/2023] Open
Abstract
Chromosome conformation capture (3C) and its variants are powerful experimental techniques for probing intra- and inter-chromosomal interactions within cell nuclei at high resolution and in a high-throughput, quantitative manner. The contact maps derived from such experiments provide an avenue for inferring the 3D spatial organization of the genome. This review provides an overview of the various computational methods developed in the past decade for addressing the very important but challenging problem of deducing the detailed 3D structure or structure population of chromosomal domains, chromosomes, and even entire genomes from 3C contact maps.
Collapse
Affiliation(s)
- Dario Meluzzi
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, United States
| | - Gaurav Arya
- Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC 27708, United States.
| |
Collapse
|
20
|
Abstract
Chromatin Conformation Capture techniques have unveiled several layers of chromosome organization such as the segregation in compartments, the folding in topologically associating domains (TADs), and site-specific looping interactions. The discovery of this genome hierarchical organization emerged from the computational analysis of chromatin capture data. With the increasing availability of such data, automatic pipelines for the robust comparison, grouping, and classification of multiple experiments are needed. Here we present a pipeline based on the TADbit framework that emphasizes reproducibility, automation, quality check, and statistical robustness. This comprehensive modular pipeline covers all the steps from the sequencing products to the visualization of reconstructed 3D models of the chromatin.
Collapse
|
21
|
Di Stefano M, Stadhouders R, Farabella I, Castillo D, Serra F, Graf T, Marti-Renom MA. Transcriptional activation during cell reprogramming correlates with the formation of 3D open chromatin hubs. Nat Commun 2020; 11:2564. [PMID: 32444798 PMCID: PMC7244774 DOI: 10.1038/s41467-020-16396-1] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Accepted: 05/01/2020] [Indexed: 12/22/2022] Open
Abstract
Chromosome structure is a crucial regulatory factor for a wide range of nuclear processes. Chromosome conformation capture (3C)-based experiments combined with computational modelling are pivotal for unveiling 3D chromosome structure. Here, we introduce TADdyn, a tool that integrates time-course 3C data, restraint-based modelling, and molecular dynamics to simulate the structural rearrangements of genomic loci in a completely data-driven way. We apply TADdyn on in situ Hi-C time-course experiments studying the reprogramming of murine B cells to pluripotent cells, and characterize the structural rearrangements that take place upon changes in the transcriptional state of 21 genomic loci of diverse expression dynamics. By measuring various structural and dynamical properties, we find that during gene activation, the transcription starting site contacts with open and active regions in 3D chromatin domains. We propose that these 3D hubs of open and active chromatin may constitute a general feature to trigger and maintain gene transcription.
Collapse
Affiliation(s)
- Marco Di Stefano
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain. .,Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain.
| | - Ralph Stadhouders
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain.,Department of Pulmonary Medicine and Department of Cell Biology, Erasmus MC, Rotterdam, the Netherlands
| | - Irene Farabella
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain.,Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
| | - David Castillo
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain.,Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain
| | - François Serra
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain.,Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain.,Computational Biology Group-Barcelona Supercomputing Center (BSC), 08034, Barcelona, Spain
| | - Thomas Graf
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain.
| | - Marc A Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028, Barcelona, Spain. .,Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003, Barcelona, Spain. .,Universitat Pompeu Fabra (UPF), 08002, Barcelona, Spain. .,ICREA, Pg. Lluís Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
22
|
Segal MR, Fletez-Brant K. Assessing stationary distributions derived from chromatin contact maps. BMC Bioinformatics 2020; 21:73. [PMID: 32093610 PMCID: PMC7041182 DOI: 10.1186/s12859-020-3424-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 02/17/2020] [Indexed: 11/20/2022] Open
Abstract
Background The spatial configuration of chromosomes is essential to various cellular processes, notably gene regulation, while architecture related alterations, such as translocations and gene fusions, are often cancer drivers. Thus, eliciting chromatin conformation is important, yet challenging due to compaction, dynamics and scale. However, a variety of recent assays, in particular Hi-C, have generated new details of chromatin structure, spawning a number of novel biological findings. Many findings have resulted from analyses on the level of native contact data as generated by the assays. Alternatively, reconstruction based approaches often proceed by first converting contact frequencies into distances, then generating a three dimensional (3D) chromatin configuration that best recapitulates these distances. Subsequent analyses can enrich contact level analyses via superposition of genomic attributes on the reconstruction. But, such advantages depend on the accuracy of the reconstruction which, absent gold standards, is inherently difficult to assess. Attempts at accuracy evaluation have relied on simulation and/or FISH imaging that typically features a handful of low resolution probes. While newly advanced multiplexed FISH imaging offers possibilities for refined 3D reconstruction accuracy evaluation, availability of such data is limited due to assay complexity and the resolution thereof is appreciably lower than the reconstructions being assessed. Accordingly, there is demand for new methods of reconstruction accuracy appraisal. Results Here we explore the potential of recently proposed stationary distributions, hereafter StatDns, derived from Hi-C contact matrices, to serve as a basis for reconstruction accuracy assessment. Current usage of such StatDns has focussed on the identification of highly interactive regions (HIRs): computationally defined regions of the genome purportedly involved in numerous long-range intra-chromosomal contacts. Consistent identification of HIRs would be informative with respect to inferred 3D architecture since the corresponding regions of the reconstruction would have an elevated number of k nearest neighbors (kNNs). More generally, we anticipate a monotone decreasing relationship between StatDn values and kNN distances. After initially evaluating the reproducibility of StatDns across replicate Hi-C data sets, we use this implied StatDn - kNN relationship to gauge the utility of StatDns for reconstruction validation, making recourse to both real and simulated examples. Conclusions Our analyses demonstrate that, as constructed, StatDns do not provide a suitable measure for assessing the accuracy of 3D genome reconstructions. Whether this is attributable to specific choices surrounding normalization in defining StatDns or to the logic underlying their very formulation remains to be determined.
Collapse
Affiliation(s)
- Mark R Segal
- Division of Bioinformatics, Department of Epidemiology and Biostatistics, UCSF, 550 16th Street, San Francisco, 94158, CA, USA.
| | - Kipper Fletez-Brant
- Computational Biology, 23andMe, Inc., 899 West Evelyn Avenue, Mountain View, 94041, CA, USA
| |
Collapse
|
23
|
Berman HM, Adams PD, Bonvin AA, Burley SK, Carragher B, Chiu W, DiMaio F, Ferrin TE, Gabanyi MJ, Goddard TD, Griffin PR, Haas J, Hanke CA, Hoch JC, Hummer G, Kurisu G, Lawson CL, Leitner A, Markley JL, Meiler J, Montelione GT, Phillips GN, Prisner T, Rappsilber J, Schriemer DC, Schwede T, Seidel CAM, Strutzenberg TS, Svergun DI, Tajkhorshid E, Trewhella J, Vallat B, Velankar S, Vuister GW, Webb B, Westbrook JD, White KL, Sali A. Federating Structural Models and Data: Outcomes from A Workshop on Archiving Integrative Structures. Structure 2019; 27:1745-1759. [PMID: 31780431 DOI: 10.1016/j.str.2019.11.002] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 10/31/2019] [Accepted: 11/06/2019] [Indexed: 12/23/2022]
Abstract
Structures of biomolecular systems are increasingly computed by integrative modeling. In this approach, a structural model is constructed by combining information from multiple sources, including varied experimental methods and prior models. In 2019, a Workshop was held as a Biophysical Society Satellite Meeting to assess progress and discuss further requirements for archiving integrative structures. The primary goal of the Workshop was to build consensus for addressing the challenges involved in creating common data standards, building methods for federated data exchange, and developing mechanisms for validating integrative structures. The summary of the Workshop and the recommendations that emerged are presented here.
Collapse
Affiliation(s)
- Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA; Bridge Institute, Michelson Center, University of Southern California, Los Angeles, CA 90089, USA.
| | - Paul D Adams
- Physical Biosciences Division, Lawrence Berkeley Laboratory, Berkeley, CA 94720-8235, USA; Department of Bioengineering, University of California-Berkeley, Berkeley, CA 94720, USA
| | - Alexandre A Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, the Netherlands
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences and San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA; Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08903, USA
| | - Bridget Carragher
- Simons Electron Microscopy Center, New York Structural Biology Center, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Wah Chiu
- Department of Bioengineering, Department of Microbiology and Immunology, Stanford University, Stanford, CA 94305-5447, USA; SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | - Margaret J Gabanyi
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Thomas D Goddard
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | | | - Juergen Haas
- Swiss Institute of Bioinformatics and Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Christian A Hanke
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Jeffrey C Hoch
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, CT 06030, USA
| | - Gerhard Hummer
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438 Frankfurt am Main, Germany; Institute for Biophysics, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
| | - Genji Kurisu
- Protein Data Bank Japan (PDBj), Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - Catherine L Lawson
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Alexander Leitner
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - John L Markley
- BioMagResBank (BMRB), Biochemistry Department, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, 465 21st Avenue South, Nashville, TN 37221, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytech Institute, Troy, NY 12180, USA
| | - George N Phillips
- BioSciences at Rice and Department of Chemistry, Rice University, Houston, TX 77251, USA
| | - Thomas Prisner
- Institute of Physical and Theoretical Chemistry and Center of Biomolecular Magnetic Resonance, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
| | - Juri Rappsilber
- Wellcome Trust Centre for Cell Biology, Edinburgh EH9 3JR, Scotland
| | - David C Schriemer
- Department of Biochemistry & Molecular Biology, Robson DNA Science Centre, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Torsten Schwede
- Swiss Institute of Bioinformatics and Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Claus A M Seidel
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | | | - Dmitri I Svergun
- European Molecular Biology Laboratory (EMBL), Hamburg Outstation, Notkestrasse 85, 22607 Hamburg, Germany
| | - Emad Tajkhorshid
- Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Jill Trewhella
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia; Department of Chemistry, University of Utah, Salt Lake City, UT 84112, USA
| | - Brinda Vallat
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire CB10 1SD, UK
| | - Geerten W Vuister
- Department of Molecular and Cell Biology, Leicester Institute of Structural and Chemical Biology, University of Leicester, Leicester LE1 9HN, UK
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Kate L White
- Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA; Bridge Institute, Michelson Center, University of Southern California, Los Angeles, CA 90089, USA
| | - Andrej Sali
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA; Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; California Institute for Quantitative Biosciences, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
24
|
Yildirim A, Feig M. High-resolution 3D models of Caulobacter crescentus chromosome reveal genome structural variability and organization. Nucleic Acids Res 2019. [PMID: 29529244 PMCID: PMC5934669 DOI: 10.1093/nar/gky141] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
High-resolution three-dimensional models of Caulobacter crescentus nucleoid structures were generated via a multi-scale modeling protocol. Models were built as a plectonemically supercoiled circular DNA and by incorporating chromosome conformation capture based data to generate an ensemble of base pair resolution models consistent with the experimental data. Significant structural variability was found with different degrees of bending and twisting but with overall similar topologies and shapes that are consistent with C. crescentus cell dimensions. The models allowed a direct mapping of the genomic sequence onto the three-dimensional nucleoid structures. Distinct spatial distributions were found for several genomic elements such as AT-rich sequence elements where nucleoid associated proteins (NAPs) are likely to bind, promoter sites, and some genes with common cellular functions. These findings shed light on the correlation between the spatial organization of the genome and biological functions.
Collapse
Affiliation(s)
- Asli Yildirim
- Department of Chemistry, Michigan State University, East Lansing, MI 48824, USA
| | - Michael Feig
- Department of Biochemistry & Molecular Biology, Michigan State University, MI 48824, USA
| |
Collapse
|
25
|
Zhu G, Deng W, Hu H, Ma R, Zhang S, Yang J, Peng J, Kaplan T, Zeng J. Reconstructing spatial organizations of chromosomes through manifold learning. Nucleic Acids Res 2019; 46:e50. [PMID: 29408992 PMCID: PMC5934626 DOI: 10.1093/nar/gky065] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2017] [Accepted: 01/23/2018] [Indexed: 01/09/2023] Open
Abstract
Decoding the spatial organizations of chromosomes has crucial implications for studying eukaryotic gene regulation. Recently, chromosomal conformation capture based technologies, such as Hi-C, have been widely used to uncover the interaction frequencies of genomic loci in a high-throughput and genome-wide manner and provide new insights into the folding of three-dimensional (3D) genome structure. In this paper, we develop a novel manifold learning based framework, called GEM (Genomic organization reconstructor based on conformational Energy and Manifold learning), to reconstruct the three-dimensional organizations of chromosomes by integrating Hi-C data with biophysical feasibility. Unlike previous methods, which explicitly assume specific relationships between Hi-C interaction frequencies and spatial distances, our model directly embeds the neighboring affinities from Hi-C space into 3D Euclidean space. Extensive validations demonstrated that GEM not only greatly outperformed other state-of-art modeling methods but also provided a physically and physiologically valid 3D representations of the organizations of chromosomes. Furthermore, we for the first time apply the modeled chromatin structures to recover long-range genomic interactions missing from original Hi-C data.
Collapse
Affiliation(s)
- Guangxiang Zhu
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Wenxuan Deng
- Department of Biostatistics, Yale University, New Haven, CT, USA
| | - Hailin Hu
- School of Medicine, Tsinghua University, Beijing 100084, China
| | - Rui Ma
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Sai Zhang
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Jinglin Yang
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Jian Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tommy Kaplan
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, 91904, Israel
| | - Jianyang Zeng
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
26
|
RNA proximity sequencing reveals the spatial organization of the transcriptome in the nucleus. Nat Biotechnol 2019; 37:793-802. [PMID: 31267103 DOI: 10.1038/s41587-019-0166-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Accepted: 05/22/2019] [Indexed: 02/01/2023]
Abstract
The global, three-dimensional organization of RNA molecules in the nucleus is difficult to determine using existing methods. Here we introduce Proximity RNA-seq, which identifies colocalization preferences for pairs or groups of nascent and fully transcribed RNAs in the nucleus. Proximity RNA-seq is based on massive-throughput RNA barcoding of subnuclear particles in water-in-oil emulsion droplets, followed by cDNA sequencing. Our results show RNAs of varying tissue-specificity of expression, speed of RNA polymerase elongation and extent of alternative splicing positioned at varying distances from nucleoli. The simultaneous detection of multiple RNAs in proximity to each other distinguishes RNA-dense from sparse compartments. Application of Proximity RNA-seq will facilitate study of the spatial organization of transcripts in the nucleus, including non-coding RNAs, and its functional relevance.
Collapse
|
27
|
Vallat B, Webb B, Westbrook J, Sali A, Berman HM. Archiving and disseminating integrative structure models. JOURNAL OF BIOMOLECULAR NMR 2019; 73:385-398. [PMID: 31278630 PMCID: PMC6692293 DOI: 10.1007/s10858-019-00264-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 06/25/2019] [Indexed: 05/04/2023]
Abstract
Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format.
Collapse
Affiliation(s)
- Brinda Vallat
- Institute for Quantitative Biomedicine, Piscataway, USA
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA
| | - John Westbrook
- Institute for Quantitative Biomedicine, Piscataway, USA
- RCSB Protein Data Bank, Piscataway, USA
| | - Andrej Sali
- RCSB Protein Data Bank, Piscataway, USA.
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Lead Contacts, San Francisco, USA.
| | - Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
- Lead Contacts, Piscataway, USA.
| |
Collapse
|
28
|
Oluwadare O, Highsmith M, Cheng J. An Overview of Methods for Reconstructing 3-D Chromosome and Genome Structures from Hi-C Data. Biol Proced Online 2019; 21:7. [PMID: 31049033 PMCID: PMC6482566 DOI: 10.1186/s12575-019-0094-0] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 04/01/2019] [Indexed: 01/08/2023] Open
Abstract
Over the past decade, methods for predicting three-dimensional (3-D) chromosome and genome structures have proliferated. This has been primarily due to the development of high-throughput, next-generation chromosome conformation capture (3C) technologies, which have provided next-generation sequencing data about chromosome conformations in order to map the 3-D genome structure. The introduction of the Hi-C technique-a variant of the 3C method-has allowed researchers to extract the interaction frequency (IF) for all loci of a genome at high-throughput and at a genome-wide scale. In this review we describe, categorize, and compare the various methods developed to map chromosome and genome structures from 3C data-particularly Hi-C data. We summarize the improvements introduced by these methods, describe the approach used for method evaluation, and discuss how these advancements shape the future of genome structure construction.
Collapse
Affiliation(s)
- Oluwatosin Oluwadare
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Max Highsmith
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
- Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| |
Collapse
|
29
|
Caudai C, Salerno E, Zoppe M, Tonazzini A. Estimation of the Spatial Chromatin Structure Based on a Multiresolution Bead-Chain Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:550-559. [PMID: 29994172 DOI: 10.1109/tcbb.2018.2791439] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
We present a method to infer 3D chromatin configurations from Chromosome Conformation Capture data. Quite a few methods have been proposed to estimate the structure of the nuclear dna in homogeneous populations of cells from this kind of data. Many of them transform contact frequencies into euclidean distances between pairs of chromatin fragments, and then reconstruct the structure by solving a distance-to-geometry problem. To avoid inconsistencies, our method is based on a score function that does not require any frequency-to-distance translation. We propose a multiscale chromatin model where the chromatin fiber is suitably partitioned at each scale. The partial structures are estimated independently, and connected to rebuild the whole fiber. Our score function consists of a data-fit part and a penalty part, balanced automatically at each scale and each subchain. The penalty part enforces soft geometric constraints. As many different structures can fit the data, our sampling strategy produces a set of solutions with similar scores. The procedure contains a few parameters, independent of both the scale and the genomic segment treated. The partition of the fiber, along with intrinsically parallel parts, make this method computationally efficient. Results from human genome data support the biological plausibility of our solutions.
Collapse
|
30
|
Nir G, Farabella I, Pérez Estrada C, Ebeling CG, Beliveau BJ, Sasaki HM, Lee SD, Nguyen SC, McCole RB, Chattoraj S, Erceg J, AlHaj Abed J, Martins NMC, Nguyen HQ, Hannan MA, Russell S, Durand NC, Rao SSP, Kishi JY, Soler-Vila P, Di Pierro M, Onuchic JN, Callahan SP, Schreiner JM, Stuckey JA, Yin P, Aiden EL, Marti-Renom MA, Wu CT. Walking along chromosomes with super-resolution imaging, contact maps, and integrative modeling. PLoS Genet 2018; 14:e1007872. [PMID: 30586358 PMCID: PMC6324821 DOI: 10.1371/journal.pgen.1007872] [Citation(s) in RCA: 175] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 01/08/2019] [Accepted: 12/04/2018] [Indexed: 12/13/2022] Open
Abstract
Chromosome organization is crucial for genome function. Here, we present a method for visualizing chromosomal DNA at super-resolution and then integrating Hi-C data to produce three-dimensional models of chromosome organization. Using the super-resolution microscopy methods of OligoSTORM and OligoDNA-PAINT, we trace 8 megabases of human chromosome 19, visualizing structures ranging in size from a few kilobases to over a megabase. Focusing on chromosomal regions that contribute to compartments, we discover distinct structures that, in spite of considerable variability, can predict whether such regions correspond to active (A-type) or inactive (B-type) compartments. Imaging through the depths of entire nuclei, we capture pairs of homologous regions in diploid cells, obtaining evidence that maternal and paternal homologous regions can be differentially organized. Finally, using restraint-based modeling to integrate imaging and Hi-C data, we implement a method-integrative modeling of genomic regions (IMGR)-to increase the genomic resolution of our traces to 10 kb.
Collapse
MESH Headings
- Cells, Cultured
- Chromosome Painting/methods
- Chromosome Structures/chemistry
- Chromosome Structures/genetics
- Chromosome Structures/ultrastructure
- Chromosome Walking/methods
- Chromosomes, Human, Pair 19/chemistry
- Chromosomes, Human, Pair 19/genetics
- Chromosomes, Human, Pair 19/ultrastructure
- Female
- Fluorescent Dyes
- Humans
- Imaging, Three-Dimensional
- In Situ Hybridization, Fluorescence/methods
- Male
- Models, Genetic
- Oligonucleotide Probes
- Pedigree
Collapse
Affiliation(s)
- Guy Nir
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Irene Farabella
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Cynthia Pérez Estrada
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
| | - Carl G. Ebeling
- Bruker Nano Inc., Salt Lake City, Utah, United States of America
| | - Brian J. Beliveau
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts, United States of America
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Hiroshi M. Sasaki
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts, United States of America
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - S. Dean Lee
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Son C. Nguyen
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Ruth B. McCole
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Shyamtanu Chattoraj
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jelena Erceg
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jumana AlHaj Abed
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Nuno M. C. Martins
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Huy Q. Nguyen
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Mohammed A. Hannan
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Sheikh Russell
- Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Neva C. Durand
- Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Broad Institute of the Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
| | - Suhas S. P. Rao
- Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
- Department of Structural Biology, Stanford University School of Medicine, Stanford, California, United States of America
| | - Jocelyn Y. Kishi
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts, United States of America
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Paula Soler-Vila
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Michele Di Pierro
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
| | - José N. Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
| | | | | | - Jeff A. Stuckey
- Bruker Nano Inc., Middleton, Wisconsin, United States of America
| | - Peng Yin
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts, United States of America
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Erez Lieberman Aiden
- Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, United States of America
- Broad Institute of the Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, Texas, United States of America
| | - Marc A. Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- ICREA, Barcelona, Spain
| | - C.-ting Wu
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, Massachusetts, United States of America
| |
Collapse
|
31
|
Goodstadt MN, Marti-Renom MA. Communicating Genome Architecture: Biovisualization of the Genome, from Data Analysis and Hypothesis Generation to Communication and Learning. J Mol Biol 2018; 431:1071-1087. [PMID: 30419242 DOI: 10.1016/j.jmb.2018.11.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 10/29/2018] [Accepted: 11/01/2018] [Indexed: 01/07/2023]
Abstract
Genome discoveries at the core of biology are made by visual description and exploration of the cell, from microscopic sketches and biochemical mapping to computational analysis and spatial modeling. We outline the experimental and visualization techniques that have been developed recently which capture the three-dimensional interactions regulating how genes are expressed. We detail the challenges faced in integration of the data to portray the components and organization and their dynamic landscape. The goal is more than a single data-driven representation as interactive visualization for de novo research is paramount to decipher insights on genome organization in space.
Collapse
Affiliation(s)
- Mike N Goodstadt
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain; Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain.
| | - Marc A Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain; Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, Barcelona 08010, Spain.
| |
Collapse
|
32
|
Challenges and guidelines toward 4D nucleome data and model standards. Nat Genet 2018; 50:1352-1358. [PMID: 30262815 DOI: 10.1038/s41588-018-0236-3] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 07/19/2018] [Indexed: 11/09/2022]
Abstract
Due to recent advances in experimental and theoretical approaches, the dynamic three-dimensional organization (3D) of the nucleus has become a very active area of research in life sciences. We now understand that the linear genome is folded in ways that may modulate how genes are expressed during the basic functioning of cells. Importantly, it is now possible to build 3D models of how the genome folds within the nucleus and changes over time (4D). Because genome folding influences its function, this opens exciting new possibilities to broaden our understanding of the mechanisms that determine cell fate. However, the rapid evolution of methods and the increasing complexity of data can result in ambiguity and reproducibility challenges, which may hamper the progress of this field. Here, we describe such challenges ahead and provide guidelines to think about strategies for shared standardized validation of experimental 4D nucleome data sets and models.
Collapse
|
33
|
Diament A, Tuller T. Modeling three-dimensional genomic organization in evolution and pathogenesis. Semin Cell Dev Biol 2018; 90:78-93. [PMID: 30030143 DOI: 10.1016/j.semcdb.2018.07.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/08/2018] [Indexed: 12/17/2022]
Abstract
The regulation of gene expression is mediated via the complex three-dimensional (3D) conformation of the genetic material and its interactions with various intracellular factors. Various experimental and computational approaches have been developed in recent years for understating the relation between the 3D conformation of the genome and the phenotypes of cells in normal condition and diseases. In this review, we will discuss novel approaches for analyzing and modeling the 3D genomic conformation, focusing on deciphering disease-causing mutations that affect gene expression. We conclude that as this is a very challenging mission, an important direction should involve the comparative analysis of various 3D models from various organisms or cells.
Collapse
Affiliation(s)
- Alon Diament
- Dept. of Biomedical Engineering, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Tamir Tuller
- Dept. of Biomedical Engineering, Tel Aviv University, Tel Aviv 6997801, Israel; The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv 6997801, Israel.
| |
Collapse
|
34
|
Vallat B, Webb B, Westbrook JD, Sali A, Berman HM. Development of a Prototype System for Archiving Integrative/Hybrid Structure Models of Biological Macromolecules. Structure 2018; 26:894-904.e2. [PMID: 29657133 DOI: 10.1016/j.str.2018.03.011] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 02/16/2018] [Accepted: 03/20/2018] [Indexed: 10/17/2022]
Abstract
Essential processes in biology are carried out by large macromolecular assemblies, whose structures are often difficult to determine by traditional methods. Increasingly, researchers combine measured data and computed information from several complementary methods to obtain "hybrid" or "integrative" structural models of macromolecules and their assemblies. These integrative/hybrid (I/H) models are not archived in the PDB because of the absence of standard data representations and processing mechanisms. Here we present the development of data standards and a prototype system for archiving I/H models. The data standards provide the definitions required for representing I/H models that span multiple spatiotemporal scales and conformational states, as well as spatial restraints derived from different experimental techniques. Based on these data definitions, we have built a prototype system called PDB-Dev, which provides the infrastructure necessary to archive I/H structural models. PDB-Dev is now accepting structures and is open to the community for new submissions.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA 94143, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
35
|
Abstract
The use of 3C-based methods has revealed the importance of the 3D organization of the chromatin for key aspects of genome biology. However, the different caveats of the variants of 3C techniques have limited their scope and the range of scientific fields that could benefit from these approaches. To address these limitations, we present 4Cin, a method to generate 3D models and derive virtual Hi-C (vHi-C) heat maps of genomic loci based on 4C-seq or any kind of 4C-seq-like data, such as those derived from NG Capture-C. 3D genome organization is determined by integrative consideration of the spatial distances derived from as few as four 4C-seq experiments. The 3D models obtained from 4C-seq data, together with their associated vHi-C maps, allow the inference of all chromosomal contacts within a given genomic region, facilitating the identification of Topological Associating Domains (TAD) boundaries. Thus, 4Cin offers a much cheaper, accessible and versatile alternative to other available techniques while providing a comprehensive 3D topological profiling. By studying TAD modifications in genomic structural variants associated to disease phenotypes and performing cross-species evolutionary comparisons of 3D chromatin structures in a quantitative manner, we demonstrate the broad potential and novel range of applications of our method. Chromatin conformation capture (3C) methods have revealed the importance of the 3D organization of the chromatin, which is key to understand many aspects of genome biology. But each of these methods have their own limitations. Here we present 4Cin, a software that generates 3D models of the chromatin from a small number of 4C-seq experiments, a 3C-based method that provides the frequency of contacts between one fragments and the genome (one vs all). These 3D models are used to infer all chromosomal contacts within a given genomic region (many vs many). The contact maps facilitate the identification of Topological Associating Domains boundaries. Our software offers a much cheaper, accessible and versatile alternative to other available techniques while providing a comprehensive 3D topological profiling. We applied our software to two different loci to study modifications in genomic structural variants associated to disease phenotypes and to compare the chromatin organization in two different species in a quantitative manner.
Collapse
|
36
|
Oluwadare O, Zhang Y, Cheng J. A maximum likelihood algorithm for reconstructing 3D structures of human chromosomes from chromosomal contact data. BMC Genomics 2018; 19:161. [PMID: 29471801 PMCID: PMC5824572 DOI: 10.1186/s12864-018-4546-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 02/13/2018] [Indexed: 01/07/2023] Open
Abstract
Background The development of chromosomal conformation capture techniques, particularly, the Hi-C technique, has made the analysis and study of the spatial conformation of a genome an important topic in bioinformatics and computational biology. Aided by high-throughput next generation sequencing techniques, the Hi-C technique can generate genome-wide, large-scale intra- and inter-chromosomal interaction data capable of describing in details the spatial interactions within a genome. These data can be used to reconstruct 3D structures of chromosomes that can be used to study DNA replication, gene regulation, genome interaction, genome folding, and genome function. Results Here, we introduce a maximum likelihood algorithm called 3DMax to construct the 3D structure of a chromosome from Hi-C data. 3DMax employs a maximum likelihood approach to infer the 3D structures of a chromosome, while automatically re-estimating the conversion factor (α) for converting Interaction Frequency (IF) to distance. Our results show that the models generated by 3DMax from a simulated Hi-C dataset match the true models better than most of the existing methods. 3DMax is more robust to structural variability and noise. Compared on a real Hi-C dataset, 3DMax constructs chromosomal models that fit the data better than most methods, and it is faster than all other methods. The models reconstructed by 3DMax were consistent with fluorescent in situ hybridization (FISH) experiments and existing knowledge about the organization of human chromosomes, such as chromosome compartmentalization. Conclusions 3DMax is an effective approach to reconstructing 3D chromosomal models. The results, and the models generated for the simulated and real Hi-C datasets are available here: http://sysbio.rnet.missouri.edu/bdm_download/3DMax/. The source code is available here: https://github.com/BDM-Lab/3DMax. A short video demonstrating how to use 3DMax can be found here: https://youtu.be/ehQUFWoHwfo.
Collapse
Affiliation(s)
- Oluwatosin Oluwadare
- Electrical Engineering & Computer Science Department, University of Missouri, Columbia, MO, 65211, USA
| | - Yuxiang Zhang
- Electrical Engineering & Computer Science Department, University of Missouri, Columbia, MO, 65211, USA
| | - Jianlin Cheng
- Electrical Engineering & Computer Science Department, University of Missouri, Columbia, MO, 65211, USA. .,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
37
|
Abstract
In this review, we describe how the interplay among science, technology and community interests contributed to the evolution of four structural biology data resources. We present the method by which data deposited by scientists are prepared for worldwide distribution, and argue that data archiving in a trusted repository must be an integral part of any scientific investigation.
Collapse
Affiliation(s)
- Helen M. Berman
- Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, 174 Frelinghuysen Road, Piscataway New Jersey 08854
| | - Catherine L. Lawson
- Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, 174 Frelinghuysen Road, Piscataway New Jersey 08854
| | - Brinda Vallat
- Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, 174 Frelinghuysen Road, Piscataway New Jersey 08854
| | - Margaret J. Gabanyi
- Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, 174 Frelinghuysen Road, Piscataway New Jersey 08854
| |
Collapse
|
38
|
Goodstadt M, Marti‐Renom MA. Challenges for visualizing three-dimensional data in genomic browsers. FEBS Lett 2017; 591:2505-2519. [PMID: 28771695 PMCID: PMC5638070 DOI: 10.1002/1873-3468.12778] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Revised: 07/30/2017] [Accepted: 07/31/2017] [Indexed: 12/14/2022]
Abstract
Genomic interactions reveal the spatial organization of genomes and genomic domains, which is known to play key roles in cell function. Physical proximity can be represented as two-dimensional heat maps or matrices. From these, three-dimensional (3D) conformations of chromatin can be computed revealing coherent structures that highlight the importance of nonsequential relationships across genomic features. Mainstream genomic browsers have been classically developed to display compact, stacked tracks based on a linear, sequential, per-chromosome coordinate system. Genome-wide comparative analysis demands new approaches to data access and new layouts for analysis. The legibility can be compromised when displaying track-aligned second dimension matrices, which require greater screen space. Moreover, 3D representations of genomes defy vertical alignment in track-based genome browsers. Furthermore, investigation at previously unattainable levels of detail is revealing multiscale, multistate, time-dependent complexity. This article outlines how these challenges are currently handled in mainstream browsers as well as how novel techniques in visualization are being explored to address them. A set of requirements for coherent visualization of novel spatial genomic data is defined and the resulting potential for whole genome visualization is described.
Collapse
Affiliation(s)
- Mike Goodstadt
- Structural Genomics GroupCNAG‐CRGThe Barcelona Institute of Science and Technology (BIST)Spain
- Gene Regulation, Stem Cells and Cancer ProgramCentre for Genomic Regulation (CRG)The Barcelona Institute of Science and Technology (BIST)Spain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
| | - Marc A. Marti‐Renom
- Structural Genomics GroupCNAG‐CRGThe Barcelona Institute of Science and Technology (BIST)Spain
- Gene Regulation, Stem Cells and Cancer ProgramCentre for Genomic Regulation (CRG)The Barcelona Institute of Science and Technology (BIST)Spain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Institució Catalana de Recerca i Estudis Avançats (ICREA)BarcelonaSpain
| |
Collapse
|
39
|
Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors. PLoS Comput Biol 2017; 13:e1005665. [PMID: 28723903 PMCID: PMC5540598 DOI: 10.1371/journal.pcbi.1005665] [Citation(s) in RCA: 178] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Revised: 08/02/2017] [Accepted: 07/03/2017] [Indexed: 11/19/2022] Open
Abstract
The sequence of a genome is insufficient to understand all genomic processes carried out in the cell nucleus. To achieve this, the knowledge of its three-dimensional architecture is necessary. Advances in genomic technologies and the development of new analytical methods, such as Chromosome Conformation Capture (3C) and its derivatives, provide unprecedented insights in the spatial organization of genomes. Here we present TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions. Our package takes as input the sequencing reads of 3C-based experiments and performs the following main tasks: (i) pre-process the reads, (ii) map the reads to a reference genome, (iii) filter and normalize the interaction data, (iv) analyze the resulting interaction matrices, (v) build 3D models of selected genomic domains, and (vi) analyze the resulting models to characterize their structural properties. To illustrate the use of TADbit, we automatically modeled 50 genomic domains from the fly genome revealing differential structural features of the previously defined chromatin colors, establishing a link between the conformation of the genome and the local chromatin composition. TADbit provides three-dimensional models built from 3C-based experiments, which are ready for visualization and for characterizing their relation to gene expression and epigenetic states. TADbit is an open-source Python library available for download from https://github.com/3DGenomes/tadbit.
Collapse
|
40
|
Le Dily F, Serra F, Marti-Renom MA. 3D modeling of chromatin structure: is there a way to integrate and reconcile single cell and population experimental data? WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1308] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- François Le Dily
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Dr. Aiguader 88; Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
| | - François Serra
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Dr. Aiguader 88; Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
- Structural Genomic Group, CNAG-CRG, Centre for Genomic Regulation (CRG); The Barcelona Institute of Science and Technology, Baldiri Reixac 4; Barcelona Spain
| | - Marc A. Marti-Renom
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Dr. Aiguader 88; Barcelona Spain
- Universitat Pompeu Fabra (UPF); Barcelona Spain
- Structural Genomic Group, CNAG-CRG, Centre for Genomic Regulation (CRG); The Barcelona Institute of Science and Technology, Baldiri Reixac 4; Barcelona Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23; Barcelona Spain
| |
Collapse
|
41
|
Trussart M, Yus E, Martinez S, Baù D, Tahara YO, Pengo T, Widjaja M, Kretschmer S, Swoger J, Djordjevic S, Turnbull L, Whitchurch C, Miyata M, Marti-Renom MA, Lluch-Senar M, Serrano L. Defined chromosome structure in the genome-reduced bacterium Mycoplasma pneumoniae. Nat Commun 2017; 8:14665. [PMID: 28272414 PMCID: PMC5344976 DOI: 10.1038/ncomms14665] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 01/20/2017] [Indexed: 12/24/2022] Open
Abstract
DNA-binding proteins are central regulators of chromosome organization; however, in genome-reduced bacteria their diversity is largely diminished. Whether the chromosomes of such bacteria adopt defined three-dimensional structures remains unexplored. Here we combine Hi-C and super-resolution microscopy to determine the structure of the Mycoplasma pneumoniae chromosome at a 10 kb resolution. We find a defined structure, with a global symmetry between two arms that connect opposite poles, one bearing the chromosomal Ori and the other the midpoint. Analysis of local structures at a 3 kb resolution indicates that the chromosome is organized into domains ranging from 15 to 33 kb. We provide evidence that genes within the same domain tend to be co-regulated, suggesting that chromosome organization influences transcriptional regulation, and that supercoiling regulates local organization. This study extends the current understanding of bacterial genome organization and demonstrates that a defined chromosomal structure is a universal feature of living systems.
Collapse
Affiliation(s)
- Marie Trussart
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Eva Yus
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Sira Martinez
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
| | - Davide Baù
- Gene Regulation, Stem Cells and Cancer Program. Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain
| | - Yuhei O Tahara
- Department of Biology, Graduate School of Science, Osaka City University, 558-8585 Osaka, Japan.,OCU Advanced Research Institute for Natural Science and Technology (OCARNA), Osaka City University, 558-8585 Osaka, Japan
| | - Thomas Pengo
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Advanced Light Microscopy Unit, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Michael Widjaja
- The ithree Institute, The University of Technology Sydney, Sydney, New South Wales 2007, Australia
| | - Simon Kretschmer
- Department of Cellular and Molecular Biophysics, Max Planck Institute of Biochemistry, 82152, Martinsried, Germany
| | - Jim Swoger
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Steven Djordjevic
- The ithree Institute, The University of Technology Sydney, Sydney, New South Wales 2007, Australia
| | - Lynne Turnbull
- The ithree Institute, The University of Technology Sydney, Sydney, New South Wales 2007, Australia
| | - Cynthia Whitchurch
- The ithree Institute, The University of Technology Sydney, Sydney, New South Wales 2007, Australia
| | - Makoto Miyata
- Department of Biology, Graduate School of Science, Osaka City University, 558-8585 Osaka, Japan.,OCU Advanced Research Institute for Natural Science and Technology (OCARNA), Osaka City University, 558-8585 Osaka, Japan
| | - Marc A Marti-Renom
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain.,Gene Regulation, Stem Cells and Cancer Program. Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| | - Maria Lluch-Senar
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Luís Serrano
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| |
Collapse
|
42
|
Szałaj P, Tang Z, Michalski P, Pietal MJ, Luo OJ, Sadowski M, Li X, Radew K, Ruan Y, Plewczynski D. An integrated 3-Dimensional Genome Modeling Engine for data-driven simulation of spatial genome organization. Genome Res 2016; 26:1697-1709. [PMID: 27789526 PMCID: PMC5131821 DOI: 10.1101/gr.205062.116] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 10/20/2016] [Indexed: 02/03/2023]
Abstract
ChIA-PET is a high-throughput mapping technology that reveals long-range chromatin interactions and provides insights into the basic principles of spatial genome organization and gene regulation mediated by specific protein factors. Recently, we showed that a single ChIA-PET experiment provides information at all genomic scales of interest, from the high-resolution locations of binding sites and enriched chromatin interactions mediated by specific protein factors, to the low resolution of nonenriched interactions that reflect topological neighborhoods of higher-order chromosome folding. This multilevel nature of ChIA-PET data offers an opportunity to use multiscale 3D models to study structural-functional relationships at multiple length scales, but doing so requires a structural modeling platform. Here, we report the development of 3D-GNOME (3-Dimensional Genome Modeling Engine), a complete computational pipeline for 3D simulation using ChIA-PET data. 3D-GNOME consists of three integrated components: a graph-distance-based heat map normalization tool, a 3D modeling platform, and an interactive 3D visualization tool. Using ChIA-PET and Hi-C data derived from human B-lymphocytes, we demonstrate the effectiveness of 3D-GNOME in building 3D genome models at multiple levels, including the entire genome, individual chromosomes, and specific segments at megabase (Mb) and kilobase (kb) resolutions of single average and ensemble structures. Further incorporation of CTCF-motif orientation and high-resolution looping patterns in 3D simulation provided additional reliability of potential biologically plausible topological structures.
Collapse
Affiliation(s)
- Przemysław Szałaj
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland.,Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland.,I-BioStat, Hasselt University, BE3590 Hasselt, Belgium
| | - Zhonghui Tang
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Paul Michalski
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Michal J Pietal
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Oscar J Luo
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Michał Sadowski
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Xingwang Li
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Kamen Radew
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland
| | - Yijun Ruan
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA.,Department of Genetics and Genome Sciences, UConn Health, Farmington, Connecticut 06032, USA
| | - Dariusz Plewczynski
- Centre of New Technologies, Warsaw University, 02-097 Warsaw, Poland.,Centre for Innovative Research, Medical University of Bialystok, 15-089 Białystok, Poland.,Faculty of Pharmacy, Medical University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
43
|
Brackley CA, Johnson J, Kelly S, Cook PR, Marenduzzo D. Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domains. Nucleic Acids Res 2016; 44:3503-12. [PMID: 27060145 PMCID: PMC4856988 DOI: 10.1093/nar/gkw135] [Citation(s) in RCA: 119] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 02/22/2016] [Accepted: 02/24/2016] [Indexed: 01/12/2023] Open
Abstract
Biophysicists are modeling conformations of interphase chromosomes, often basing the strengths of interactions between segments distant on the genetic map on contact frequencies determined experimentally. Here, instead, we develop a fitting-free, minimal model: bivalent or multivalent red and green 'transcription factors' bind to cognate sites in strings of beads ('chromatin') to form molecular bridges stabilizing loops. In the absence of additional explicit forces, molecular dynamic simulations reveal that bound factors spontaneously cluster-red with red, green with green, but rarely red with green-to give structures reminiscent of transcription factories. Binding of just two transcription factors (or proteins) to active and inactive regions of human chromosomes yields rosettes, topological domains and contact maps much like those seen experimentally. This emergent 'bridging-induced attraction' proves to be a robust, simple and generic force able to organize interphase chromosomes at all scales.
Collapse
Affiliation(s)
- Chris A Brackley
- SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
| | - James Johnson
- SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
| | - Steven Kelly
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, UK
| | - Peter R Cook
- Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford, OX1 3RE, UK
| | - Davide Marenduzzo
- SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
| |
Collapse
|
44
|
Abstract
The role of the spatial organization of chromatin in gene regulation is a long-standing but still open question. Experimentally it has been shown that the genome is segmented into epigenomic chromatin domains that are organized into hierarchical sub-nuclear spatial compartments. However, whether this non-random spatial organization only reflects or indeed contributes-and how-to the regulation of genome function remains to be elucidated. To address this question, we recently proposed a quantitative description of the folding properties of the fly genome as a function of its epigenomic landscape using a polymer model with epigenomic-driven attractions. We propose in this article, to characterize more deeply the physical properties of the 3D epigenome folding. Using an efficient lattice version of the original block copolymer model, we study the structural and dynamical properties of chromatin and show that the size of epigenomic domains and asymmetries in sizes and in interaction strengths play a critical role in the chromatin organization. Finally, we discuss the biological implications of our findings. In particular, our predictions are quantitatively compatible with experimental data and suggest a different mean of self-interaction in euchromatin versus heterochromatin domains.
Collapse
Affiliation(s)
- Juan D Olarte-Plata
- École Normale Supérieure de Lyon, CNRS, Laboratoire de Physique, UMR 5672, Lyon, France
| | | | | | | |
Collapse
|
45
|
Zou C, Zhang Y, Ouyang Z. HSA: integrating multi-track Hi-C data for genome-scale reconstruction of 3D chromatin structure. Genome Biol 2016; 17:40. [PMID: 26936376 PMCID: PMC4774023 DOI: 10.1186/s13059-016-0896-1] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 02/10/2016] [Indexed: 11/24/2022] Open
Abstract
Genome-wide 3C technologies (Hi-C) are being increasingly employed to study three-dimensional (3D) genome conformations. Existing computational approaches are unable to integrate accumulating data to facilitate studying 3D chromatin structure and function. We present HSA (http://ouyanglab.jax.org/hsa/), a flexible tool that jointly analyzes multiple contact maps to infer 3D chromatin structure at the genome scale. HSA globally searches the latent structure underlying different cleavage footprints. Its robustness and accuracy outperform or rival existing tools on extensive simulations and orthogonal experiment validations. Applying HSA to recent in situ Hi-C data, we found the 3D chromatin structures are highly conserved across various human cell types.
Collapse
Affiliation(s)
- Chenchen Zou
- The Jackson Laboratory for Genomic Medicine, Farmington, 06032, CT, USA.
| | - Yuping Zhang
- Department of Statistics, University of Connecticut, Storrs, 06269, CT, USA. .,Institute for Systems Genomics, University of Connecticut, Farmington, 06030, CT, USA. .,Institute for Collaboration on Health, Intervention, and Policy, University of Connecticut, Storrs, 06269, CT, USA. .,Center for Quantitative Medicine, University of Connecticut, Farmington, 06030, CT, USA. .,The Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, Storrs, 06269, CT, USA.
| | - Zhengqing Ouyang
- The Jackson Laboratory for Genomic Medicine, Farmington, 06032, CT, USA. .,Institute for Systems Genomics, University of Connecticut, Farmington, 06030, CT, USA. .,Department of Biomedical Engineering, University of Connecticut, Storrs, 06269, CT, USA. .,Department of Genetics and Genome Sciences, University of Connecticut, Farmington, 06030, CT, USA.
| |
Collapse
|
46
|
Trieu T, Cheng J. MOGEN: a tool for reconstructing 3D models of genomes from chromosomal conformation capturing data. Bioinformatics 2015; 32:1286-92. [PMID: 26722115 DOI: 10.1093/bioinformatics/btv754] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2015] [Accepted: 12/19/2015] [Indexed: 12/22/2022] Open
Abstract
MOTIVATION The three-dimensional (3D) conformation of chromosomes and genomes play an important role in cellular processes such as gene regulation, DNA replication and genome methylation. Several methods have been developed to reconstruct 3D structures of individual chromosomes from chromosomal conformation capturing data such as Hi-C data. However, few methods can effectively reconstruct the 3D structures of an entire genome due to the difficulty of handling noisy and inconsistent inter-chromosomal contact data. RESULTS We generalized a 3D chromosome reconstruction method to make it capable of reconstructing 3D models of genomes from both intra- and inter-chromosomal Hi-C contact data and implemented it as a software tool called MOGEN. We validated MOGEN on synthetic datasets of a polymer worm-like chain model and a yeast genome at first, and then applied it to generate an ensemble of 3D structural models of the genome of human B-cells from a Hi-C dataset. These genome models not only were validated by some known structural patterns of the human genome, such as chromosome compartmentalization, chromosome territories, co-localization of small chromosomes in the nucleus center with the exception of chromosome 18, enriched center-toward inter-chromosomal interactions between elongated or telomere regions of chromosomes, but also demonstrated the intrinsically dynamic orientations between chromosomes. Therefore, MOGEN is a useful tool for converting chromosomal contact data into 3D genome models to provide a better view into the spatial organization of genomes. AVAILABILITY AND IMPLEMENTATION The software of MOGEN is available at: http://calla.rnet.missouri.edu/mogen/ CONTACT : chengji@missouri.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tuan Trieu
- Computer Science Department, University of Missouri, Columbia, MO 65201, USA
| | - Jianlin Cheng
- Computer Science Department, University of Missouri, Columbia, MO 65201, USA
| |
Collapse
|
47
|
Junier I, Spill YG, Marti-Renom MA, Beato M, le Dily F. On the demultiplexing of chromosome capture conformation data. FEBS Lett 2015; 589:3005-13. [PMID: 26054977 DOI: 10.1016/j.febslet.2015.05.049] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Revised: 05/24/2015] [Accepted: 05/26/2015] [Indexed: 11/29/2022]
Abstract
How to describe the multiple chromosome structures that underlie interactions among genome loci and how to quantify the occurrence of these structures in a cell population remain important challenges to solve, which can be addressed via a proper demultiplexing of chromosome capture conformation related data. Here, we first aim to review two main methodologies that have been proposed to tackle this problem: restrained-based methods, in which the resulting chromosome structures stem from the multiple solutions of a distance satisfaction problem; and thermodynamic-based methods, in which the structures stem from the simulation of polymer models. Next, we propose a novel demultiplexing method based on a matrix decomposition of contact maps. To this end, we extend the notion of topologically associated domains (TADs) by introducing that of statistical interaction domains (SIDs). SIDs can overlap and occur in a cell population at certain frequencies, and we propose a simple method to estimate these frequency values. As an application, we show that SIDs that measure 100kb to tens of Mb long occur both frequently and specifically in the human genome.
Collapse
Affiliation(s)
- Ivan Junier
- Laboratoire Adaptation et Pathogénie des Micro-organismes - UMR 5163, Université Grenoble 1, CNRS, BP 170, F-38042 Grenoble Cedex 9, France; Gene Regulacion, Stem Cells, and Cancer Program, Centre de Regulaciò Genòmica (CRG), 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain.
| | - Yannick G Spill
- Gene Regulacion, Stem Cells, and Cancer Program, Centre de Regulaciò Genòmica (CRG), 08003 Barcelona, Spain; Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
| | - Marc A Marti-Renom
- Gene Regulacion, Stem Cells, and Cancer Program, Centre de Regulaciò Genòmica (CRG), 08003 Barcelona, Spain; Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), 08028 Barcelona, Spain
| | - Miguel Beato
- Gene Regulacion, Stem Cells, and Cancer Program, Centre de Regulaciò Genòmica (CRG), 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - François le Dily
- Gene Regulacion, Stem Cells, and Cancer Program, Centre de Regulaciò Genòmica (CRG), 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| |
Collapse
|
48
|
Serra F, Di Stefano M, Spill YG, Cuartero Y, Goodstadt M, Baù D, Marti-Renom MA. Restraint-based three-dimensional modeling of genomes and genomic domains. FEBS Lett 2015; 589:2987-95. [PMID: 25980604 DOI: 10.1016/j.febslet.2015.05.012] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Revised: 05/05/2015] [Accepted: 05/05/2015] [Indexed: 10/23/2022]
Abstract
Chromosomes are large polymer molecules composed of nucleotides. In some species, such as humans, this polymer can sum up to meters long and still be properly folded within the nuclear space of few microns in size. The exact mechanisms of how the meters long DNA is folded into the nucleus, as well as how the regulatory machinery can access it, is to a large extend still a mystery. However, and thanks to newly developed molecular, genomic and computational approaches based on the Chromosome Conformation Capture (3C) technology, we are now obtaining insight on how genomes are spatially organized. Here we review a new family of computational approaches that aim at using 3C-based data to obtain spatial restraints for modeling genomes and genomic domains.
Collapse
Affiliation(s)
- François Serra
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Marco Di Stefano
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Yannick G Spill
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Yasmina Cuartero
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Michael Goodstadt
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Davide Baù
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Marc A Marti-Renom
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
| |
Collapse
|