1
|
Li Z, Portillo-Ledesma S, Schlick T. Techniques for and challenges in reconstructing 3D genome structures from 2D chromosome conformation capture data. Curr Opin Cell Biol 2023; 83:102209. [PMID: 37506571 PMCID: PMC10529954 DOI: 10.1016/j.ceb.2023.102209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/07/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023]
Abstract
Chromosome conformation capture technologies that provide frequency information for contacts between genomic regions have been crucial for increasing our understanding of genome folding and regulation. However, such data do not provide direct evidence of the spatial 3D organization of chromatin. In this opinion article, we discuss the development and application of computational methods to reconstruct chromatin 3D structures from experimental 2D contact data, highlighting how such modeling provides biological insights and can suggest mechanisms anchored to experimental data. By applying different reconstruction methods to the same contact data, we illustrate some state-of-the-art of these techniques and discuss our gene resolution approach based on Brownian dynamics and Monte Carlo sampling.
Collapse
Affiliation(s)
- Zilong Li
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Stephanie Portillo-Ledesma
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, 10012, NY, USA; New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Room 340, Geography Building, 3663 North Zhongshan Road, Shanghai, 200122, China; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA.
| |
Collapse
|
2
|
Beagrie RA, Thieme CJ, Annunziatella C, Baugher C, Zhang Y, Schueler M, Kukalev A, Kempfer R, Chiariello AM, Bianco S, Li Y, Davis T, Scialdone A, Welch LR, Nicodemi M, Pombo A. Multiplex-GAM: genome-wide identification of chromatin contacts yields insights overlooked by Hi-C. Nat Methods 2023:10.1038/s41592-023-01903-1. [PMID: 37336949 PMCID: PMC10333126 DOI: 10.1038/s41592-023-01903-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/01/2023] [Indexed: 06/21/2023]
Abstract
Technology for measuring 3D genome topology is increasingly important for studying gene regulation, for genome assembly and for mapping of genome rearrangements. Hi-C and other ligation-based methods have become routine but have specific biases. Here, we develop multiplex-GAM, a faster and more affordable version of genome architecture mapping (GAM), a ligation-free technique that maps chromatin contacts genome-wide. We perform a detailed comparison of multiplex-GAM and Hi-C using mouse embryonic stem cells. When examining the strongest contacts detected by either method, we find that only one-third of these are shared. The strongest contacts specifically found in GAM often involve 'active' regions, including many transcribed genes and super-enhancers, whereas in Hi-C they more often contain 'inactive' regions. Our work shows that active genomic regions are involved in extensive complex contacts that are currently underestimated in ligation-based approaches, and highlights the need for orthogonal advances in genome-wide contact mapping technologies.
Collapse
Affiliation(s)
- Robert A Beagrie
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
- Laboratory of Gene Regulation, Weatherall Institute of Molecular Medicine, Oxford, UK
- Chromatin and Disease Group, Wellcome Centre for Human Genetics, Oxford, UK
| | - Christoph J Thieme
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
| | - Carlo Annunziatella
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, CNR-SPIN, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Catherine Baugher
- School of Electrical Engineering and Computer Science, Ohio University, Athens, OH, USA
| | - Yingnan Zhang
- School of Electrical Engineering and Computer Science, Ohio University, Athens, OH, USA
| | - Markus Schueler
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
| | - Alexander Kukalev
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
| | - Rieke Kempfer
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
- Humboldt-Universität zu Berlin, Berlin, Germany
| | - Andrea M Chiariello
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, CNR-SPIN, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Simona Bianco
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, CNR-SPIN, Complesso Universitario di Monte Sant'Angelo, Naples, Italy
| | - Yichao Li
- School of Electrical Engineering and Computer Science, Ohio University, Athens, OH, USA
| | - Trenton Davis
- School of Electrical Engineering and Computer Science, Ohio University, Athens, OH, USA
| | - Antonio Scialdone
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München - German Research Center for Environmental Health, Munich, Germany
- Institute of Functional Epigenetics, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
| | - Lonnie R Welch
- School of Electrical Engineering and Computer Science, Ohio University, Athens, OH, USA.
| | - Mario Nicodemi
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany.
- Dipartimento di Fisica, Università di Napoli Federico II, and INFN Napoli, CNR-SPIN, Complesso Universitario di Monte Sant'Angelo, Naples, Italy.
- Berlin Institute of Health (BIH), MDC-Berlin, Berlin, Germany.
| | - Ana Pombo
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Epigenetic Regulation and Chromatin Architecture Group, Berlin, Germany.
- Humboldt-Universität zu Berlin, Berlin, Germany.
| |
Collapse
|
3
|
Liu L, Cao X, Zhang B, Hyeon C. Dissecting the cosegregation probability from genome architecture mapping. Biophys J 2022; 121:3774-3784. [PMID: 36146938 PMCID: PMC9674989 DOI: 10.1016/j.bpj.2022.09.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Revised: 08/16/2022] [Accepted: 09/16/2022] [Indexed: 11/21/2022] Open
Abstract
Genome architecture mapping (GAM) is a recently developed methodology that offers the cosegregation probability of two genomic segments from an ensemble of thinly sliced nuclear profiles, enabling us to probe and decipher three-dimensional chromatin organization. The cosegregation probability from GAM binned at 1 Mb, which thus probes the length scale associated with the genomic separation greater than 1 Mb, is, however, not identical to the contact probability obtained from Hi-C, and its correlation with interlocus distance measured with fluorescence in situ hybridization is not so good as the contact probability. In this study, by using a polymer-based model of chromatins, we derive a theoretical expression of the cosegregation probability as well as that of the contact probability and carry out quantitative analyses of how they differ from each other. The results from our study, validated with in silico GAM analysis on three-dimensional genome structures from fluorescence in situ hybridization, suggest that to attain strong correlation with the interlocus distance, a properly normalized version of cosegregation probability needs to be calculated based on a large number of nuclear slices (n>103).
Collapse
Affiliation(s)
- Lei Liu
- Key Laboratory of Optical Field Manipulation of Zhejiang Province, Department of Physics, Zhejiang Sci-Tech University, Hangzhou, China.
| | - Xinmeng Cao
- Key Laboratory of Optical Field Manipulation of Zhejiang Province, Department of Physics, Zhejiang Sci-Tech University, Hangzhou, China
| | - Bokai Zhang
- School of Physical Science and Technology, Southwest University, Chongqing, China
| | - Changbong Hyeon
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul, Korea.
| |
Collapse
|
4
|
Matthey-Doret C, Baudry L, Mortaza S, Moreau P, Koszul R, Cournac A. Normalization of Chromosome Contact Maps: Matrix Balancing and Visualization. Methods Mol Biol 2022; 2301:1-15. [PMID: 34415528 DOI: 10.1007/978-1-0716-1390-0_1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Over the last decade, genomic proximity ligation approaches have reshaped our vision of chromosomes 3D organizations, from bacteria nucleoids to larger eukaryotic genomes. The different protocols (3Cseq, Hi-C, TCC, MicroC [XL], Hi-CO, etc.) rely on common steps (chemical fixation digestion, ligation…) to detect pairs of genomic positions in close proximity. The most common way to represent these data is a matrix, or contact map, which allows visualizing the different chromatin structures (compartments, loops, etc.) that can be associated to other signals such as transcription, protein occupancy, etc. as well as, in some instances, to biological functions.In this chapter we present and discuss the filtering of the events recovered in proximity ligation experiments as well as the application of the balancing normalization procedure on the resulting contact map. We also describe a computational tool for visualizing normalized contact data dubbed Scalogram.The different processes described here are illustrated and supported by the laboratory custom-made scripts pooled into "hicstuff," an open-access python package accessible on github ( https://github.com/koszullab/hicstuff ).
Collapse
Affiliation(s)
- Cyril Matthey-Doret
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France
- Sorbonne Université, Collège Doctoral, Paris, France
| | - Lyam Baudry
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France
- Sorbonne Université, Collège Doctoral, Paris, France
| | - Shogofa Mortaza
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France
| | - Pierrick Moreau
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France
| | - Romain Koszul
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France
| | - Axel Cournac
- Institut Pasteur, Unité Régulation Spatiale des Génomes, Paris, France.
| |
Collapse
|
5
|
Nie G, Yang Z, He J, Liu A, Chen J, Wang S, Wang X, Feng G, Li D, Peng Y, Huang L, Zhang X. Genome-Wide Investigation of the NAC Transcription Factor Family in Miscanthus sinensis and Expression Analysis Under Various Abiotic Stress. FRONTIERS IN PLANT SCIENCE 2021; 12:766550. [PMID: 34804100 PMCID: PMC8600139 DOI: 10.3389/fpls.2021.766550] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 10/11/2021] [Indexed: 06/13/2023]
Abstract
The NAC transcription factor family is deemed to be a large plant-specific gene family that plays important roles in plant development and stress response. Miscanthus sinensis is commonly planted in vast marginal land as forage, ornamental grass, or bioenergy crop which demand a relatively high resistance to abiotic stresses. The recent release of a draft chromosome-scale assembly genome of M. sinensis provided a basic platform for the genome-wide investigation of NAC proteins. In this study, a total of 261 M. sinensis NAC genes were identified and a complete overview of the gene family was presented, including gene structure, conserved motif compositions, chromosomal distribution, and gene duplications. Results showed that gene length, molecular weights (MW), and theoretical isoelectric points (pI) of NAC family were varied, while gene structure and motifs were relatively conserved. Chromosomal mapping analysis found that the M. sinensis NAC genes were unevenly distributed on 19 M. sinensis chromosomes, and the interchromosomal evolutionary analysis showed that nine pairs of tandem duplicates genes and 121 segmental duplications were identified, suggesting that gene duplication, especially segmental duplication, is possibly associated with the amplification of M. sinensis NAC gene family. The expression patterns of 14 genes from M. sinensis SNAC subgroup were analyzed under high salinity, PEG, and heavy metals, and multiple NAC genes could be induced by the treatment. These results will provide a very useful reference for follow-up study of the functional characteristics of NAC genes in the mechanism of stress-responsive and potential roles in the development of M. sinensis.
Collapse
|
6
|
Mohanta TK, Mishra AK, Al-Harrasi A. The 3D Genome: From Structure to Function. Int J Mol Sci 2021; 22:11585. [PMID: 34769016 PMCID: PMC8584255 DOI: 10.3390/ijms222111585] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 10/18/2021] [Accepted: 10/20/2021] [Indexed: 01/09/2023] Open
Abstract
The genome is the most functional part of a cell, and genomic contents are organized in a compact three-dimensional (3D) structure. The genome contains millions of nucleotide bases organized in its proper frame. Rapid development in genome sequencing and advanced microscopy techniques have enabled us to understand the 3D spatial organization of the genome. Chromosome capture methods using a ligation approach and the visualization tool of a 3D genome browser have facilitated detailed exploration of the genome. Topologically associated domains (TADs), lamin-associated domains, CCCTC-binding factor domains, cohesin, and chromatin structures are the prominent identified components that encode the 3D structure of the genome. Although TADs are the major contributors to 3D genome organization, they are absent in Arabidopsis. However, a few research groups have reported the presence of TAD-like structures in the plant kingdom.
Collapse
Affiliation(s)
- Tapan Kumar Mohanta
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa 616, Oman
| | - Awdhesh Kumar Mishra
- Department of Biotechnology, Yeungnam University, Gyeongsan 38541, Gyeongsangbuk-do, Korea; or
| | - Ahmed Al-Harrasi
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa 616, Oman
| |
Collapse
|
7
|
Wu H, Wang X, Chu M, Li D, Cheng L, Zhou K. HCMB: A stable and efficient algorithm for processing the normalization of highly sparse Hi-C contact data. Comput Struct Biotechnol J 2021; 19:2637-2645. [PMID: 34025950 PMCID: PMC8120939 DOI: 10.1016/j.csbj.2021.04.064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 04/11/2021] [Accepted: 04/24/2021] [Indexed: 11/17/2022] Open
Abstract
The high-throughput genome-wide chromosome conformation capture (Hi-C) method has recently become an important tool to study chromosomal interactions where one can extract meaningful biological information including P(s) curve, topologically associated domains, A/B compartments, and other biologically relevant signals. Normalization is a critical pre-processing step of downstream analyses for the elimination of systematic and technical biases from chromatin contact matrices due to different mappability, GC content, and restriction fragment lengths. Especially, the problem of high sparsity puts forward a huge challenge on the correction, indicating the urgent need for a stable and efficient method for Hi-C data normalization. Recently, some matrix balancing methods have been developed to normalize Hi-C data, such as the Knight-Ruiz (KR) algorithm, but it failed to normalize contact matrices with high sparsity. Here, we presented an algorithm, Hi-C Matrix Balancing (HCMB), based on an iterative solution of equations, combining with linear search and projection strategy to normalize the Hi-C original interaction data. Both the simulated and experimental data demonstrated that HCMB is robust and efficient in normalizing Hi-C data and preserving the biologically relevant Hi-C features even facing very high sparsity. HCMB is implemented in Python and is freely accessible to non-commercial users at GitHub: https://github.com/HUST-DataMan/HCMB.
Collapse
Affiliation(s)
- Honglong Wu
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, Hubei 430000, China
- BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen 518083, China
| | - Xuebin Wang
- BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen 518083, China
| | - Mengtian Chu
- BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen 518083, China
| | - Dongfang Li
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, Hubei 430000, China
- BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen 518083, China
| | - Lixin Cheng
- Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen 518020, China
| | - Ke Zhou
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, Hubei 430000, China
| |
Collapse
|
8
|
Mathé E, Zhang C, Wang K, Ning X, Guo Y, Zhao Z. The International Conference on Intelligent Biology and Medicine 2019 (ICIBM 2019): conference summary and innovations in genomics. BMC Genomics 2019; 20:1005. [PMID: 31888451 PMCID: PMC6936133 DOI: 10.1186/s12864-019-6326-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The goal of this editorial is to summarize the 2019 International Conference on Intelligent Biology and Medicine (ICIBM 2019) conference that took place on June 9–11, 2019 in The Ohio State University, Columbus, OH, and to provide an introductory summary of the seven articles presented in this supplement issue. ICIBM 2019 hosted four keynote speakers, four eminent scholar speakers, five tutorials and workshops, twelve concurrent sessions and a poster session, totaling 23 posters, spanning state-of-the-art developments in bioinformatics, genomics, next-generation sequencing (NGS) analysis, scientific databases, cancer and medical genomics, and computational drug discovery. A total of 105 original manuscripts were submitted to ICIBM 2019, and after careful review, seven were selected for this supplement issue. These articles cover methods and applications for functional annotations of miRNA targeting, clonal evolution of bacterial cells, gene co-expression networks that describe a given phenotype, functional binding site analysis of RNA-binding proteins, normalization of genome architecture mapping data, sample predictions based on multiple NGS data types, and prediction of an individual’s genetic admixture given exonic single nucleotide polymorphisms data.
Collapse
Affiliation(s)
- Ewy Mathé
- Department of Biomedical Informatics, The Ohio State University, Columbus, 43210, USA.
| | - Chi Zhang
- Department of Medical & Molecular Genetics, School of Medicine, Indiana University, Indianapolis, Indiana, 46202, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Xia Ning
- Department of Biomedical Informatics, The Ohio State University, Columbus, 43210, USA
| | - Yan Guo
- Department of Internal Medicine, Comprehensive Cancer Center, University of New Mexico, Albuquerque, NM, 87131, USA
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA. .,Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
| |
Collapse
|