1
|
Drogaris C, Zhang Y, Zhang E, Nazarova E, Sarrazin-Gendron R, Wilhelm-Landry S, Cyr Y, Majewski J, Blanchette M, Waldispühl J. ARGV: 3D genome structure exploration using augmented reality. BMC Bioinformatics 2024; 25:277. [PMID: 39192184 DOI: 10.1186/s12859-024-05882-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 07/25/2024] [Indexed: 08/29/2024] Open
Abstract
Over the past two decades, scientists have increasingly realized the importance of the three-dimensional (3D) genome organization in regulating cellular activity. Hi-C and related experiments yield 2D contact matrices that can be used to infer 3D models of chromosome structure. Visualizing and analyzing genomes in 3D space remains challenging. Here, we present ARGV, an augmented reality 3D Genome Viewer. ARGV contains more than 350 pre-computed and annotated genome structures inferred from Hi-C and imaging data. It offers interactive and collaborative visualization of genomes in 3D space, using standard mobile phones or tablets. A user study comparing ARGV to existing tools demonstrates its benefits.
Collapse
Affiliation(s)
| | - Yanlin Zhang
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Eric Zhang
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Elena Nazarova
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | | | | | - Yan Cyr
- Beam Me Up Inc., 5925 Monkland Ave, Suite, 100, Montréal, H4A 1G7, Canada
| | - Jacek Majewski
- Department of Human Genetics, McGill University, Montréal, QC, H3A 1B1, Canada
| | - Mathieu Blanchette
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada.
| |
Collapse
|
2
|
Chowdhury HMAM, Boult T, Oluwadare O. Comparative study on chromatin loop callers using Hi-C data reveals their effectiveness. BMC Bioinformatics 2024; 25:123. [PMID: 38515011 PMCID: PMC10958853 DOI: 10.1186/s12859-024-05713-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 02/19/2024] [Indexed: 03/23/2024] Open
Abstract
BACKGROUND Chromosome is one of the most fundamental part of cell biology where DNA holds the hierarchical information. DNA compacts its size by forming loops, and these regions house various protein particles, including CTCF, SMC3, H3 histone. Numerous sequencing methods, such as Hi-C, ChIP-seq, and Micro-C, have been developed to investigate these properties. Utilizing these data, scientists have developed a variety of loop prediction techniques that have greatly improved their methods for characterizing loop prediction and related aspects. RESULTS In this study, we categorized 22 loop calling methods and conducted a comprehensive study of 11 of them. Additionally, we have provided detailed insights into the methodologies underlying these algorithms for loop detection, categorizing them into five distinct groups based on their fundamental approaches. Furthermore, we have included critical information such as resolution, input and output formats, and parameters. For this analysis, we utilized the GM12878 Hi-C datasets at 5 KB, 10 KB, 100 KB and 250 KB resolutions. Our evaluation criteria encompassed various factors, including memory usages, running time, sequencing depth, and recovery of protein-specific sites such as CTCF, H3K27ac, and RNAPII. CONCLUSION This analysis offers insights into the loop detection processes of each method, along with the strengths and weaknesses of each, enabling readers to effectively choose suitable methods for their datasets. We evaluate the capabilities of these tools and introduce a novel Biological, Consistency, and Computational robustness score ( B C C score ) to measure their overall robustness ensuring a comprehensive evaluation of their performance.
Collapse
Affiliation(s)
- H M A Mohit Chowdhury
- Department of Computer Science, University of Colorado at Colorado Springs, 1420 Austin Bluffs Pkwy, Colorado Springs, CO, 80918, USA
| | - Terrance Boult
- Department of Computer Science, University of Colorado at Colorado Springs, 1420 Austin Bluffs Pkwy, Colorado Springs, CO, 80918, USA
| | - Oluwatosin Oluwadare
- Department of Computer Science, University of Colorado at Colorado Springs, 1420 Austin Bluffs Pkwy, Colorado Springs, CO, 80918, USA.
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
3
|
Gilbert BR, Luthey-Schulten Z. Replicating Chromosomes in Whole-Cell Models of Bacteria. Methods Mol Biol 2024; 2819:625-653. [PMID: 39028527 DOI: 10.1007/978-1-0716-3930-6_29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication of genetic material. In a recent study, we presented a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics. This approach was used to investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell cycle. To achieve cell-scale chromosome structures that are realistic, we modeled the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. Additionally, the polymer interacts with ribosomes distributed according to cryo-electron tomograms of Syn3A. The polymer model was further augmented by computational models of loop extrusion by structural maintenance of chromosomes (SMC) protein complexes and topoisomerase action, and the modeling and analysis of multi-fork replication states.
Collapse
Affiliation(s)
- Benjamin R Gilbert
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- NSF Science and Technology Center for Quantitative Cell Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|
4
|
Holmgren A, Bernenko D, Lizana L. Mapping robust multiscale communities in chromosome contact networks. Sci Rep 2023; 13:12979. [PMID: 37563218 PMCID: PMC10415398 DOI: 10.1038/s41598-023-39522-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 07/26/2023] [Indexed: 08/12/2023] Open
Abstract
To better understand DNA's 3D folding in cell nuclei, researchers developed chromosome capture methods such as Hi-C that measure the contact frequencies between all DNA segment pairs across the genome. As Hi-C data sets often are massive, it is common to use bioinformatics methods to group DNA segments into 3D regions with correlated contact patterns, such as Topologically associated domains and A/B compartments. Recently, another research direction emerged that treats the Hi-C data as a network of 3D contacts. In this representation, one can use community detection algorithms from complex network theory that group nodes into tightly connected mesoscale communities. However, because Hi-C networks are so densely connected, several node partitions may represent feasible solutions to the community detection problem but are indistinguishable unless including other data. Because this limitation is a fundamental property of the network, this problem persists regardless of the community-finding or data-clustering method. To help remedy this problem, we developed a method that charts the solution landscape of network partitions in Hi-C data from human cells. Our approach allows us to scan seamlessly through the scales of the network and determine regimes where we can expect reliable community structures. We find that some scales are more robust than others and that strong clusters may differ significantly. Our work highlights that finding a robust community structure hinges on thoughtful algorithm design or method cross-evaluation.
Collapse
Affiliation(s)
- Anton Holmgren
- Integrated Science Lab, Department of Physics, Umeå University, Umeå, Sweden
| | - Dolores Bernenko
- Integrated Science Lab, Department of Physics, Umeå University, Umeå, Sweden
| | - Ludvig Lizana
- Integrated Science Lab, Department of Physics, Umeå University, Umeå, Sweden.
| |
Collapse
|
5
|
Li Z, Portillo-Ledesma S, Schlick T. Techniques for and challenges in reconstructing 3D genome structures from 2D chromosome conformation capture data. Curr Opin Cell Biol 2023; 83:102209. [PMID: 37506571 PMCID: PMC10529954 DOI: 10.1016/j.ceb.2023.102209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/07/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023]
Abstract
Chromosome conformation capture technologies that provide frequency information for contacts between genomic regions have been crucial for increasing our understanding of genome folding and regulation. However, such data do not provide direct evidence of the spatial 3D organization of chromatin. In this opinion article, we discuss the development and application of computational methods to reconstruct chromatin 3D structures from experimental 2D contact data, highlighting how such modeling provides biological insights and can suggest mechanisms anchored to experimental data. By applying different reconstruction methods to the same contact data, we illustrate some state-of-the-art of these techniques and discuss our gene resolution approach based on Brownian dynamics and Monte Carlo sampling.
Collapse
Affiliation(s)
- Zilong Li
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Stephanie Portillo-Ledesma
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 100 Washington Square East, Silver Building, New York, 10003, NY, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, 10012, NY, USA; New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Room 340, Geography Building, 3663 North Zhongshan Road, Shanghai, 200122, China; Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, Silver Building, New York, NY, 10003, USA.
| |
Collapse
|
6
|
Tolokh IS, Kinney NA, Sharakhov IV, Onufriev AV. Strong interactions between highly dynamic lamina-associated domains and the nuclear envelope stabilize the 3D architecture of Drosophila interphase chromatin. Epigenetics Chromatin 2023; 16:21. [PMID: 37254161 PMCID: PMC10228000 DOI: 10.1186/s13072-023-00492-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 05/04/2023] [Indexed: 06/01/2023] Open
Abstract
BACKGROUND Interactions among topologically associating domains (TADs), and between the nuclear envelope (NE) and lamina-associated domains (LADs) are expected to shape various aspects of three-dimensional (3D) chromatin structure and dynamics; however, relevant genome-wide experiments that may provide statistically significant conclusions remain difficult. RESULTS We have developed a coarse-grained dynamical model of D. melanogaster nuclei at TAD resolution that explicitly accounts for four distinct epigenetic classes of TADs and LAD-NE interactions. The model is parameterized to reproduce the experimental Hi-C map of the wild type (WT) nuclei; it describes time evolution of the chromatin over the G1 phase of the interphase. The simulations include an ensemble of nuclei, corresponding to the experimentally observed set of several possible mutual arrangements of chromosomal arms. The model is validated against multiple structural features of chromatin from several different experiments not used in model development. Predicted positioning of all LADs at the NE is highly dynamic-the same LAD can attach, detach and move far away from the NE multiple times during interphase. The probabilities of LADs to be in contact with the NE vary by an order of magnitude, despite all having the same affinity to the NE in the model. These probabilities are mostly determined by a highly variable local linear density of LADs along the genome, which also has the same strong effect on the predicted positioning of individual TADs -- higher probability of a TAD to be near NE is largely determined by a higher linear density of LADs surrounding this TAD. The distribution of LADs along the chromosome chains plays a notable role in maintaining a non-random average global structure of chromatin. Relatively high affinity of LADs to the NE in the WT nuclei substantially reduces sensitivity of the global radial chromatin distribution to variations in the strength of TAD-TAD interactions compared to the lamin depleted nuclei, where a small (0.5 kT) increase of cross-type TAD-TAD interactions doubles the chromatin density in the central nucleus region. CONCLUSIONS A dynamical model of the entire fruit fly genome makes multiple genome-wide predictions of biological interest. The distribution of LADs along the chromatin chains affects their probabilities to be in contact with the NE and radial positioning of highly mobile TADs, playing a notable role in creating a non-random average global structure of the chromatin. We conjecture that an important role of attractive LAD-NE interactions is to stabilize global chromatin structure against inevitable cell-to-cell variations in TAD-TAD interactions.
Collapse
Affiliation(s)
- Igor S. Tolokh
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061 USA
| | - Nicholas Allen Kinney
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061 USA
- Department of Entomology, Virginia Tech, Blacksburg, VA 24061 USA
- Edward Via College of Osteopathic Medicine, 2265 Kraft Drive, Blacksburg, VA 24060 USA
| | | | - Alexey V. Onufriev
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061 USA
- Department of Physics, Virginia Tech, Blacksburg, VA 24061 USA
- Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, VA 24061 USA
| |
Collapse
|
7
|
Bi Y, Wang X, Ding H, He F, Han L, Zhang Y. Transporter-mediated Natural Product-Drug Interactions. PLANTA MEDICA 2023; 89:119-133. [PMID: 35304735 DOI: 10.1055/a-1803-1744] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The increasing use of natural products in clinical practice has raised great concerns about the potential natural product-drug interactions (NDIs). Drug transporters mediate the transmembrane passage of a broad range of drugs, and thus are important determinants for drug pharmacokinetics and pharmacodynamics. Generally, transporters can be divided into ATP binding cassette (ABC) family and solute carrier (SLC) family. Numerous natural products have been identified as inhibitors, substrates, inducers, and/or activators of drug transporters. This review article aims to provide a comprehensive summary of the recent progress on the research of NDIs, focusing on the main drug transporters, such as P-glycoprotein (P-gp), breast cancer resistance protein (BCRP), organic anion transporter 1 and 3 (OAT1/OAT3), organic anion-transporting polypeptide 1B1 and 1B3 (OATP1B1/OATP1B3), organic cation transporter 2 (OCT2), multidrug and toxin extrusion protein 1 and 2-K (MATE1/MATE2-K). Additionally, the challenges and strategies of studying NDIs are also discussed.
Collapse
Affiliation(s)
- Yajuan Bi
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, P. R. China
| | - Xue Wang
- Department of Nutritional Sciences and Toxicology, University of California Berkeley, Berkeley, USA
| | - Hui Ding
- Tianjin State Key Laboratory of Modern Chinese Medicine, Tianjin Key Laboratory of TCM Chemistry and Analysis, Tianjin University of Traditional Chinese Medicine, Tianjin, P. R. China
| | - Feng He
- School of Pharmaceutical Sciences, Guizhou Medical University, Guiyang, Guizhou, P. R. China
| | - Lifeng Han
- Tianjin State Key Laboratory of Modern Chinese Medicine, Tianjin Key Laboratory of TCM Chemistry and Analysis, Tianjin University of Traditional Chinese Medicine, Tianjin, P. R. China
| | - Youcai Zhang
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, P. R. China
| |
Collapse
|
8
|
Vadnais D, Middleton M, Oluwadare O. ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data. BioData Min 2022; 15:19. [PMID: 36131326 PMCID: PMC9494900 DOI: 10.1186/s13040-022-00305-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 08/31/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
The three-dimensional (3D) structure of chromatin has a massive effect on its function. Because of this, it is desirable to have an understanding of the 3D structural organization of chromatin. To gain greater insight into the spatial organization of chromosomes and genomes and the functions they perform, chromosome conformation capture (3C) techniques, particularly Hi-C, have been developed. The Hi-C technology is widely used and well-known because of its ability to profile interactions for all read pairs in an entire genome. The advent of Hi-C has greatly expanded our understanding of the 3D genome, genome folding, gene regulation and has enabled the development of many 3D chromosome structure reconstruction methods.
Results
Here, we propose a novel approach for 3D chromosome and genome structure reconstruction from Hi-C data using Particle Swarm Optimization (PSO) approach called ParticleChromo3D. This algorithm begins with a grouping of candidate solution locations for each chromosome bin, according to the particle swarm algorithm, and then iterates its position towards a global best candidate solution. While moving towards the optimal global solution, each candidate solution or particle uses its own local best information and a randomizer to choose its path. Using several metrics to validate our results, we show that ParticleChromo3D produces a robust and rigorous representation of the 3D structure for input Hi-C data. We evaluated our algorithm on simulated and real Hi-C data in this work. Our results show that ParticleChromo3D is more accurate than most of the existing algorithms for 3D structure reconstruction.
Conclusions
Our results also show that constructed ParticleChromo3D structures are very consistent, hence indicating that it will always arrive at the global solution at every iteration. The source code for ParticleChromo3D, the simulated and real Hi-C datasets, and the models generated for these datasets are available here: https://github.com/OluwadareLab/ParticleChromo3D
Collapse
|
9
|
Hirata Y, Oda AH, Motono C, Shiro M, Ohta K. Imputation-free reconstructions of three-dimensional chromosome architectures in human diploid single-cells using allele-specified contacts. Sci Rep 2022; 12:11757. [PMID: 35817790 PMCID: PMC9273635 DOI: 10.1038/s41598-022-15038-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 06/16/2022] [Indexed: 11/18/2022] Open
Abstract
Single-cell Hi-C analysis of diploid human cells is difficult because of the lack of dense chromosome contact information and the presence of homologous chromosomes with very similar nucleotide sequences. Thus here, we propose a new algorithm to reconstruct the three-dimensional (3D) chromosomal architectures from the Hi-C dataset of single diploid human cells using allele-specific single-nucleotide variations (SNVs). We modified our recurrence plot-based algorithm, which is suitable for the estimation of the 3D chromosome structure from sparse Hi-C datasets, by newly incorporating a function of discriminating SNVs specific to each homologous chromosome. Here, we eventually regard a contact map as a recurrence plot. Importantly, the proposed method does not require any imputation for ambiguous segment information, but could efficiently reconstruct 3D chromosomal structures in single human diploid cells at a 1-Mb resolution. Datasets of segments without allele-specific SNVs, which were considered to be of little value, can also be used to validate the estimated chromosome structure. Introducing an additional mathematical measure called a refinement further improved the resolution to 40-kb or 100-kb. The reconstruction data supported the notion that human chromosomes form chromosomal territories and take fractal structures where the dimension for the underlying chromosome structure is a non-integer value.
Collapse
Affiliation(s)
- Yoshito Hirata
- Faculty of Engineering, Information and Systems, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 305-8573, Japan.
| | - Arisa H Oda
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Meguro-ku, Tokyo, 153-8902, Japan
| | - Chie Motono
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo, 135-0064, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-0072, Japan
| | - Masanori Shiro
- Mathematical Neuroscience Research Group, Human Informatics and Interaction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Ibaraki, 305-8568, Japan
| | - Kunihiro Ohta
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Meguro-ku, Tokyo, 153-8902, Japan.,Research Center for Complex Systems Biology, Universal Biology Institute, 3-8-1 Komaba, Meguro-ku, Tokyo, 153-8902, Japan
| |
Collapse
|
10
|
Yildirim A, Boninsegna L, Zhan Y, Alber F. Uncovering the Principles of Genome Folding by 3D Chromatin Modeling. Cold Spring Harb Perspect Biol 2022; 14:a039693. [PMID: 34400556 PMCID: PMC9248826 DOI: 10.1101/cshperspect.a039693] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Our understanding of how genomic DNA is tightly packed inside the nucleus, yet is still accessible for vital cellular processes, has grown dramatically over recent years with advances in microscopy and genomics technologies. Computational methods have played a pivotal role in the structural interpretation of experimental data, which helped unravel some organizational principles of genome folding. Here, we give an overview of current computational efforts in mechanistic and data-driven 3D chromatin structure modeling. We discuss strengths and limitations of different methods and evaluate the added value and benefits of computational approaches to infer the 3D structural and dynamic properties of the genome and its underlying mechanisms at different scales and resolution, ranging from the dynamic formation of chromatin loops and topological associated domains to nuclear compartmentalization of chromatin and nuclear bodies.
Collapse
Affiliation(s)
- Asli Yildirim
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Lorenzo Boninsegna
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Yuxiang Zhan
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, California 90095, USA
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA
| | - Frank Alber
- Institute for Quantitative and Computational Biosciences, Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, California 90095, USA
- Quantitative and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA
| |
Collapse
|
11
|
Boninsegna L, Yildirim A, Zhan Y, Alber F. Integrative approaches in genome structure analysis. Structure 2022; 30:24-36. [PMID: 34963059 PMCID: PMC8959402 DOI: 10.1016/j.str.2021.12.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 11/13/2021] [Accepted: 12/01/2021] [Indexed: 12/17/2022]
Abstract
New technological advances in integrated imaging, sequencing-based assays, and computational analysis have revolutionized our view of genomes in terms of their structure and dynamics in space and time. These advances promise a deeper understanding of genome functions and mechanistic insights into how the nucleus is spatially organized and functions. These wide arrays of complementary data provide an opportunity to produce quantitative integrative models of nuclear organization. In this article, we highlight recent key developments and discuss the outlook for these fields.
Collapse
Affiliation(s)
- Lorenzo Boninsegna
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Asli Yildirim
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Yuxiang Zhan
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Frank Alber
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
12
|
Di Stefano M, Nützmann HW. Modeling the 3D genome of plants. Nucleus 2021; 12:65-81. [PMID: 34057011 PMCID: PMC8168717 DOI: 10.1080/19491034.2021.1927503] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 04/29/2021] [Accepted: 05/05/2021] [Indexed: 12/11/2022] Open
Abstract
Chromosomes are the carriers of inheritable traits and define cell function and development. This is not only based on the linear DNA sequence of chromosomes but also on the additional molecular information they are associated with, including the transcription machinery, histone modifications, and their three-dimensional folding. The synergistic application of experimental approaches and computer simulations has helped to unveil how these organizational layers of the genome interplay in various organisms. However, such multidisciplinary approaches are still rarely explored in the plant kingdom. Here, we provide an overview of our current knowledge on plant 3D genome organization and review recent efforts to integrate cutting-edge experiments from microscopy and next-generation sequencing approaches with theoretical models. Building on these recent approaches, we propose possible avenues to extend the application of theoretical modeling in the characterization of the 3D genome organization in plants.
Collapse
Affiliation(s)
- Marco Di Stefano
- Institute of Human Genetics, Centre National de la Recherche Scientifique, University of Montpellier, Montpellier, France
| | - Hans-Wilhelm Nützmann
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
| |
Collapse
|
13
|
Li R, Li L, Xu Y, Yang J. Machine learning meets omics: applications and perspectives. Brief Bioinform 2021; 23:6425809. [PMID: 34791021 DOI: 10.1093/bib/bbab460] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 02/07/2023] Open
Abstract
The innovation of biotechnologies has allowed the accumulation of omics data at an alarming rate, thus introducing the era of 'big data'. Extracting inherent valuable knowledge from various omics data remains a daunting problem in bioinformatics. Better solutions often need some kind of more innovative methods for efficient handlings and effective results. Recent advancements in integrated analysis and computational modeling of multi-omics data helped address such needs in an increasingly harmonious manner. The development and application of machine learning have largely advanced our insights into biology and biomedicine and greatly promoted the development of therapeutic strategies, especially for precision medicine. Here, we propose a comprehensive survey and discussion on what happened, is happening and will happen when machine learning meets omics. Specifically, we describe how artificial intelligence can be applied to omics studies and review recent advancements at the interface between machine learning and the ever-widest range of omics including genomics, transcriptomics, proteomics, metabolomics, radiomics, as well as those at the single-cell resolution. We also discuss and provide a synthesis of ideas, new insights, current challenges and perspectives of machine learning in omics.
Collapse
Affiliation(s)
- Rufeng Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Lixin Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Yungang Xu
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an, 710129, China
| | - Juan Yang
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China.,Key Laboratory of Environment and Genes Related to Diseases (Xi'an Jiaotong University), Ministry of Education of China, Xi'an 710061, P. R. China
| |
Collapse
|
14
|
Collins B, Oluwadare O, Brown P. ChromeBat: A Bio-Inspired Approach to 3D Genome Reconstruction. Genes (Basel) 2021; 12:1757. [PMID: 34828363 PMCID: PMC8617892 DOI: 10.3390/genes12111757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 10/28/2021] [Accepted: 11/01/2021] [Indexed: 11/20/2022] Open
Abstract
With the advent of Next Generation Sequencing and the Hi-C experiment, high quality genome-wide contact data are becoming increasingly available. These data represents an empirical measure of how a genome interacts inside the nucleus. Genome conformation is of particular interest as it has been experimentally shown to be a driving force for many genomic functions from regulation to transcription. Thus, the Three Dimensional-Genome Reconstruction Problem (3D-GRP) seeks to take Hi-C data and produces a complete physical genome structure as it appears in the nucleus for genomic analysis. We propose and develop a novel method to solve the Chromosome and Genome Reconstruction problem based on the Bat Algorithm (BA) which we called ChromeBat. We demonstrate on real Hi-C data that ChromeBat is capable of state-of-the-art performance. Additionally, the domain of Genome Reconstruction has been criticized for lacking algorithmic diversity, and the bio-inspired nature of ChromeBat contributes algorithmic diversity to the problem domain. ChromeBat is an effective approach for solving the Genome Reconstruction Problem.
Collapse
Affiliation(s)
| | - Oluwatosin Oluwadare
- Department of Computer Science, University of Colorado, Colorado Springs, CO 80918, USA; (B.C.); (P.B.)
| | | |
Collapse
|
15
|
Liang J, Perez-Rathke A. Minimalistic 3D chromatin models: Sparse interactions in single cells drive the chromatin fold and form many-body units. Curr Opin Struct Biol 2021; 71:200-214. [PMID: 34399301 DOI: 10.1016/j.sbi.2021.06.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/27/2021] [Accepted: 06/29/2021] [Indexed: 11/26/2022]
Abstract
Computational three-dimensional chromatin modeling has helped uncover principles of genome organization. Here, we discuss methods for modeling three-dimensional chromatin structures, with focus on a minimalistic polymer model which inverts population Hi-C into single-cell conformations. Utilizing only basic physical properties, this model reveals that a few specific Hi-C interactions can fold chromatin into conformations consistent with single-cell imaging, Dip-C, and FISH measurements. Aggregated single-cell chromatin conformations also reproduce Hi-C frequencies. This approach allows quantification of structural heterogeneity and discovery of many-body interaction units and has revealed additional insights, including (1) topologically associating domains as a byproduct of folding driven by specific interactions, (2) cell subpopulations with different structural scaffolds are developmental stage dependent, and (3) the functional landscape of many-body units within enhancer-rich regions. We also discuss these findings in relation to the genome structure-function relationship.
Collapse
Affiliation(s)
- Jie Liang
- Center for Bioinformatics and Quantitative Biology & Richard and Loan Hill Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, 60612, USA.
| | - Alan Perez-Rathke
- Center for Bioinformatics and Quantitative Biology & Richard and Loan Hill Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, 60612, USA
| |
Collapse
|
16
|
Gong H, Yang Y, Zhang S, Li M, Zhang X. Application of Hi-C and other omics data analysis in human cancer and cell differentiation research. Comput Struct Biotechnol J 2021; 19:2070-2083. [PMID: 33995903 PMCID: PMC8086027 DOI: 10.1016/j.csbj.2021.04.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 04/04/2021] [Accepted: 04/04/2021] [Indexed: 02/07/2023] Open
Abstract
With the development of 3C (chromosome conformation capture) and its derivative technology Hi-C (High-throughput chromosome conformation capture) research, the study of the spatial structure of the genomic sequence in the nucleus helps researchers understand the functions of biological processes such as gene transcription, replication, repair, and regulation. In this paper, we first introduce the research background and purpose of Hi-C data visualization analysis. After that, we discuss the Hi-C data analysis methods from genome 3D structure, A/B compartment, TADs (topologically associated domain), and loop detection. We also discuss how to apply genome visualization technologies to the identification of chromosome feature structures. We continue with a review of correlation analysis differences among multi-omics data, and how to apply Hi-C and other omics data analysis into cancer and cell differentiation research. Finally, we summarize the various problems in joint analyses based on Hi-C and other multi-omics data. We believe this review can help researchers better understand the progress and applications of 3D genome technology.
Collapse
Affiliation(s)
- Haiyan Gong
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, China
- Shunde Graduate School of University of Science and Technology Beijing, Foshan 528000, China
| | - Yi Yang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Sichen Zhang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Minghong Li
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
| | - Xiaotong Zhang
- Department of Computer Science and Technology, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China
- Beijing Key Laboratory of Knowledge Engineering for Materials Science, Beijing 100083, China
- Shunde Graduate School of University of Science and Technology Beijing, Foshan 528000, China
| |
Collapse
|
17
|
Belokopytova P, Fishman V. Predicting Genome Architecture: Challenges and Solutions. Front Genet 2021; 11:617202. [PMID: 33552135 PMCID: PMC7862721 DOI: 10.3389/fgene.2020.617202] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Accepted: 12/15/2020] [Indexed: 12/22/2022] Open
Abstract
Genome architecture plays a pivotal role in gene regulation. The use of high-throughput methods for chromatin profiling and 3-D interaction mapping provide rich experimental data sets describing genome organization and dynamics. These data challenge development of new models and algorithms connecting genome architecture with epigenetic marks. In this review, we describe how chromatin architecture could be reconstructed from epigenetic data using biophysical or statistical approaches. We discuss the applicability and limitations of these methods for understanding the mechanisms of chromatin organization. We also highlight the emergence of new predictive approaches for scoring effects of structural variations in human cells.
Collapse
Affiliation(s)
- Polina Belokopytova
- Natural Sciences Department, Novosibirsk State University, Novosibirsk, Russia
- Institute of Cytology and Genetics Siberian Branch of Russian Academy of Sciences (SB RAS), Novosibirsk, Russia
| | - Veniamin Fishman
- Natural Sciences Department, Novosibirsk State University, Novosibirsk, Russia
- Institute of Cytology and Genetics Siberian Branch of Russian Academy of Sciences (SB RAS), Novosibirsk, Russia
| |
Collapse
|
18
|
Di Stefano M, Paulsen J, Jost D, Marti-Renom MA. 4D nucleome modeling. Curr Opin Genet Dev 2020; 67:25-32. [PMID: 33253996 PMCID: PMC8098745 DOI: 10.1016/j.gde.2020.10.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 10/18/2020] [Accepted: 10/24/2020] [Indexed: 02/01/2023]
Abstract
The intrinsic dynamic nature of chromosomes is emerging as a fundamental component in regulating DNA transcription, replication, and damage-repair among other nuclear functions. With this increased awareness, reinforced over the last ten years, many new experimental techniques, mainly based on microscopy and chromosome conformation capture, have been introduced to study the genome in space and time. Owing to the increasing complexity of these cutting-edge techniques, computational approaches have become of paramount importance to interpret, contextualize, and complement such experiments with new insights. Hence, it is becoming crucial for experimental biologists to have a clear understanding of the diverse theoretical modeling approaches available and the biological information each of them can provide.
Collapse
Affiliation(s)
- Marco Di Stefano
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain.
| | - Jonas Paulsen
- EVOGENE, Department of Biosciences, Faculty of Natural Sciences, University of Oslo, 0316 Oslo, Norway
| | - Daniel Jost
- Université de Lyon, ENS de Lyon, Univ Claude Bernard, CNRS, Laboratoire de Biologie et Modélisation de la Cellule, Lyon, France
| | - Marc A Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain; Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain; ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain.
| |
Collapse
|