1
|
Drogaris C, Zhang Y, Zhang E, Nazarova E, Sarrazin-Gendron R, Wilhelm-Landry S, Cyr Y, Majewski J, Blanchette M, Waldispühl J. ARGV: 3D genome structure exploration using augmented reality. BMC Bioinformatics 2024; 25:277. [PMID: 39192184 DOI: 10.1186/s12859-024-05882-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 07/25/2024] [Indexed: 08/29/2024] Open
Abstract
Over the past two decades, scientists have increasingly realized the importance of the three-dimensional (3D) genome organization in regulating cellular activity. Hi-C and related experiments yield 2D contact matrices that can be used to infer 3D models of chromosome structure. Visualizing and analyzing genomes in 3D space remains challenging. Here, we present ARGV, an augmented reality 3D Genome Viewer. ARGV contains more than 350 pre-computed and annotated genome structures inferred from Hi-C and imaging data. It offers interactive and collaborative visualization of genomes in 3D space, using standard mobile phones or tablets. A user study comparing ARGV to existing tools demonstrates its benefits.
Collapse
Affiliation(s)
| | - Yanlin Zhang
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Eric Zhang
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Elena Nazarova
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | | | | | - Yan Cyr
- Beam Me Up Inc., 5925 Monkland Ave, Suite, 100, Montréal, H4A 1G7, Canada
| | - Jacek Majewski
- Department of Human Genetics, McGill University, Montréal, QC, H3A 1B1, Canada
| | - Mathieu Blanchette
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, Montréal, QC, H3A 0E9, Canada.
| |
Collapse
|
2
|
Poinsignon T, Gallopin M, Grognet P, Malagnac F, Lelandais G, Poulain P. 3D models of fungal chromosomes to enhance visual integration of omics data. NAR Genom Bioinform 2023; 5:lqad104. [PMID: 38058589 PMCID: PMC10696920 DOI: 10.1093/nargab/lqad104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 10/11/2023] [Accepted: 11/17/2023] [Indexed: 12/08/2023] Open
Abstract
The functions of eukaryotic chromosomes and their spatial architecture in the nucleus are reciprocally dependent. Hi-C experiments are routinely used to study chromosome 3D organization by probing chromatin interactions. Standard representation of the data has relied on contact maps that show the frequency of interactions between parts of the genome. In parallel, it has become easier to build 3D models of the entire genome based on the same Hi-C data, and thus benefit from the methodology and visualization tools developed for structural biology. 3D modeling of entire genomes leverages the understanding of their spatial organization. However, this opportunity for original and insightful modeling is underexploited. In this paper, we show how seeing the spatial organization of chromosomes can bring new perspectives to omics data integration. We assembled state-of-the-art tools into a workflow that goes from Hi-C raw data to fully annotated 3D models and we re-analysed public omics datasets available for three fungal species. Besides the well-described properties of the spatial organization of their chromosomes (Rabl conformation, hypercoiling and chromosome territories), our results highlighted (i) in Saccharomyces cerevisiae, the backbones of the cohesin anchor regions, which were aligned all along the chromosomes, (ii) in Schizosaccharomyces pombe, the oscillations of the coiling of chromosome arms throughout the cell cycle and (iii) in Neurospora crassa, the massive relocalization of histone marks in mutants of heterochromatin regulators. 3D modeling of the chromosomes brings new opportunities for visual integration of omics data. This holistic perspective supports intuition and lays the foundation for building new concepts.
Collapse
Affiliation(s)
- Thibault Poinsignon
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Mélina Gallopin
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Pierre Grognet
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Fabienne Malagnac
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Gaëlle Lelandais
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Pierre Poulain
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| |
Collapse
|
3
|
Halladjian S, Kouril D, Miao H, Groller ME, Viola I, Isenberg T. Multiscale Unfolding: Illustratively Visualizing the Whole Genome at a Glance. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2022; 28:3456-3470. [PMID: 33705319 DOI: 10.1109/tvcg.2021.3065443] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
We present Multiscale Unfolding, an interactive technique for illustratively visualizing multiple hierarchical scales of DNA in a single view, showing the genome at different scales and demonstrating how one scale spatially folds into the next. The DNA's extremely long sequential structure-arranged differently on several distinct scale levels-is often lost in traditional 3D depictions, mainly due to its multiple levels of dense spatial packing and the resulting occlusion. Furthermore, interactive exploration of this complex structure is cumbersome, requiring visibility management like cut-aways. In contrast to existing temporally controlled multiscale data exploration, we allow viewers to always see and interact with any of the involved scales. For this purpose we separate the depiction into constant-scale and scale transition zones. Constant-scale zones maintain a single-scale representation, while still linearly unfolding the DNA. Inspired by illustration, scale transition zones connect adjacent constant-scale zones via level unfolding, scaling, and transparency. We thus represent the spatial structure of the whole DNA macro-molecule, maintain its local organizational characteristics, linearize its higher-level organization, and use spatially controlled, understandable interpolation between neighboring scales. We also contribute interaction techniques that provide viewers with a coarse-to-fine control for navigating within our all-scales-in-one-view representations and visual aids to illustrate the size differences. Overall, Multiscale Unfolding allows viewers to grasp the DNA's structural composition from chromosomes to the atoms, with increasing levels of "unfoldedness," and can be applied in data-driven illustration and communication.
Collapse
|
4
|
Sehnal D, Bittrich S, Deshpande M, Svobodová R, Berka K, Bazgier V, Velankar S, Burley SK, Koča J, Rose AS. Mol* Viewer: modern web app for 3D visualization and analysis of large biomolecular structures. Nucleic Acids Res 2021; 49:W431-W437. [PMID: 33956157 PMCID: PMC8262734 DOI: 10.1093/nar/gkab314] [Citation(s) in RCA: 483] [Impact Index Per Article: 161.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 04/12/2021] [Accepted: 04/26/2021] [Indexed: 12/31/2022] Open
Abstract
Large biomolecular structures are being determined experimentally on a daily basis using established techniques such as crystallography and electron microscopy. In addition, emerging integrative or hybrid methods (I/HM) are producing structural models of huge macromolecular machines and assemblies, sometimes containing 100s of millions of non-hydrogen atoms. The performance requirements for visualization and analysis tools delivering these data are increasing rapidly. Significant progress in developing online, web-native three-dimensional (3D) visualization tools was previously accomplished with the introduction of the LiteMol suite and NGL Viewers. Thereafter, Mol* development was jointly initiated by PDBe and RCSB PDB to combine and build on the strengths of LiteMol (developed by PDBe) and NGL (developed by RCSB PDB). The web-native Mol* Viewer enables 3D visualization and streaming of macromolecular coordinate and experimental data, together with capabilities for displaying structure quality, functional, or biological context annotations. High-performance graphics and data management allows users to simultaneously visualise up to hundreds of (superimposed) protein structures, stream molecular dynamics simulation trajectories, render cell-level models, or display huge I/HM structures. It is the primary 3D structure viewer used by PDBe and RCSB PDB. It can be easily integrated into third-party services. Mol* Viewer is open source and freely available at https://molstar.org/.
Collapse
Affiliation(s)
- David Sehnal
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic.,Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sebastian Bittrich
- Research Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, San Diego, CA 92093-0743, USA
| | - Mandar Deshpande
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Radka Svobodová
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic
| | - Karel Berka
- Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Olomouc 771 46, Czech Republic
| | - Václav Bazgier
- Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Olomouc 771 46, Czech Republic
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8076, USA.,Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08903-2681, USA.,Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), San Diego Supercomputer Center and Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA 92093-0654, USA
| | - Jaroslav Koča
- CEITEC - Central European Institute of Technology, Masaryk University, Brno 625 00, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno 602 00, Czech Republic
| | - Alexander S Rose
- Research Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, San Diego, CA 92093-0743, USA
| |
Collapse
|
5
|
O'Donoghue SI. Grand Challenges in Bioinformatics Data Visualization. FRONTIERS IN BIOINFORMATICS 2021; 1:669186. [PMID: 36303723 PMCID: PMC9581027 DOI: 10.3389/fbinf.2021.669186] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/30/2021] [Indexed: 01/17/2023] Open
Affiliation(s)
- Seán I. O'Donoghue
- Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, NSW, Australia
- CSIRO Data61, Eveleigh, NSW, Australia
| |
Collapse
|
6
|
Todd S, Todd P, McGowan SJ, Hughes JR, Kakui Y, Leymarie FF, Latham W, Taylor S. CSynth: an interactive modelling and visualization tool for 3D chromatin structure. Bioinformatics 2021; 37:951-955. [PMID: 32866221 PMCID: PMC8128456 DOI: 10.1093/bioinformatics/btaa757] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 08/18/2020] [Accepted: 08/24/2020] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION The 3D structure of chromatin in the nucleus is important for gene expression and regulation. Chromosome conformation capture techniques, such as Hi-C, generate large amounts of data showing interaction points on the genome but these are hard to interpret using standard tools. RESULTS We have developed CSynth, an interactive 3D genome browser and real-time chromatin restraint-based modeller to visualize models of any chromosome conformation capture (3C) data. Unlike other modelling systems, CSynth allows dynamic interaction with the modelling parameters to allow experimentation and effects on the model. It also allows comparison of models generated from data in different tissues/cell states and the results of third-party 3D modelling outputs. In addition, we include an option to view and manipulate these complicated structures using Virtual Reality (VR) so scientists can immerse themselves in the models for further understanding. This VR component has also proven to be a valuable teaching and a public engagement tool. AVAILABILITYAND IMPLEMENTATION CSynth is web based and available to use at csynth.org. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Stephen Todd
- Department of Computing, Goldsmiths, University of London, London, UK
- London Geometry, Ltd., London, UK
| | | | - Simon J McGowan
- Analysis, Visualization and Informatics, MRC Weatherall Institute of Molecular Medicine, Oxford, UK
| | - James R Hughes
- Genome Biology Group, MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, Oxford, UK
| | - Yasutaka Kakui
- The Francis Crick Institute, Chromosome Segregation Laboratory, London, UK
| | - Frederic Fol Leymarie
- Department of Computing, Goldsmiths, University of London, London, UK
- London Geometry, Ltd., London, UK
| | - William Latham
- Department of Computing, Goldsmiths, University of London, London, UK
- London Geometry, Ltd., London, UK
| | - Stephen Taylor
- Analysis, Visualization and Informatics, MRC Weatherall Institute of Molecular Medicine, Oxford, UK
| |
Collapse
|
7
|
Roy U. Insight into the structures of Interleukin-18 systems. Comput Biol Chem 2020; 88:107353. [PMID: 32769049 PMCID: PMC7392904 DOI: 10.1016/j.compbiolchem.2020.107353] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 07/01/2020] [Accepted: 07/28/2020] [Indexed: 02/08/2023]
Abstract
Structure-based molecular designs play a critical role in the context of next generation drug development. Besides their fundamental scientific aspects, the findings established in this approach have significant implications in the expansions of target-based therapies and vaccines. Interleukin-18 (IL-18), also known as interferon gamma (IFN-γ) inducing factor, is a pro-inflammatory cytokine. The IL-18 binds first to the IL-18α receptor and forms a lower affinity complex. Upon binding with IL-18β a hetero-trimeric complex with higher affinity is formed that initiates the signal transduction process. The present study, including structural and molecular dynamics simulations, takes a close look at the structural stabilities of IL-18 and IL-18 receptor-bound ligand structures as functions of time. The results help to identify the conformational changes of the ligand due to receptor binding, as well as the structural orders of the apo and holo IL-18 protein complexes.
Collapse
Affiliation(s)
- Urmi Roy
- Department of Chemistry & Biomolecular Science, Clarkson University, 8 Clarkson Avenue, Potsdam, NY 13699-5820, United States.
| |
Collapse
|
8
|
|
9
|
Halladjian S, Miao H, Kouril D, Groller ME, Viola I, Isenberg T. Scale Trotter: Illustrative Visual Travels Across Negative Scales. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2020; 26:654-664. [PMID: 31425102 DOI: 10.1109/tvcg.2019.2934334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
We present ScaleTrotter, a conceptual framework for an interactive, multi-scale visualization of biological mesoscale data and, specifically, genome data. ScaleTrotter allows viewers to smoothly transition from the nucleus of a cell to the atomistic composition of the DNA, while bridging several orders of magnitude in scale. The challenges in creating an interactive visualization of genome data are fundamentally different in several ways from those in other domains like astronomy that require a multi-scale representation as well. First, genome data has intertwined scale levels-the DNA is an extremely long, connected molecule that manifests itself at all scale levels. Second, elements of the DNA do not disappear as one zooms out-instead the scale levels at which they are observed group these elements differently. Third, we have detailed information and thus geometry for the entire dataset and for all scale levels, posing a challenge for interactive visual exploration. Finally, the conceptual scale levels for genome data are close in scale space, requiring us to find ways to visually embed a smaller scale into a coarser one. We address these challenges by creating a new multi-scale visualization concept. We use a scale-dependent camera model that controls the visual embedding of the scales into their respective parents, the rendering of a subset of the scale hierarchy, and the location, size, and scope of the view. In traversing the scales, ScaleTrotter is roaming between 2D and 3D visual representations that are depicted in integrated visuals. We discuss, specifically, how this form of multi-scale visualization follows from the specific characteristics of the genome data and describe its implementation. Finally, we discuss the implications of our work to the general illustrative depiction of multi-scale data.
Collapse
|
10
|
Oluwadare O, Highsmith M, Cheng J. An Overview of Methods for Reconstructing 3-D Chromosome and Genome Structures from Hi-C Data. Biol Proced Online 2019; 21:7. [PMID: 31049033 PMCID: PMC6482566 DOI: 10.1186/s12575-019-0094-0] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 04/01/2019] [Indexed: 01/08/2023] Open
Abstract
Over the past decade, methods for predicting three-dimensional (3-D) chromosome and genome structures have proliferated. This has been primarily due to the development of high-throughput, next-generation chromosome conformation capture (3C) technologies, which have provided next-generation sequencing data about chromosome conformations in order to map the 3-D genome structure. The introduction of the Hi-C technique-a variant of the 3C method-has allowed researchers to extract the interaction frequency (IF) for all loci of a genome at high-throughput and at a genome-wide scale. In this review we describe, categorize, and compare the various methods developed to map chromosome and genome structures from 3C data-particularly Hi-C data. We summarize the improvements introduced by these methods, describe the approach used for method evaluation, and discuss how these advancements shape the future of genome structure construction.
Collapse
Affiliation(s)
- Oluwatosin Oluwadare
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Max Highsmith
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211 USA
- Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| |
Collapse
|
11
|
Lin D, Bonora G, Yardımcı GG, Noble WS. Computational methods for analyzing and modeling genome structure and organization. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2019; 11:e1435. [PMID: 30022617 PMCID: PMC6294685 DOI: 10.1002/wsbm.1435] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Revised: 06/07/2018] [Accepted: 06/16/2018] [Indexed: 12/31/2022]
Abstract
Recent advances in chromosome conformation capture technologies have led to the discovery of previously unappreciated structural features of chromatin. Computational analysis has been critical in detecting these features and thereby helping to uncover the building blocks of genome architecture. Algorithms are being developed to integrate these architectural features to construct better three-dimensional (3D) models of the genome. These computational methods have revealed the importance of 3D genome organization to essential biological processes. In this article, we review the state of the art in analytic and modeling techniques with a focus on their application to answering various biological questions related to chromatin structure. We summarize the limitations of these computational techniques and suggest future directions, including the importance of incorporating multiple sources of experimental data in building a more comprehensive model of the genome. This article is categorized under: Analytical and Computational Methods > Computational Methods Laboratory Methods and Technologies > Genetic/Genomic Methods Models of Systems Properties and Processes > Mechanistic Models.
Collapse
Affiliation(s)
- Dejun Lin
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Giancarlo Bonora
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | - William S. Noble
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
12
|
Goodstadt MN, Marti-Renom MA. Communicating Genome Architecture: Biovisualization of the Genome, from Data Analysis and Hypothesis Generation to Communication and Learning. J Mol Biol 2018; 431:1071-1087. [PMID: 30419242 DOI: 10.1016/j.jmb.2018.11.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 10/29/2018] [Accepted: 11/01/2018] [Indexed: 01/07/2023]
Abstract
Genome discoveries at the core of biology are made by visual description and exploration of the cell, from microscopic sketches and biochemical mapping to computational analysis and spatial modeling. We outline the experimental and visualization techniques that have been developed recently which capture the three-dimensional interactions regulating how genes are expressed. We detail the challenges faced in integration of the data to portray the components and organization and their dynamic landscape. The goal is more than a single data-driven representation as interactive visualization for de novo research is paramount to decipher insights on genome organization in space.
Collapse
Affiliation(s)
- Mike N Goodstadt
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain; Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain.
| | - Marc A Marti-Renom
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona 08028, Spain; Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, Barcelona 08010, Spain.
| |
Collapse
|
13
|
Waldispühl J, Zhang E, Butyaev A, Nazarova E, Cyr Y. Storage, visualization, and navigation of 3D genomics data. Methods 2018; 142:74-80. [PMID: 29792917 DOI: 10.1016/j.ymeth.2018.05.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Revised: 05/07/2018] [Accepted: 05/09/2018] [Indexed: 01/27/2023] Open
Abstract
The field of 3D genomics grew at increasing rates in the last decade. The volume and complexity of 2D and 3D data produced is progressively outpacing the capacities of the technology previously used for distributing genome sequences. The emergence of new technologies provides also novel opportunities for the development of innovative approaches. In this paper, we review the state-of-the-art computing technology, as well as the solutions adopted by the platforms currently available.
Collapse
Affiliation(s)
| | - Eric Zhang
- School of Computer Science, McGill University, Montréal, Canada
| | | | - Elena Nazarova
- School of Computer Science, McGill University, Montréal, Canada
| | - Yan Cyr
- Beam Me Up Labs, Montréal, Canada
| |
Collapse
|
14
|
Abstract
It is well known that the chromosomes are organized in the nucleus and this spatial arrangement of genome play a crucial role in gene regulation and genome stability. Different techniques have been developed and applied to uncover the intrinsic mechanism of genome architecture, especially the chromosome conformation capture (3C) and 3C-derived methods. 3C and 3C-derived techniques provide us approaches to perform high-throughput chromatin architecture assays at the genome scale. However, the advantage and disadvantage of current methodologies of C-technologies have not been discussed extensively. In this review, we described and compared the methodologies of C-technologies used in genome organization studies with an emphasis on Hi-C method. We also discussed the crucial challenges facing current genome architecture studies based on 3C and 3C-derived technologies and the direction of future technologies to address currently outstanding questions in the field. These latest news contribute to our current understanding of genome structure, and provide a comprehensive reference for researchers to choose the appropriate method in future application. We consider that these constantly improving technologies will offer a finer and more accurate contact profiles of entire genome and ultimately reveal specific molecular machines govern its shape and function.
Collapse
|
15
|
Miao H, De Llano E, Sorger J, Ahmadi Y, Kekic T, Isenberg T, Groller ME, Barisic I, Viola I. Multiscale Visualization and Scale-Adaptive Modification of DNA Nanostructures. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2018; 24:1014-1024. [PMID: 28866510 DOI: 10.1109/tvcg.2017.2743981] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We present an approach to represent DNA nanostructures in varying forms of semantic abstraction, describe ways to smoothly transition between them, and thus create a continuous multiscale visualization and interaction space for applications in DNA nanotechnology. This new way of observing, interacting with, and creating DNA nanostructures enables domain experts to approach their work in any of the semantic abstraction levels, supporting both low-level manipulations and high-level visualization and modifications. Our approach allows them to deal with the increasingly complex DNA objects that they are designing, to improve their features, and to add novel functions in a way that no existing single-scale approach offers today. For this purpose we collaborated with DNA nanotechnology experts to design a set of ten semantic scales. These scales take the DNA's chemical and structural behavior into account and depict it from atoms to the targeted architecture with increasing levels of abstraction. To create coherence between the discrete scales, we seamlessly transition between them in a well-defined manner. We use special encodings to allow experts to estimate the nanoscale object's stability. We also add scale-adaptive interactions that facilitate the intuitive modification of complex structures at multiple scales. We demonstrate the applicability of our approach on an experimental use case. Moreover, feedback from our collaborating domain experts confirmed an increased time efficiency and certainty for analysis and modification tasks on complex DNA structures. Our method thus offers exciting new opportunities with promising applications in medicine and biotechnology.
Collapse
|
16
|
Goodstadt M, Marti‐Renom MA. Challenges for visualizing three-dimensional data in genomic browsers. FEBS Lett 2017; 591:2505-2519. [PMID: 28771695 PMCID: PMC5638070 DOI: 10.1002/1873-3468.12778] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Revised: 07/30/2017] [Accepted: 07/31/2017] [Indexed: 12/14/2022]
Abstract
Genomic interactions reveal the spatial organization of genomes and genomic domains, which is known to play key roles in cell function. Physical proximity can be represented as two-dimensional heat maps or matrices. From these, three-dimensional (3D) conformations of chromatin can be computed revealing coherent structures that highlight the importance of nonsequential relationships across genomic features. Mainstream genomic browsers have been classically developed to display compact, stacked tracks based on a linear, sequential, per-chromosome coordinate system. Genome-wide comparative analysis demands new approaches to data access and new layouts for analysis. The legibility can be compromised when displaying track-aligned second dimension matrices, which require greater screen space. Moreover, 3D representations of genomes defy vertical alignment in track-based genome browsers. Furthermore, investigation at previously unattainable levels of detail is revealing multiscale, multistate, time-dependent complexity. This article outlines how these challenges are currently handled in mainstream browsers as well as how novel techniques in visualization are being explored to address them. A set of requirements for coherent visualization of novel spatial genomic data is defined and the resulting potential for whole genome visualization is described.
Collapse
Affiliation(s)
- Mike Goodstadt
- Structural Genomics GroupCNAG‐CRGThe Barcelona Institute of Science and Technology (BIST)Spain
- Gene Regulation, Stem Cells and Cancer ProgramCentre for Genomic Regulation (CRG)The Barcelona Institute of Science and Technology (BIST)Spain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
| | - Marc A. Marti‐Renom
- Structural Genomics GroupCNAG‐CRGThe Barcelona Institute of Science and Technology (BIST)Spain
- Gene Regulation, Stem Cells and Cancer ProgramCentre for Genomic Regulation (CRG)The Barcelona Institute of Science and Technology (BIST)Spain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- Institució Catalana de Recerca i Estudis Avançats (ICREA)BarcelonaSpain
| |
Collapse
|
17
|
Han Z, Wei G. Computational tools for Hi-C data analysis. QUANTITATIVE BIOLOGY 2017. [DOI: 10.1007/s40484-017-0113-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
18
|
Djekidel MN, Wang M, Zhang MQ, Gao J. HiC-3DViewer: a new tool to visualize Hi-C data in 3D space. QUANTITATIVE BIOLOGY 2016. [DOI: 10.1007/s40484-017-0091-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
19
|
Gao J, Yang X, Djekidel MN, Wang Y, Xi P, Zhang MQ. Developing bioimaging and quantitative methods to study 3D genome. QUANTITATIVE BIOLOGY 2016. [DOI: 10.1007/s40484-016-0065-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
20
|
Nowotny J, Wells A, Oluwadare O, Xu L, Cao R, Trieu T, He C, Cheng J. GMOL: An Interactive Tool for 3D Genome Structure Visualization. Sci Rep 2016; 6:20802. [PMID: 26868282 PMCID: PMC4751627 DOI: 10.1038/srep20802] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 01/12/2016] [Indexed: 02/06/2023] Open
Abstract
It has been shown that genome spatial structures largely affect both genome activity and DNA function. Knowing this, many researchers are currently attempting to accurately model genome structures. Despite these increased efforts there still exists a shortage of tools dedicated to visualizing the genome. Creating a tool that can accurately visualize the genome can aid researchers by highlighting structural relationships that may not be obvious when examining the sequence information alone. Here we present a desktop application, known as GMOL, designed to effectively visualize genome structures so that researchers may better analyze genomic data. GMOL was developed based upon our multi-scale approach that allows a user to scale between six separate levels within the genome. With GMOL, a user can choose any unit at any scale and scale it up or down to visualize its structure and retrieve corresponding genome sequences. Users can also interactively manipulate and measure the whole genome structure and extract static images and machine-readable data files in PDB format from the multi-scale structure. By using GMOL researchers will be able to better understand and analyze genome structure models and the impact their structural relations have on genome activity and DNA function.
Collapse
Affiliation(s)
- Jackson Nowotny
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Avery Wells
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | | | - Lingfei Xu
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Renzhi Cao
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Tuan Trieu
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Chenfeng He
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Jianlin Cheng
- Computer Science Department, University of Missouri, Columbia, MO 65211, USA.,Informatics Institute, University of Missouri, Columbia, MO 65211, USA.,C.S. Bond Life Science Center, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
21
|
Capurso D, Bengtsson H, Segal MR. Discovering hotspots in functional genomic data superposed on 3D chromatin configuration reconstructions. Nucleic Acids Res 2016; 44:2028-35. [PMID: 26869583 PMCID: PMC4797302 DOI: 10.1093/nar/gkw070] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2015] [Accepted: 01/25/2016] [Indexed: 11/14/2022] Open
Abstract
The spatial organization of the genome influences cellular function, notably gene regulation. Recent studies have assessed the three-dimensional (3D) co-localization of functional annotations (e.g. centromeres, long terminal repeats) using 3D genome reconstructions from Hi-C (genome-wide chromosome conformation capture) data; however, corresponding assessments for continuous functional genomic data (e.g. chromatin immunoprecipitation-sequencing (ChIP-seq) peak height) are lacking. Here, we demonstrate that applying bump hunting via the patient rule induction method (PRIM) to ChIP-seq data superposed on a Saccharomyces cerevisiae 3D genome reconstruction can discover ‘functional 3D hotspots’, regions in 3-space for which the mean ChIP-seq peak height is significantly elevated. For the transcription factor Swi6, the top hotspot by P-value contains MSB2 and ERG11 – known Swi6 target genes on different chromosomes. We verify this finding in a number of ways. First, this top hotspot is relatively stable under PRIM across parameter settings. Second, this hotspot is among the top hotspots by mean outcome identified by an alternative algorithm, k-Nearest Neighbor (k-NN) regression. Third, the distance between MSB2 and ERG11 is smaller than expected (by resampling) in two other 3D reconstructions generated via different normalization and reconstruction algorithms. This analytic approach can discover functional 3D hotspots and potentially reveal novel regulatory interactions.
Collapse
Affiliation(s)
- Daniel Capurso
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158, USA
| | - Henrik Bengtsson
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94158, USA
| | - Mark R Segal
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
22
|
Ay F, Noble WS. Analysis methods for studying the 3D architecture of the genome. Genome Biol 2015; 16:183. [PMID: 26328929 PMCID: PMC4556012 DOI: 10.1186/s13059-015-0745-7] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 08/10/2015] [Indexed: 11/10/2022] Open
Abstract
The rapidly increasing quantity of genome-wide chromosome conformation capture data presents great opportunities and challenges in the computational modeling and interpretation of the three-dimensional genome. In particular, with recent trends towards higher-resolution high-throughput chromosome conformation capture (Hi-C) data, the diversity and complexity of biological hypotheses that can be tested necessitates rigorous computational and statistical methods as well as scalable pipelines to interpret these datasets. Here we review computational tools to interpret Hi-C data, including pipelines for mapping, filtering, and normalization, and methods for confidence estimation, domain calling, visualization, and three-dimensional modeling.
Collapse
Affiliation(s)
- Ferhat Ay
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA. .,Feinberg School of Medicine, Northwestern University, Chicago, 60661, IL, USA.
| | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA. .,Department of Computer Science and Engineering, University of Washington, Seattle, 98195, WA, USA.
| |
Collapse
|
23
|
Butyaev A, Mavlyutov R, Blanchette M, Cudré-Mauroux P, Waldispühl J. A low-latency, big database system and browser for storage, querying and visualization of 3D genomic data. Nucleic Acids Res 2015; 43:e103. [PMID: 25990738 PMCID: PMC4652742 DOI: 10.1093/nar/gkv476] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2015] [Accepted: 04/29/2015] [Indexed: 01/19/2023] Open
Abstract
Recent releases of genome three-dimensional (3D) structures have the potential to transform our understanding of genomes. Nonetheless, the storage technology and visualization tools need to evolve to offer to the scientific community fast and convenient access to these data. We introduce simultaneously a database system to store and query 3D genomic data (3DBG), and a 3D genome browser to visualize and explore 3D genome structures (3DGB). We benchmark 3DBG against state-of-the-art systems and demonstrate that it is faster than previous solutions, and importantly gracefully scales with the size of data. We also illustrate the usefulness of our 3D genome Web browser to explore human genome structures. The 3D genome browser is available at http://3dgb.cs.mcgill.ca/.
Collapse
|
24
|
Serra F, Di Stefano M, Spill YG, Cuartero Y, Goodstadt M, Baù D, Marti-Renom MA. Restraint-based three-dimensional modeling of genomes and genomic domains. FEBS Lett 2015; 589:2987-95. [PMID: 25980604 DOI: 10.1016/j.febslet.2015.05.012] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Revised: 05/05/2015] [Accepted: 05/05/2015] [Indexed: 10/23/2022]
Abstract
Chromosomes are large polymer molecules composed of nucleotides. In some species, such as humans, this polymer can sum up to meters long and still be properly folded within the nuclear space of few microns in size. The exact mechanisms of how the meters long DNA is folded into the nucleus, as well as how the regulatory machinery can access it, is to a large extend still a mystery. However, and thanks to newly developed molecular, genomic and computational approaches based on the Chromosome Conformation Capture (3C) technology, we are now obtaining insight on how genomes are spatially organized. Here we review a new family of computational approaches that aim at using 3C-based data to obtain spatial restraints for modeling genomes and genomic domains.
Collapse
Affiliation(s)
- François Serra
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Marco Di Stefano
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Yannick G Spill
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Yasmina Cuartero
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Michael Goodstadt
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Davide Baù
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Marc A Marti-Renom
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain; Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
| |
Collapse
|
25
|
Shepherd JJ, Zhou L, Arndt W, Zhang Y, Zheng WJ, Tang J. Exploring genomes with a game engine. Faraday Discuss 2014; 169:443-53. [DOI: 10.1039/c3fd00152k] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
More and more evidence indicates that the 3D conformation of eukaryotic genomes is a critical part of genome function. However, due to the lack of accurate and reliable 3D genome structural data, this information is largely ignored and most of these studies have to use information systems that view the DNA in a linear structure. Visualizing genomes in real time 3D can give researchers more insight, but this is fraught with hardware limitations since each element contains vast amounts of information that cannot be processed on the fly. Using a game engine and sophisticated video game visualization techniques enables us to construct a multi-platform real-time 3D genome viewer. The game engine-based viewer achieves much better rendering speed and can handle much larger amounts of data compared to our previous implementation using OpenGL. Combining this viewer with 3D genome models from experimental data could provide unprecedented opportunities to gain insight into the conformation–function relationships of a genome.
Collapse
Affiliation(s)
| | - Lingxi Zhou
- Department of Computer Science and Engineering
- University of South Carolina
- , USA
| | - William Arndt
- Howard Hughes Medical Institute
- Janelia Farm Research Campus
- Ashburn, USA
| | - Yan Zhang
- Department of Computer Science and Engineering
- University of South Carolina
- , USA
| | - W. Jim Zheng
- School of Biomedical Informatics
- U. Texas Health Science Centre at Houston
- , USA
| | - Jijun Tang
- Department of Computer Science and Engineering
- University of South Carolina
- , USA
- Key Laboratory of Systems Bioengineering of the Ministry of Education
- Tianjin University
| |
Collapse
|
26
|
Mercer TR, Mattick JS. Understanding the regulatory and transcriptional complexity of the genome through structure. Genome Res 2013; 23:1081-8. [PMID: 23817049 PMCID: PMC3698501 DOI: 10.1101/gr.156612.113] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
An expansive functionality and complexity has been ascribed to the majority of the human genome that was unanticipated at the outset of the draft sequence and assembly a decade ago. We are now faced with the challenge of integrating and interpreting this complexity in order to achieve a coherent view of genome biology. We argue that the linear representation of the genome exacerbates this complexity and an understanding of its three-dimensional structure is central to interpreting the regulatory and transcriptional architecture of the genome. Chromatin conformation capture techniques and high-resolution microscopy have afforded an emergent global view of genome structure within the nucleus. Chromosomes fold into complex, territorialized three-dimensional domains in concert with specialized subnuclear bodies that harbor concentrations of transcription and splicing machinery. The signature of these folds is retained within the layered regulatory landscapes annotated by chromatin immunoprecipitation, and we propose that genome contacts are reflected in the organization and expression of interweaved networks of overlapping coding and noncoding transcripts. This pervasive impact of genome structure favors a preeminent role for the nucleoskeleton and RNA in regulating gene expression by organizing these folds and contacts. Accordingly, we propose that the local and global three-dimensional structure of the genome provides a consistent, integrated, and intuitive framework for interpreting and understanding the regulatory and transcriptional complexity of the human genome.
Collapse
Affiliation(s)
- Tim R Mercer
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | | |
Collapse
|