1
Hartmann T, Middendorf M, Bernt M. Genome Rearrangement Analysis: Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches. Methods Mol Biol 2024; 2802:215-245. [PMID: 38819562 DOI: 10.1007/978-1-0716-3838-5_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/01/2024]
Abstract
Genome rearrangements are mutations that change the gene content of a genome or the arrangement of the genes on a genome. Several years of research on genome rearrangements have established different algorithmic approaches for solving some fundamental problems in comparative genomics based on gene order information. This review summarizes the literature on genome rearrangement analysis along two lines of research. The first line considers rearrangement models that are particularly well suited for a theoretical analysis. These models use rearrangement operations that cut chromosomes into fragments and then join the fragments into new chromosomes. The second line works with rearrangement models that reflect several biologically motivated constraints, e.g., the constraint that gene clusters have to be preserved. In this chapter, the border between algorithmically "easy" and "hard" rearrangement problems is sketched and a brief review is given on the available software tools for genome rearrangement analysis.
Affiliation(s)
- Tom Hartmann
- Swarm Intelligence and Complex Systems Group, Institute of Computer Science, University Leipzig, Leipzig, Germany
- Martin Middendorf
- Swarm Intelligence and Complex Systems Group, Institute of Computer Science, University Leipzig, Leipzig, Germany
2
Robert NSM, Sarigol F, Zieger E, Simakov O. SYNPHONI: scale-free and phylogeny-aware reconstruction of synteny conservation and transformation across animal genomes. Bioinformatics 2022; 38:5434-5436. [PMID: 36269177 PMCID: PMC9750109 DOI: 10.1093/bioinformatics/btac695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/22/2022] [Revised: 09/24/2022] [Accepted: 10/19/2022] [Indexed: 12/25/2022] Open
Abstract
Summary: Current approaches detect conserved genomic order either at chromosomal (macrosynteny) or at subchromosomal scales (microsynteny). The latter generally requires collinearity and hard thresholds on syntenic region size, thus excluding a major proportion of syntenies with recent expansions or minor rearrangements. 'SYNPHONI' bridges the gap between micro- and macrosynteny detection, providing detailed information on both synteny conservation and transformation throughout the evolutionary history of animal genomes. Availability and implementation: Source code is freely available at https://github.com/nsmro/SYNPHONI, implemented in Python 3.9. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Fatih Sarigol
- Department of Neurosciences and Developmental Biology, University of Vienna, Vienna A-1030, Austria
- Elisabeth Zieger
- Department of Evolutionary Biology, University of Vienna, Vienna A-1030, Austria
3
Zhao C, Liu T, Wang Z. Functional Similarities of Protein-Coding Genes in Topologically Associating Domains and Spatially-Proximate Genomic Regions. Genes (Basel) 2022; 13:480. [PMID: 35328034 PMCID: PMC8951421 DOI: 10.3390/genes13030480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/01/2022] [Revised: 02/26/2022] [Accepted: 03/05/2022] [Indexed: 02/01/2023] Open
Abstract
Topologically associating domains (TADs) are the structural and functional units of the genome. However, the functions of protein-coding genes existing in the same or different TADs have not been fully investigated. We compared the functional similarities of protein-coding genes existing in the same TAD and between different TADs, and also in the same gap region (the region between two consecutive TADs) and between different gap regions. We found that the protein-coding genes from the same TAD or gap region are more likely to share similar protein functions, and this trend is more obvious with TADs than the gap regions. We further created two types of gene–gene spatial interaction networks: the first type is based on Hi-C contacts, whereas the second type is based on both Hi-C contacts and the relationship of being in the same TAD. A graph auto-encoder was applied to learn the network topology, reconstruct the two types of networks, and predict the functions of the central genes/nodes based on the functions of the neighboring genes/nodes. It was found that better performance was achieved with the second type of network. Furthermore, we detected long-range spatially-interactive regions based on Hi-C contacts and calculated the functional similarities of the gene pairs from these regions.
4
Dai Z. Gene Repositioning Is Under Constraints After Evolutionary Conserved Gene Neighborhood Separate. Front Genet 2019; 10:1030. [PMID: 31632448 PMCID: PMC6785632 DOI: 10.3389/fgene.2019.01030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 06/12/2019] [Accepted: 09/25/2019] [Indexed: 11/13/2022] Open
Abstract
Genes are not randomly distributed on eukaryotic chromosomes. Some neighboring genes show order conservation among species, while some neighboring genes separate during evolution even though their neighborhoods are conserved in some species. Here, I investigated whether after-separation gene repositioning is under natural selection for evolutionary conserved gene neighborhoods compared with nonconserved neighborhoods. After separation, genes with conserved neighborhoods show low-expression divergence between the after-separation species and the before-separation species. After genes separate from their conserved gene neighbors, their after-separation gene neighbors tend to show coexpression and coprotein complex with their before-separation gene neighbors. These results indicate evolutionary constraints on the selection of neighboring genes after evolutionary conserved gene neighborhoods separate.
Affiliation(s)
- Zhiming Dai
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, China; Guangdong Province Key Laboratory of Big Data Analysis and Processing, Sun Yat-Sen University, Guangzhou, China
5
Simonaitis P, Chateau A, Swenson KM. A general framework for genome rearrangement with biological constraints. Algorithms Mol Biol 2019; 14:15. [PMID: 31360217 PMCID: PMC6642580 DOI: 10.1186/s13015-019-0149-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2019] [Accepted: 06/12/2019] [Indexed: 11/25/2022] Open
Abstract
This paper generalizes previous studies on genome rearrangement under biological constraints, using double cut and join (DCJ). We propose a model for weighted DCJ, along with a family of optimization problems called φ-MCPS (Minimum Cost Parsimonious Scenario), that are based on labeled graphs. We show how to compute solutions to general instances of φ-MCPS, given an algorithm to compute φ-MCPS on a circular genome with exactly one occurrence of each gene. These general instances can have an arbitrary number of circular and linear chromosomes, and arbitrary gene content. The practicality of the framework is displayed by presenting polynomial-time algorithms that generalize the results of Bulteau, Fertin, and Tannier on the Sorting by wDCJs and indels in intergenes problem, and that generalize previous results on the Minimum Local Parsimonious Scenario problem.
6
Swenson KM, Blanchette M. Large-scale mammalian genome rearrangements coincide with chromatin interactions. Bioinformatics 2019; 35:i117-i126. [PMID: 31510664 PMCID: PMC6612848 DOI: 10.1093/bioinformatics/btz343] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 12/04/2022] Open
Abstract
Motivation: Genome rearrangements drastically change gene order along great stretches of a chromosome. There has been initial evidence that these apparently non-local events in the 1D sense may have breakpoints that are close in the 3D sense. We harness the power of the Double Cut and Join model of genome rearrangement, along with Hi-C chromosome conformation capture data to test this hypothesis between human and mouse. Results: We devise novel statistical tests that show that indeed, rearrangement scenarios that transform the human into the mouse gene order are enriched for pairs of breakpoints that have frequent chromosome interactions. This is observed for both intra-chromosomal breakpoint pairs, as well as for inter-chromosomal pairs. For intra-chromosomal rearrangements, the enrichment exists from close (<20 Mb) to very distant (100 Mb) pairs. Further, the pattern exists across multiple cell lines in Hi-C data produced by different laboratories and at different stages of the cell cycle. We show that similarities in the contact frequencies between these many experiments contribute to the enrichment. We conclude that either (i) rearrangements usually involve breakpoints that are spatially close or (ii) there is selection against rearrangements that act on spatially distant breakpoints. Availability and implementation: Our pipeline is freely available at https://bitbucket.org/thekswenson/locality. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Krister M Swenson
- Laboratoire d'Informatique, de Robotique, et de Microelectronique de Montpellier (LIRMM), Université Montpellier, Montpellier, France; Centre National de la Recherche Scientifique (CNRS), France
7
Evolutionary Loss of Genomic Proximity to Conserved Noncoding Elements Impacted the Gene Expression Dynamics During Mammalian Brain Development. Genetics 2019; 211:1239-1254. [PMID: 30796012 DOI: 10.1534/genetics.119.301973] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 09/24/2018] [Accepted: 02/14/2019] [Indexed: 01/05/2023] Open
Abstract
Conserved noncoding elements (CNEs) have a significant regulatory influence on their neighboring genes. Loss of proximity to CNEs through genomic rearrangements can, therefore, impact the transcriptional states of the cognate genes. Yet, the evolutionary implications of such chromosomal alterations have not been studied. Through genome-wide analysis of CNEs and the cognate genes of representative species from five different mammalian orders, we observed a significant loss of genes' linear proximity to CNEs in the rat lineage. The CNEs and the genes losing proximity had a significant association with fetal, but not postnatal, brain development as assessed through ontology terms, developmental gene expression, chromatin marks, and genetic mutations. The loss of proximity to CNEs correlated with the independent evolutionary loss of fetus-specific upregulation of nearby genes in the rat brain. DNA breakpoints implicated in brain abnormalities of germline origin had significant representation between a CNE and the gene that exhibited loss of proximity, signifying the underlying developmental tolerance of genomic rearrangements that allowed the evolutionary splits of CNEs and the cognate genes in the rodent lineage. Our observations highlighted a nontrivial impact of chromosomal rearrangements in shaping the evolutionary dynamics of mammalian brain development and might explain the loss of brain traits, like cerebral folding of the cortex, in the rodent lineage.
8
Skinner BM, Bacon J, Rathje CC, Larson EL, Kopania EEK, Good JM, Affara NA, Ellis PJI. Automated Nuclear Cartography Reveals Conserved Sperm Chromosome Territory Localization across 2 Million Years of Mouse Evolution. Genes (Basel) 2019; 10:109. [PMID: 30717218 PMCID: PMC6409866 DOI: 10.3390/genes10020109] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 12/30/2018] [Revised: 01/27/2019] [Accepted: 01/28/2019] [Indexed: 12/15/2022] Open
Abstract
Measurements of nuclear organization in asymmetric nuclei in 2D images have traditionally been manual. This is exemplified by attempts to measure chromosome position in sperm samples, typically by dividing the nucleus into zones, and manually scoring which zone a fluorescence in-situ hybridisation (FISH) signal lies in. This is time consuming, limiting the number of nuclei that can be analyzed, and prone to subjectivity. We have developed a new approach for automated mapping of FISH signals in asymmetric nuclei, integrated into an existing image analysis tool for nuclear morphology. Automatic landmark detection defines equivalent structural regions in each nucleus, then dynamic warping of the FISH images to a common shape allows us to generate a composite of the signal within the entire cell population. Using this approach, we mapped the positions of the sex chromosomes and two autosomes in three mouse lineages (Mus musculus domesticus, Mus musculus musculus and Mus spretus). We found that in all three, chromosomes 11 and 19 tend to interact with each other, but are shielded from interactions with the sex chromosomes. This organization is conserved across 2 million years of mouse evolution.
Affiliation(s)
- Joanne Bacon
- Department of Pathology, University of Cambridge, Cambridge, CB2 1QP, UK.
- Erica Lee Larson
- Department of Biological Sciences, University of Denver, Denver, CO 80208, USA.
- Division of Biological Sciences, University of Montana, MT 59812, USA.
9
O'Connor RE, Kiazim L, Skinner B, Fonseka G, Joseph S, Jennings R, Larkin DM, Griffin DK. Patterns of microchromosome organization remain highly conserved throughout avian evolution. Chromosoma 2018; 128:21-29. [PMID: 30448925 PMCID: PMC6394684 DOI: 10.1007/s00412-018-0685-6] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Received: 09/10/2018] [Revised: 10/31/2018] [Accepted: 11/05/2018] [Indexed: 01/08/2023]
Abstract
The structure and organization of a species genome at a karyotypic level, and in interphase nuclei, have broad functional significance. Although regular sized chromosomes are studied extensively in this regard, microchromosomes, which are present in many terrestrial vertebrates, remain poorly explored. Birds have more cytologically indistinguishable microchromosomes (~ 30 pairs) than other vertebrates; however, the degree to which genome organization patterns at a karyotypic and interphase level differ between species is unknown. In species where microchromosomes have fused to other chromosomes, they retain genomic features such as gene density and GC content; however, the extent to which they retain a central nuclear position has not been investigated. In studying 22 avian species from 10 orders, we established that, other than in species where microchromosomal fusion is obvious (Falconiformes and Psittaciformes), there was no evidence of microchromosomal rearrangement, suggesting an evolutionarily stable avian genome (karyotypic) organization. Moreover, in species where microchromosomal fusion has occurred, they retain a central nuclear location, suggesting that the nuclear position of microchromosomes is a function of their genomic features rather than their physical size.
Affiliation(s)
- Rebecca E O'Connor
- School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
- Lucas Kiazim
- School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
- Ben Skinner
- Department of Pathology, Cambridge University, Cambridge, CB2 1QP, UK
- Gothami Fonseka
- Cytocell Ltd, 3-4 Technopark Newmarket Road Cambridge, Cambridge, CB5 8PB, UK
- Sunitha Joseph
- School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
- Rebecca Jennings
- School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
- Denis M Larkin
- Department of Comparative Biomedical Sciences, Royal Veterinary College, University of London, London, NW1 0TU, UK
- Darren K Griffin
- School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
10
Simonaitis P, Swenson KM. Finding local genome rearrangements. Algorithms Mol Biol 2018; 13:9. [PMID: 29755580 PMCID: PMC5934872 DOI: 10.1186/s13015-018-0127-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 11/14/2017] [Accepted: 04/16/2018] [Indexed: 11/24/2022] Open
Abstract
Background: The double cut and join (DCJ) model of genome rearrangement is well studied due to its mathematical simplicity and power to account for the many events that transform gene order. These studies have mostly been devoted to the understanding of minimum length scenarios transforming one genome into another. In this paper we search instead for rearrangement scenarios that minimize the number of rearrangements whose breakpoints are unlikely due to some biological criteria. One such criterion has recently become accessible due to the advent of the Hi-C experiment, facilitating the study of 3D spatial distance between breakpoint regions. Results: We establish a link between the minimum number of unlikely rearrangements required by a scenario and the problem of finding a maximum edge-disjoint cycle packing on a certain transformed version of the adjacency graph. This link leads to a 3/2-approximation as well as an exact integer linear programming formulation for our problem, which we prove to be NP-complete. We also present experimental results on fruit flies, showing that Hi-C data is informative when used as a criterion for rearrangements. Conclusions: A new variant of the weighted DCJ distance problem is addressed that ignores scenario length in its objective function. A solution to this problem provides a lower bound on the number of unlikely moves necessary when transforming one gene order into another. This lower bound aids in the study of rearrangement scenarios with respect to chromatin structure, and could eventually be used in the design of a fixed parameter algorithm with a more general objective function.
11
Genome Rearrangement Analysis: Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches. Methods Mol Biol 2018; 1704:261-289. [PMID: 29277869 DOI: 10.1007/978-1-4939-7463-4_9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 01/30/2023]
Abstract
Genome rearrangements are mutations that change the gene content of a genome or the arrangement of the genes on a genome. Several years of research on genome rearrangements have established different algorithmic approaches for solving some fundamental problems in comparative genomics based on gene order information. This review summarizes the literature on genome rearrangement analysis along two lines of research. The first line considers rearrangement models that are particularly well suited for a theoretical analysis. These models use rearrangement operations that cut chromosomes into fragments and then join the fragments into new chromosomes. The second line works with rearrangement models that reflect several biologically motivated constraints, e.g., the constraint that gene clusters have to be preserved. In this chapter, the border between algorithmically "easy" and "hard" rearrangement problems is sketched and a brief review is given on the available software tools for genome rearrangement analysis.
12
Fradin H, Kiontke K, Zegar C, Gutwein M, Lucas J, Kovtun M, Corcoran DL, Baugh LR, Fitch DHA, Piano F, Gunsalus KC. Genome Architecture and Evolution of a Unichromosomal Asexual Nematode. Curr Biol 2017; 27:2928-2939.e6. [PMID: 28943090 PMCID: PMC5659720 DOI: 10.1016/j.cub.2017.08.038] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Received: 06/29/2017] [Revised: 08/14/2017] [Accepted: 08/15/2017] [Indexed: 10/24/2022]
Abstract
Asexual reproduction in animals, though rare, is the main or exclusive mode of reproduction in some long-lived lineages. The longevity of asexual clades may be correlated with the maintenance of heterozygosity by mechanisms that rearrange genomes and reduce recombination. Asexual species thus provide an opportunity to gain insight into the relationship between molecular changes, genome architecture, and cellular processes. Here we report the genome sequence of the parthenogenetic nematode Diploscapter pachys with only one chromosome pair. We show that this unichromosomal architecture is shared by a long-lived clade of asexual nematodes closely related to the genetic model organism Caenorhabditis elegans. Analysis of the genome assembly reveals that the unitary chromosome arose through fusion of six ancestral chromosomes, with extensive rearrangement among neighboring regions. Typical nematode telomeres and telomeric protection-encoding genes are lacking. Most regions show significant heterozygosity; homozygosity is largely concentrated to one region and attributed to gene conversion. Cell-biological and molecular evidence is consistent with the absence of key features of meiosis I, including synapsis and recombination. We propose that D. pachys preserves heterozygosity and produces diploid embryos without fertilization through a truncated meiosis. As a prelude to functional studies, we demonstrate that D. pachys is amenable to experimental manipulation by RNA interference.
Affiliation(s)
- Hélène Fradin
- Department of Biology, New York University, New York, NY 10003, USA; Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Karin Kiontke
- Department of Biology, New York University, New York, NY 10003, USA
- Charles Zegar
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Michelle Gutwein
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Jessica Lucas
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Mikhail Kovtun
- Duke Center for Genomic and Computational Biology, Duke University, Durham, NC 27708, USA
- David L Corcoran
- Duke Center for Genomic and Computational Biology, Duke University, Durham, NC 27708, USA
- L Ryan Baugh
- Department of Biology, Duke University, Durham, NC 27708, USA
- David H A Fitch
- Department of Biology, New York University, New York, NY 10003, USA.
- Fabio Piano
- Department of Biology, New York University, New York, NY 10003, USA; Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA; Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.
- Kristin C Gunsalus
- Department of Biology, New York University, New York, NY 10003, USA; Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA; Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.
13
14
Shapiro JA. Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process. Biology 2016; 5:E27. [PMID: 27338490 PMCID: PMC4929541 DOI: 10.3390/biology5020027] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 02/12/2016] [Revised: 05/20/2016] [Accepted: 06/02/2016] [Indexed: 01/15/2023]
Abstract
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago, GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
15
Swenson KM, Simonaitis P, Blanchette M. Models and algorithms for genome rearrangement with positional constraints. Algorithms Mol Biol 2016; 11:13. [PMID: 27190550 PMCID: PMC4869402 DOI: 10.1186/s13015-016-0065-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 02/01/2016] [Accepted: 03/30/2016] [Indexed: 11/10/2022] Open
Abstract
Background: Traditionally, the merit of a rearrangement scenario between two gene orders has been measured based on a parsimony criterion alone; two scenarios with the same number of rearrangements are considered equally good. In this paper, we acknowledge that each rearrangement has a certain likelihood of occurring based on biological constraints, e.g. physical proximity of the DNA segments implicated or repetitive sequences. Results: We propose optimization problems with the objective of maximizing overall likelihood, by weighting the rearrangements. We study a binary weight function suitable to the representation of sets of genome positions that are most likely to have swapped adjacencies. We give a polynomial-time algorithm for the problem of finding a minimum weight double cut and join scenario among all minimum length scenarios. In the process we solve an optimization problem on colored noncrossing partitions, which is a generalization of the Maximum Independent Set problem on circle graphs. Conclusions: We introduce a model for weighting genome rearrangements and show that under simple yet reasonable conditions, a fundamental distance can be computed in polynomial time. This is achieved by solving a generalization of the Maximum Independent Set problem on circle graphs. Several variants of the problem are also mentioned.
16
Bagadia M, Singh A, Singh Sandhu K. Three Dimensional Organization of Genome Might Have Guided the Dynamics of Gene Order Evolution in Eukaryotes. Genome Biol Evol 2016; 8:946-54. [PMID: 26957031 PMCID: PMC4824123 DOI: 10.1093/gbe/evw050] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 12/13/2022] Open
Abstract
In eukaryotes, genes are nonrandomly organized into short gene-dense regions or "gene-clusters" interspersed by long gene-poor regions. How these gene-clusters have evolved is not entirely clear. Gene duplication may not account for all the gene-clusters since the genes in most of the clusters do not exhibit significant sequence similarity. In this study, using genome-wide data sets from budding yeast, fruit-fly, and human, we show that: 1) long-range evolutionary repositioning of genes strongly associates with their spatial proximity in the nucleus; 2) presence of evolutionary DNA break-points at involved loci hints at their susceptibility to undergo long-range genomic rearrangements; and 3) correlated epigenetic and transcriptional states of engaged genes highlight the underlying evolutionary constraints. The significance of observations 1, 2, and 3 is particularly strong for the instances of inferred evolutionary gain, as compared with loss, of linear gene-clustering. These observations suggest that the long-range genomic rearrangements guided through 3D genome organization might have contributed to the evolution of gene order. We further hypothesize that the evolution of linear gene-clusters in eukaryotic genomes might have been mediated through spatial interactions among distant loci in order to optimize co-ordinated regulation of genes. We model this hypothesis through a heuristic model of gene-order evolution.
Affiliation(s)
- Meenakshi Bagadia
- Department of Biological Sciences, Indian Institute of Science Education and Research (IISER)-Mohali, SAS Nagar 140306, India
- Arashdeep Singh
- Department of Biological Sciences, Indian Institute of Science Education and Research (IISER)-Mohali, SAS Nagar 140306, India
- Kuljeet Singh Sandhu
- Department of Biological Sciences, Indian Institute of Science Education and Research (IISER)-Mohali, SAS Nagar 140306, India
17
Khrameeva EE, Fudenberg G, Gelfand MS, Mirny LA. History of chromosome rearrangements reflects the spatial organization of yeast chromosomes. J Bioinform Comput Biol 2016; 14:1641002. [PMID: 27021249 DOI: 10.1142/s021972001641002x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022]
Abstract
Three-dimensional (3D) organization of genomes affects critical cellular processes such as transcription, replication, and deoxyribonucleic acid (DNA) repair. While previous studies have investigated the natural role the 3D organization plays in limiting the possible set of genomic rearrangements following DNA repair, the influence of specific organizational principles on this process, particularly over longer evolutionary time scales, remains relatively unexplored. In budding yeast S. cerevisiae, chromosomes are organized into a Rabl-like configuration, with clustered centromeres and telomeres tethered to the nuclear periphery. Hi-C data for S. cerevisiae show that a consequence of this Rabl-like organization is that regions equally distant from centromeres are more frequently in contact with each other, between arms of both the same and different chromosomes. Here, we detect rearrangement events in Saccharomyces species using an automatic approach, and observe increased rearrangement frequency between regions with higher contact frequencies. Together, our results underscore how specific principles of 3D chromosomal organization can influence evolutionary events.
Affiliation(s)
- Ekaterina E Khrameeva
- 1 Institute for Information Transmission, Problems (the Kharkevich Institute), Russian Academy of Sciences, Bolshoy Karetny per. 19, build. 1, Moscow 127051, Russian Federation.,2 Skolkovo Institute of Science and Technology, Skolkovo Innovation Center, build. 3, Moscow 143026, Russian Federation
| | - Geoffrey Fudenberg
- 3 Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | - Mikhail S Gelfand
- 1 Institute for Information Transmission, Problems (the Kharkevich Institute), Russian Academy of Sciences, Bolshoy Karetny per. 19, build. 1, Moscow 127051, Russian Federation.,4 Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Leninskiye Gory 1-73, Moscow 119991, Russian Federation
| | - Leonid A Mirny
- 3 Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
|
18
|
Berthelot C, Muffato M, Abecassis J, Roest Crollius H. The 3D organization of chromatin explains evolutionary fragile genomic regions. Cell Rep 2015; 10:1913-24. [PMID: 25801028 DOI: 10.1016/j.celrep.2015.02.046] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Received: 07/15/2014] [Revised: 12/17/2014] [Accepted: 02/18/2015] [Indexed: 10/23/2022] Open
Abstract
Genomic rearrangements are a major source of evolutionary divergence in eukaryotic genomes, a cause of genetic diseases and a hallmark of tumor cell progression, yet the mechanisms underlying their occurrence and evolutionary fixation are poorly understood. Statistical associations between breakpoints and specific genomic features suggest that genomes may contain elusive “fragile regions” with a higher propensity for breakage. Here, we use ancestral genome reconstructions to demonstrate a near-perfect correlation between gene density and evolutionary rearrangement breakpoints. Simulations based on functional features in the human genome show that this pattern is best explained as the outcome of DNA breaks that occur in open chromatin regions coming into 3D contact in the nucleus. Our model explains how rearrangements reorganize the order of genes in an evolutionary neutral fashion and provides a basis for understanding the susceptibility of “fragile regions” to breakage.
|
19
|
Sexton T, Cavalli G. The role of chromosome domains in shaping the functional genome. Cell 2015; 160:1049-59. [PMID: 25768903 DOI: 10.1016/j.cell.2015.02.040] [Citation(s) in RCA: 271] [Impact Index Per Article: 30.1] [Received: 12/08/2014] [Indexed: 10/23/2022]
Abstract
The genome must be highly compacted to fit within eukaryotic nuclei but must be accessible to the transcriptional machinery to allow appropriate expression of genes in different cell types and throughout developmental pathways. A growing body of work has shown that the genome, analogously to proteins, forms an ordered, hierarchical structure that closely correlates and may even be causally linked with regulation of functions such as transcription. This review describes our current understanding of how these functional genomic "secondary and tertiary structures" form a blueprint for global nuclear architecture and the potential they hold for understanding and manipulating genomic regulation.
Affiliation(s)
- Tom Sexton
- Institute of Genetics and Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, 67404 Illkirch, France.
| | - Giacomo Cavalli
- Institute of Human Genetics (IGH), 141 rue de la Cardonille, 34396 Montpellier, France.
|
20
|
Boulos RE, Drillon G, Argoul F, Arneodo A, Audit B. Structural organization of human replication timing domains. FEBS Lett 2015; 589:2944-57. [PMID: 25912651 DOI: 10.1016/j.febslet.2015.04.015] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Received: 02/23/2015] [Revised: 04/09/2015] [Accepted: 04/10/2015] [Indexed: 12/16/2022]
Abstract
Recent analyses of genome-wide epigenetic modification data, mean replication timing (MRT) profiles and chromosome conformation data in mammals have provided increasing evidence that flexibility in replication origin usage is regulated locally by the epigenetic landscape and over larger genomic distances by the 3D chromatin architecture. Here, we review the recent results establishing some link between replication domains and chromatin structural domains in pluripotent and various differentiated cell types in human. We reconcile the originally proposed dichotomic picture of early and late constant timing regions that replicate by multiple rather synchronous origins in separated nuclear compartments of open and closed chromatins, with the U-shaped MRT domains bordered by "master" replication origins specified by a localized (∼200-300 kb) zone of open and transcriptionally active chromatin from which a replication wave likely initiates and propagates toward the domain center via a cascade of origin firing. We discuss the relationships between these MRT domains, topologically associated domains and lamina-associated domains. This review sheds new light on the epigenetically regulated global chromatin reorganization that underlies the loss of pluripotency and the determination of differentiation properties.
Affiliation(s)
- Rasha E Boulos
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Guénola Drillon
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Françoise Argoul
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Alain Arneodo
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France
| | - Benjamin Audit
- Université de Lyon, F-69000 Lyon, France; Laboratoire de Physique, CNRS UMR5672, Ecole Normale Supérieure de Lyon, F-69007 Lyon, France.
|
21
|
Chan C, Jayasekera S, Kao B, Páramo M, von Grotthuss M, Ranz JM. Remodelling of a homeobox gene cluster by multiple independent gene reunions in Drosophila. Nat Commun 2015; 6:6509. [DOI: 10.1038/ncomms7509] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 08/04/2014] [Accepted: 02/04/2015] [Indexed: 11/09/2022] Open
|
22
|
Farré M, Robinson TJ, Ruiz-Herrera A. An Integrative Breakage Model of genome architecture, reshuffling and evolution: The Integrative Breakage Model of genome evolution, a novel multidisciplinary hypothesis for the study of genome plasticity. Bioessays 2015; 37:479-88. [PMID: 25739389 DOI: 10.1002/bies.201400174] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Received: 09/22/2014] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 12/23/2022]
Abstract
Our understanding of genomic reorganization, the mechanics of genomic transmission to offspring during germ line formation, and how these structural changes contribute to the speciation process and genetic disease is far from complete. Earlier attempts to understand the mechanism(s) and constraints that govern genome remodeling suffered from being too narrowly focused, and failed to provide a unified and encompassing view of how genomes are organized and regulated inside cells. Here, we propose a new multidisciplinary Integrative Breakage Model for the study of genome evolution. The analysis of the high-level structural organization of genomes (nucleome), together with the functional constraints that accompany genome reshuffling, provides insights into the origin and plasticity of genome organization that may assist with the detection and isolation of therapeutic targets for the treatment of complex human disorders.
Affiliation(s)
- Marta Farré
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Campus UAB, Barcelona, Spain
|
23
|
Thévenin A, Ein-Dor L, Ozery-Flato M, Shamir R. Functional gene groups are concentrated within chromosomes, among chromosomes and in the nuclear space of the human genome. Nucleic Acids Res 2014; 42:9854-61. [PMID: 25056310 PMCID: PMC4150778 DOI: 10.1093/nar/gku667] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Received: 03/04/2013] [Revised: 07/10/2014] [Accepted: 07/10/2014] [Indexed: 12/19/2022] Open
Abstract
Genomes undergo changes in organization as a result of gene duplications, chromosomal rearrangements and local mutations, among other mechanisms. In contrast to prokaryotes, in which genes of a common function are often organized in operons and reside contiguously along the genome, most eukaryotes show much weaker clustering of genes by function, except for a few concrete functional groups. We set out to check systematically if there is a relation between gene function and gene organization in the human genome. We test this question for three types of functional groups: pairs of interacting proteins, complexes and pathways. We find a significant concentration of functional groups both in terms of their distance within the same chromosome and in terms of their dispersal over several chromosomes. Moreover, using a Hi-C contact map of the tendency of chromosomal segments to appear close in the 3D space of the nucleus, we show that members of the same functional group that reside on distinct chromosomes tend to co-localize in space. The result holds for all three types of functional groups that we tested. Hence, the human genome shows substantial concentration of functional groups within chromosomes and across chromosomes in space.
Affiliation(s)
- Annelyse Thévenin
- Genome Informatics, Faculty of Technology and Institute for Bioinformatics, Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld 33615, Germany IBM Research-Haifa, Mount Carmel, Haifa 3498825, Israel
| | - Liat Ein-Dor
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | - Michal Ozery-Flato
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ron Shamir
- IBM Research-Haifa, Mount Carmel, Haifa 3498825, Israel
|
24
|
Ebert G, Steininger A, Weißmann R, Boldt V, Lind-Thomsen A, Grune J, Badelt S, Heßler M, Peiser M, Hitzler M, Jensen LR, Müller I, Hu H, Arndt PF, Kuss AW, Tebel K, Ullmann R. Distribution of segmental duplications in the context of higher order chromatin organisation of human chromosome 7. BMC Genomics 2014; 15:537. [PMID: 24973960 PMCID: PMC4092221 DOI: 10.1186/1471-2164-15-537] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 12/09/2013] [Accepted: 06/17/2014] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Segmental duplications (SDs) are not evenly distributed along chromosomes. The reasons for this biased susceptibility to SD insertion are poorly understood. Accumulation of SDs is associated with increased genomic instability, which can lead to structural variants and genomic disorders such as the Williams-Beuren syndrome. Despite these adverse effects, SDs have become fixed in the human genome. Focusing on chromosome 7, which is particularly rich in interstitial SDs, we have investigated the distribution of SDs in the context of evolution and the three dimensional organisation of the chromosome in order to gain insights into the mutual relationship of SDs and chromatin topology. RESULTS Intrachromosomal SDs preferentially accumulate in those segments of chromosome 7 that are homologous to marmoset chromosome 2. Although this formerly compact segment has been re-distributed to three different sites during primate evolution, we can show by means of public data on long distance chromatin interactions that these three intervals, and consequently the paralogous SDs mapping to them, have retained their spatial proximity in the nucleus. Focusing on SD clusters implicated in the aetiology of the Williams-Beuren syndrome locus we demonstrate by cross-species comparison that these SDs have inserted at the borders of a topological domain and that they flank regions with distinct DNA conformation. CONCLUSIONS Our study suggests a link of nuclear architecture and the propagation of SDs across chromosome 7, either by promoting regional SD insertion or by contributing to the establishment of higher order chromatin organisation themselves. The latter could compensate for the high risk of structural rearrangements and thus may have contributed to their evolutionary fixation in the human genome.
Affiliation(s)
- Grit Ebert
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Anne Steininger
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Robert Weißmann
- Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Vivien Boldt
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Allan Lind-Thomsen
- Wilhelm Johannsen Centre for Functional Genome Research, Department of Cellular and Molecular Medicine, University of Copenhagen, Blegdamsvej 3, DK-2200 Copenhagen, Denmark
| | - Jana Grune
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Stefan Badelt
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- Institute for Theoretical Chemistry, University of Vienna, Waehringer Straße 17, A-1090 Vienna, Austria
| | - Melanie Heßler
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Matthias Peiser
- Unit Experimental Research, Department of Product Safety, Federal Institute for Bundeswehr Institute of Radiobiology affiliated, the University of Ulm, Neuherbergstraße 11, 80937 Munich, Germany
| | - Manuel Hitzler
- Unit Experimental Research, Department of Product Safety, Federal Institute for Bundeswehr Institute of Radiobiology affiliated, the University of Ulm, Neuherbergstraße 11, 80937 Munich, Germany
| | - Lars R Jensen
- Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Ines Müller
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Hao Hu
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Peter F Arndt
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Andreas W Kuss
- Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Katrin Tebel
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Reinhard Ullmann
- Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
|
25
|
Dai Z, Xiong Y, Dai X. Neighboring genes show interchromosomal colocalization after their separation. Mol Biol Evol 2014; 31:1166-72. [PMID: 24505120 DOI: 10.1093/molbev/msu065] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Indexed: 02/06/2023] Open
Abstract
The order of genes on eukaryotic chromosomes is nonrandom. Some neighboring genes show order conservation among species, while some neighboring genes separate during evolution. Here, we investigated whether neighboring genes show interactions after their separation. We found that neighboring gene pairs tend to show interchromosomal colocalization (i.e., nuclear colocalization) in the species in which they separate. These nuclear colocalized separated neighboring gene pairs 1) show neighborhood conservation in more species, 2) tend to be regulated by the same transcription factor, and 3) tend to be regulated by the same histone modification. These results suggest a mechanism by which neighboring genes could retain nuclear proximity after their separation.
Affiliation(s)
- Zhiming Dai
- Department of Electronics and Communication Engineering, School of Information Science and Technology, Sun Yat-Sen University, Guangzhou, China
|
26
|
Boulos RE, Arneodo A, Jensen P, Audit B. Revealing long-range interconnected hubs in human chromatin interaction data using graph theory. PHYSICAL REVIEW LETTERS 2013; 111:118102. [PMID: 24074120 DOI: 10.1103/physrevlett.111.118102] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Received: 03/19/2013] [Indexed: 06/02/2023]
Abstract
We use graph theory to analyze chromatin interaction (Hi-C) data in the human genome. We show that a key functional feature of the genome--"master" replication origins--corresponds to DNA loci of maximal network centrality. These loci form a set of interconnected hubs both within chromosomes and between different chromosomes. Our results open the way to a fruitful use of graph theory concepts to decipher DNA structural organization in relation to genome functions such as replication and transcription. This quantitative information should prove useful to discriminate between possible polymer models of nuclear organization.
Affiliation(s)
- R E Boulos
- Université de Lyon, F-69000 Lyon, France and Laboratoire de Physique, ENS de Lyon, CNRS UMR5672, F-69007 Lyon, France
|
27
|
Genomic positions of co-expressed genes: echoes of chromosome organisation in gene expression data. BMC Res Notes 2013; 6:229. [PMID: 23764369 PMCID: PMC3689077 DOI: 10.1186/1756-0500-6-229] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Received: 11/29/2012] [Accepted: 05/28/2013] [Indexed: 01/12/2023] Open
Abstract
Background The relationships between gene expression and nuclear structure, chromosome territories in particular, are currently being elucidated experimentally. Each chromosome occupies an individual, spatially-limited space with a preferential position relative to the nuclear centre that may be specific to the cell and tissue type. We sought to discover whether patterns in gene expression databases might exist that would mirror prevailing or recurring nuclear structure patterns, chromosome territory interactions in particular. Results We used human gene expression datasets, both from a tissue expression atlas and from a large set including diverse types of perturbations. We identified groups of positional gene clusters over-represented in gene expression clusters. We show that some pairs of chromosomes and pairs of 10 Mbp long chromosome regions are significantly enriched in the expression clusters. The functions of genes involved in inter-chromosome co-expression relationships are non-random and predominantly related to cell-cell communication and reaction to external stimuli. Conclusions We suggest that inter-chromosomal gene co-expression can be interpreted in the context of nuclear structure, and that even expression datasets that include very diverse conditions and cell types show consistent relationships.
|
28
|
Berná L, Chaurasia A, Angelini C, Federico C, Saccone S, D'Onofrio G. The footprint of metabolism in the organization of mammalian genomes. BMC Genomics 2012; 13:174. [PMID: 22568857 PMCID: PMC3384468 DOI: 10.1186/1471-2164-13-174] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Received: 07/26/2011] [Accepted: 05/08/2012] [Indexed: 01/02/2023] Open
Abstract
Background At present, five evolutionary hypotheses have been proposed to explain the great variability of the genomic GC content among and within genomes: the mutational bias, the biased gene conversion, the DNA breakpoints distribution, the thermal stability and the metabolic rate. Several studies carried out on bacteria and teleostean fish pointed towards the critical role played by the environment on the metabolic rate in shaping the base composition of genomes. In mammals the debate is still open, and evidence has been produced in favor of each evolutionary hypothesis. Human genes were assigned to three large functional categories (as well as to the corresponding functional classes) according to the KOG database: (i) information storage and processing, (ii) cellular processes and signaling, and (iii) metabolism. The classification was extended to the organisms so far analyzed performing a reciprocal Blastp and selecting the best reciprocal hit. The base composition was calculated for each sequence of the whole CDS dataset. Results The GC3 level of the above functional categories was increasing from (i) to (iii). This specific compositional pattern was found, as footprint, in all mammalian genomes, but not in frog and lizard ones. Comparative analysis of human versus both frog and lizard functional categories showed that genes involved in the metabolic processes underwent the highest GC3 increment. Analyzing the KOG functional classes of genes, again a well defined intra-genomic pattern was found in all mammals. Not only genes of metabolic pathways, but also genes involved in chromatin structure and dynamics, transcription, signal transduction mechanisms and cytoskeleton, showed an average GC3 level higher than that of the whole genome. In the case of the human genome, the genes of the aforementioned functional categories showed a high probability to be associated with the chromosomal bands.
Conclusions In the light of the different evolutionary hypotheses proposed so far, each contributing with different potential to the compositional heterogeneity of mammalian genomes, the one based on the metabolic rate seems to play no minor role. Keeping in mind similar results reported in bacteria and in teleosts, the specific compositional patterns observed in mammals highlight metabolic rate as a unifying factor that fits over a wide range of living organisms.
Affiliation(s)
- Luisa Berná
- Genome Evolution and Organization - Department Animal Physiology and Evolution, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy
|
29
|
Linard B, Nguyen NH, Prosdocimi F, Poch O, Thompson JD. EvoluCode: Evolutionary Barcodes as a Unifying Framework for Multilevel Evolutionary Data. Evol Bioinform Online 2011; 8:61-77. [PMID: 22267905 PMCID: PMC3256995 DOI: 10.4137/ebo.s8814] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 11/05/2022] Open
Abstract
Evolutionary systems biology aims to uncover the general trends and principles governing the evolution of biological networks. An essential part of this process is the reconstruction and analysis of the evolutionary histories of these complex, dynamic networks. Unfortunately, the methodologies for representing and exploiting such complex evolutionary histories in large scale studies are currently limited. Here, we propose a new formalism, called EvoluCode (Evolutionary barCode), which allows the integration of different evolutionary parameters (eg, sequence conservation, orthology, synteny …) in a unifying format and facilitates the multilevel analysis and visualization of complex evolutionary histories at the genome scale. The advantages of the approach are demonstrated by constructing barcodes representing the evolution of the complete human proteome. Two large-scale studies are then described: (i) the mapping and visualization of the barcodes on the human chromosomes and (ii) automatic clustering of the barcodes to highlight protein subsets sharing similar evolutionary histories and their functional analysis. The methodologies developed here open the way to the efficient application of other data mining and knowledge extraction techniques in evolutionary systems biology studies. A database containing all EvoluCode data is available at: http://lbgi.igbmc.fr/barcodes.
Affiliation(s)
- Benjamin Linard
- Laboratoire De Bioinformatique Et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire CNRS/INSERM/UDS, Illkirch, France
| | - Ngoc Hoan Nguyen
- Laboratoire De Bioinformatique Et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire CNRS/INSERM/UDS, Illkirch, France
| | | | - Olivier Poch
- Laboratoire De Bioinformatique Et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire CNRS/INSERM/UDS, Illkirch, France
| | - Julie D. Thompson
- Laboratoire De Bioinformatique Et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire CNRS/INSERM/UDS, Illkirch, France
|