1
|
Seaman J, Buggs RJA. FluentDNA: Nucleotide Visualization of Whole Genomes, Annotations, and Alignments. Front Genet 2020; 11:292. [PMID: 32425967 PMCID: PMC7203487 DOI: 10.3389/fgene.2020.00292] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 03/11/2020] [Indexed: 12/03/2022] Open
Abstract
Researchers seldom look at naked genome assemblies: instead the attributes of DNA sequences are mediated through statistics, annotations and high level summaries. Here we present software that visualizes the bare sequences of whole genome assemblies in a zoomable interface. This can assist in detection of chromosome architecture and contamination by the naked eye through changes in color patterns, in the absence of any other annotation. When available, annotations can be visualized alongside or on top of the naked sequence. Genome alignments can also be visualized, laying two genomes side by side in an alignment and highlighting their differences at nucleotide resolution. FluentDNA gives researchers direct visualization of whole genome assemblies, annotations and alignments, for quality control, hypothesis generation, and communicating results.
Collapse
Affiliation(s)
- Josiah Seaman
- Royal Botanic Gardens Kew, Jodrell Laboratory, Richmond, United Kingdom.,School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom
| | - Richard J A Buggs
- Royal Botanic Gardens Kew, Jodrell Laboratory, Richmond, United Kingdom.,School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom
| |
Collapse
|
2
|
Badenhorst D, Hillier LW, Literman R, Montiel EE, Radhakrishnan S, Shen Y, Minx P, Janes DE, Warren WC, Edwards SV, Valenzuela N. Physical Mapping and Refinement of the Painted Turtle Genome (Chrysemys picta) Inform Amniote Genome Evolution and Challenge Turtle-Bird Chromosomal Conservation. Genome Biol Evol 2015; 7:2038-50. [PMID: 26108489 PMCID: PMC4524486 DOI: 10.1093/gbe/evv119] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/19/2015] [Indexed: 01/04/2023] Open
Abstract
Comparative genomics continues illuminating amniote genome evolution, but for many lineages our understanding remains incomplete. Here, we refine the assembly (CPI 3.0.3 NCBI AHGY00000000.2) and develop a cytogenetic map of the painted turtle (Chrysemys picta-CPI) genome, the first in turtles and in vertebrates with temperature-dependent sex determination. A comparison of turtle genomes with those of chicken, selected nonavian reptiles, and human revealed shared and novel genomic features, such as numerous chromosomal rearrangements. The largest conserved syntenic blocks between birds and turtles exist in four macrochromosomes, whereas rearrangements were evident in these and other chromosomes, disproving that turtles and birds retain fully conserved macrochromosomes for greater than 300 Myr. C-banding revealed large heterochromatic blocks in the centromeric region of only few chromosomes. The nucleolar-organizing region (NOR) mapped to a single CPI microchromosome, whereas in some turtles and lizards the NOR maps to nonhomologous sex-chromosomes, thus revealing independent translocations of the NOR in various reptilian lineages. There was no evidence for recent chromosomal fusions as interstitial telomeric-DNA was absent. Some repeat elements (CR1-like, Gypsy) were enriched in the centromeres of five chromosomes, whereas others were widespread in the CPI genome. Bacterial artificial chromosome (BAC) clones were hybridized to 18 of the 25 CPI chromosomes and anchored to a G-banded ideogram. Several CPI sex-determining genes mapped to five chromosomes, and homology was detected between yet other CPI autosomes and the globally nonhomologous sex chromosomes of chicken, other turtles, and squamates, underscoring the independent evolution of vertebrate sex-determining mechanisms.
Collapse
Affiliation(s)
- Daleen Badenhorst
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | | | - Robert Literman
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | | | | | - Yingjia Shen
- The Genome Institute at Washington University, St Louis
| | - Patrick Minx
- The Genome Institute at Washington University, St Louis
| | - Daniel E Janes
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University Department of Organismic and Evolutionary Biology, Harvard University
| | | | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Harvard University
| | - Nicole Valenzuela
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| |
Collapse
|
3
|
Doležel J, Vrána J, Safář J, Bartoš J, Kubaláková M, Simková H. Chromosomes in the flow to simplify genome analysis. Funct Integr Genomics 2012; 12:397-416. [PMID: 22895700 PMCID: PMC3431466 DOI: 10.1007/s10142-012-0293-0] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2012] [Accepted: 07/30/2012] [Indexed: 11/25/2022]
Abstract
Nuclear genomes of human, animals, and plants are organized into subunits called chromosomes. When isolated into aqueous suspension, mitotic chromosomes can be classified using flow cytometry according to light scatter and fluorescence parameters. Chromosomes of interest can be purified by flow sorting if they can be resolved from other chromosomes in a karyotype. The analysis and sorting are carried out at rates of 10(2)-10(4) chromosomes per second, and for complex genomes such as wheat the flow sorting technology has been ground-breaking in reducing genome complexity for genome sequencing. The high sample rate provides an attractive approach for karyotype analysis (flow karyotyping) and the purification of chromosomes in large numbers. In characterizing the chromosome complement of an organism, the high number that can be studied using flow cytometry allows for a statistically accurate analysis. Chromosome sorting plays a particularly important role in the analysis of nuclear genome structure and the analysis of particular and aberrant chromosomes. Other attractive but not well-explored features include the analysis of chromosomal proteins, chromosome ultrastructure, and high-resolution mapping using FISH. Recent results demonstrate that chromosome flow sorting can be coupled seamlessly with DNA array and next-generation sequencing technologies for high-throughput analyses. The main advantages are targeting the analysis to a genome region of interest and a significant reduction in sample complexity. As flow sorters can also sort single copies of chromosomes, shotgun sequencing DNA amplified from them enables the production of haplotype-resolved genome sequences. This review explains the principles of flow cytometric chromosome analysis and sorting (flow cytogenetics), discusses the major uses of this technology in genome analysis, and outlines future directions.
Collapse
Affiliation(s)
- Jaroslav Doležel
- Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany, Sokolovská 6, Olomouc, Czech Republic.
| | | | | | | | | | | |
Collapse
|
4
|
Graphodatsky A, Ferguson-Smith MA, Stanyon R. A short introduction to cytogenetic studies in mammals with reference to the present volume. Cytogenet Genome Res 2012; 137:83-96. [PMID: 22846392 DOI: 10.1159/000341502] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Genome diversity has long been studied from the comparative cytogenetic perspective. Early workers documented differences between species in diploid chromosome number and fundamental number. Banding methods allowed more detailed descriptions of between-species rearrangements and classes of differentially staining chromosome material. The infusion of molecular methods into cytogenetics provided a third revolution, which is still not exhausted. Chromosome painting has provided a global view of the translocation history of mammalian genome evolution, well summarized in the contributions to this special volume. More recently, FISH of cloned DNA has provided details on defining breakpoint and intrachromosomal marker order, which have helped to document inversions and centromere repositioning. The most recent trend in comparative molecular cytogenetics is to integrate sequencing information in order to formulate and test reconstructions of ancestral genomes and phylogenomic hypotheses derived from comparative cytogenetics. The integration of comparative cytogenetics and sequencing promises to provide an understanding of what drives chromosome rearrangements and genome evolution in general. We believe that the contributions in this volume, in no small way, point the way to the next phase in cytogenetic studies.
Collapse
Affiliation(s)
- A Graphodatsky
- Institute of Molecular and Cellular Biology, Siberian Division of the Russian Academy of Sciences, Novosibirsk, Russia
| | | | | |
Collapse
|
5
|
Alekseyev MA, Pevzner PA. Breakpoint graphs and ancestral genome reconstructions. Genes Dev 2009; 19:943-57. [PMID: 19218533 PMCID: PMC2675983 DOI: 10.1101/gr.082784.108] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2008] [Accepted: 01/22/2009] [Indexed: 11/24/2022]
Abstract
Recently completed whole-genome sequencing projects marked the transition from gene-based phylogenetic studies to phylogenomics analysis of entire genomes. We developed an algorithm MGRA for reconstructing ancestral genomes and used it to study the rearrangement history of seven mammalian genomes: human, chimpanzee, macaque, mouse, rat, dog, and opossum. MGRA relies on the notion of the multiple breakpoint graphs to overcome some limitations of the existing approaches to ancestral genome reconstructions. MGRA also generates the rearrangement-based characters guiding the phylogenetic tree reconstruction when the phylogeny is unknown.
Collapse
Affiliation(s)
- Max A. Alekseyev
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| |
Collapse
|
6
|
Kemkemer C, Kohn M, Cooper DN, Froenicke L, Högel J, Hameister H, Kehrer-Sawatzki H. Gene synteny comparisons between different vertebrates provide new insights into breakage and fusion events during mammalian karyotype evolution. BMC Evol Biol 2009; 9:84. [PMID: 19393055 PMCID: PMC2681463 DOI: 10.1186/1471-2148-9-84] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Accepted: 04/24/2009] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND Genome comparisons have made possible the reconstruction of the eutherian ancestral karyotype but also have the potential to provide new insights into the evolutionary inter-relationship of the different eutherian orders within the mammalian phylogenetic tree. Such comparisons can additionally reveal (i) the nature of the DNA sequences present within the evolutionary breakpoint regions and (ii) whether or not the evolutionary breakpoints occur randomly across the genome. Gene synteny analysis (E-painting) not only greatly reduces the complexity of comparative genome sequence analysis but also extends its evolutionary reach. RESULTS E-painting was used to compare the genome sequences of six different mammalian species and chicken. A total of 526 evolutionary breakpoint intervals were identified and these were mapped to a median resolution of 120 kb, the highest level of resolution so far obtained. A marked correlation was noted between evolutionary breakpoint frequency and gene density. This correlation was significant not only at the chromosomal level but also sub-chromosomally when comparing genome intervals of lengths as short as 40 kb. Contrary to previous findings, a comparison of evolutionary breakpoint locations with the chromosomal positions of well mapped common fragile sites and cancer-associated breakpoints failed to reveal any evidence for significant co-location. Primate-specific chromosomal rearrangements were however found to occur preferentially in regions containing segmental duplications and copy number variants. CONCLUSION Specific chromosomal regions appear to be prone to recurring rearrangement in different mammalian lineages ('breakpoint reuse') even if the breakpoints themselves are likely to be non-identical. The putative ancestral eutherian genome, reconstructed on the basis of the synteny analysis of 7 vertebrate genome sequences, not only confirmed the results of previous molecular cytogenetic studies but also increased the definition of the inferred structure of ancestral eutherian chromosomes. For the first time in such an analysis, the opossum was included as an outgroup species. This served to confirm our previous model of the ancestral eutherian genome since all ancestral syntenic segment associations were also noted in this marsupial.
Collapse
Affiliation(s)
- Claus Kemkemer
- Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
- LMU München, Biozentrum Martinsried, München, Germany
| | - Matthias Kohn
- Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - Lutz Froenicke
- Dept. of Population Health and Reproduction, School of Veterinary Medicine, University of California, Davis, CA 95616, USA
| | - Josef Högel
- Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
| | - Horst Hameister
- Institute of Human Genetics, University of Ulm, 89081 Ulm, Germany
| | | |
Collapse
|
7
|
Chauve C, Tannier E. A methodological framework for the reconstruction of contiguous regions of ancestral genomes and its application to mammalian genomes. PLoS Comput Biol 2008; 4:e1000234. [PMID: 19043541 PMCID: PMC2580819 DOI: 10.1371/journal.pcbi.1000234] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2008] [Accepted: 10/17/2008] [Indexed: 01/07/2023] Open
Abstract
The reconstruction of ancestral genome architectures and gene orders from homologies between extant species is a long-standing problem, considered by both cytogeneticists and bioinformaticians. A comparison of the two approaches was recently investigated and discussed in a series of papers, sometimes with diverging points of view regarding the performance of these two approaches. We describe a general methodological framework for reconstructing ancestral genome segments from conserved syntenies in extant genomes. We show that this problem, from a computational point of view, is naturally related to physical mapping of chromosomes and benefits from using combinatorial tools developed in this scope. We develop this framework into a new reconstruction method considering conserved gene clusters with similar gene content, mimicking principles used in most cytogenetic studies, although on a different kind of data. We implement and apply it to datasets of mammalian genomes. We perform intensive theoretical and experimental comparisons with other bioinformatics methods for ancestral genome segments reconstruction. We show that the method that we propose is stable and reliable: it gives convergent results using several kinds of data at different levels of resolution, and all predicted ancestral regions are well supported. The results come eventually very close to cytogenetics studies. It suggests that the comparison of methods for ancestral genome reconstruction should include the algorithmic aspects of the methods as well as the disciplinary differences in data aquisition. No DNA molecule is preserved after a few hundred thousand years, so inferring the DNA sequence organization of ancient living organisms beyond several million years can only be achieved by computational estimations, using the similarities and differences between chromosomes of extant species. This is the scope of “paleogenomics”, and it can help to better understand how genomes have evolved until today. We propose here a computational framework to estimate contiguous segments of ancestral chromosomes, based on techniques of physical mapping that are used to infer chromosome maps of extant species when their genome is not sequenced. This framework is not guided by possible evolutionary events such as rearrangements but only proposes ancestral genome architectures. We developed a method following this framework and applied it to mammalian genomes. We inferred ancestral chromosomal regions that are stable and well supported at different levels of resolution. These ancestral chromosomal regions agree with previous cytogenetics studies and were very probably part of the genome of the common ancestor of humans, macaca, mice, dogs, and cows, living 120 million years ago. We illustrate, through comparison with other bioinformatics methods, the importance of a formal methodological background when comparing ancestral genome architecture proposals obtained from different methods.
Collapse
Affiliation(s)
- Cedric Chauve
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Eric Tannier
- INRIA, Rhône-Alpes, France
- Université de Lyon, Lyon, France
- Université Lyon 1, Lyon, France
- Laboratoire de Biométrie et Biologie Évolutive, CNRS, UMR5558, Villeurbanne, France
- * E-mail:
| |
Collapse
|
8
|
Muffato M, Crollius HR. Paleogenomics in vertebrates, or the recovery of lost genomes from the mist of time. Bioessays 2008; 30:122-34. [DOI: 10.1002/bies.20707] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|