1
|
Gruhl F, Janich P, Kaessmann H, Gatfield D. Circular RNA repertoires are associated with evolutionarily young transposable elements. eLife 2021; 10:67991. [PMID: 34542406 PMCID: PMC8516420 DOI: 10.7554/elife.67991] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 09/19/2021] [Indexed: 12/29/2022] Open
Abstract
Circular RNAs (circRNAs) are found across eukaryotes and can function in post-transcriptional gene regulation. Their biogenesis through a circle-forming backsplicing reaction is facilitated by reverse-complementary repetitive sequences promoting pre-mRNA folding. Orthologous genes from which circRNAs arise, overall contain more strongly conserved splice sites and exons than other genes, yet it remains unclear to what extent this conservation reflects purifying selection acting on the circRNAs themselves. Our analyses of circRNA repertoires from five species representing three mammalian lineages (marsupials, eutherians: rodents, primates) reveal that surprisingly few circRNAs arise from orthologous exonic loci across all species. Even the circRNAs from orthologous loci are associated with young, recently active and species-specific transposable elements, rather than with common, ancient transposon integration events. These observations suggest that many circRNAs emerged convergently during evolution - as a byproduct of splicing in orthologs prone to transposon insertion. Overall, our findings argue against widespread functional circRNA conservation.
Collapse
Affiliation(s)
- Franziska Gruhl
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Peggy Janich
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Krebsforschung Schweiz, Bern, Switzerland
| | - Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany
| | - David Gatfield
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
2
|
Sotero-Caio CG, Platt RN, Suh A, Ray DA. Evolution and Diversity of Transposable Elements in Vertebrate Genomes. Genome Biol Evol 2017; 9:161-177. [PMID: 28158585 PMCID: PMC5381603 DOI: 10.1093/gbe/evw264] [Citation(s) in RCA: 147] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2016] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) are selfish genetic elements that mobilize in genomes via transposition or retrotransposition and often make up large fractions of vertebrate genomes. Here, we review the current understanding of vertebrate TE diversity and evolution in the context of recent advances in genome sequencing and assembly techniques. TEs make up 4-60% of assembled vertebrate genomes, and deeply branching lineages such as ray-finned fishes and amphibians generally exhibit a higher TE diversity than the more recent radiations of birds and mammals. Furthermore, the list of taxa with exceptional TE landscapes is growing. We emphasize that the current bottleneck in genome analyses lies in the proper annotation of TEs and provide examples where superficial analyses led to misleading conclusions about genome evolution. Finally, recent advances in long-read sequencing will soon permit access to TE-rich genomic regions that previously resisted assembly including the gigantic, TE-rich genomes of salamanders and lungfishes.
Collapse
Affiliation(s)
| | - Roy N. Platt
- Department of Biological Sciences, Texas Tech University, Lubbock, TX
| | - Alexander Suh
- Department of Evolutionary Biology (EBC), Uppsala University, Uppsala, Sweden
| | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX
| |
Collapse
|
3
|
Gallus S, Lammers F, Nilsson MA. When Genomics Is Not Enough: Experimental Evidence for a Decrease in LINE-1 Activity During the Evolution of Australian Marsupials. Genome Biol Evol 2016; 8:2406-12. [PMID: 27389686 PMCID: PMC5010896 DOI: 10.1093/gbe/evw159] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The autonomous transposable element LINE-1 is a highly abundant element that makes up between 15% and 20% of therian mammal genomes. Since their origin before the divergence of marsupials and placental mammals, LINE-1 elements have contributed actively to the genome landscape. A previous in silico screen of the Tasmanian devil genome revealed a lack of functional coding LINE-1 sequences. In this study we present the results of an in vitro analysis from a partial LINE-1 reverse transcriptase coding sequence in five marsupial species. Our experimental screen supports the in silico findings of the genome-wide degradation of LINE-1 sequences in the Tasmanian devil, and identifies a high frequency of degraded LINE-1 sequences in other Australian marsupials. The comparison between the experimentally obtained LINE-1 sequences and reference genome assemblies suggests that conclusions from in silico analyses of retrotransposition activity can be influenced by incomplete genome assemblies from short reads.
Collapse
Affiliation(s)
- Susanne Gallus
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft fuer Naturforschung, Senckenberg Anlage 25, Frankfurt, Germany Institute for Ecology, Evolution and Diversity, Faculty of Biological Sciences, Johann Wolfgang Goethe University Frankfurt Am Main, Max-von-Laue Straβe 9, 60438 Frankfurt am Main, Germany
| | - Fritjof Lammers
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft fuer Naturforschung, Senckenberg Anlage 25, Frankfurt, Germany Institute for Ecology, Evolution and Diversity, Faculty of Biological Sciences, Johann Wolfgang Goethe University Frankfurt Am Main, Max-von-Laue Straβe 9, 60438 Frankfurt am Main, Germany
| | - Maria Anna Nilsson
- Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft fuer Naturforschung, Senckenberg Anlage 25, Frankfurt, Germany
| |
Collapse
|
4
|
Gallus S, Hallström BM, Kumar V, Dodt WG, Janke A, Schumann GG, Nilsson MA. Evolutionary histories of transposable elements in the genome of the largest living marsupial carnivore, the Tasmanian devil. Mol Biol Evol 2015; 32:1268-83. [PMID: 25633377 PMCID: PMC4408412 DOI: 10.1093/molbev/msv017] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The largest living carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii), is the sole survivor of a lineage originating about 12 Ma. We set out to investigate the spectrum of transposable elements found in the Tasmanian devil genome, the first high-coverage genome of an Australian marsupial. Marsupial genomes have been shown to have the highest amount of transposable elements among vertebrates. We analyzed the horizontally transmitted DNA transposons OC1 and hAT-1_MEu in the Tasmanian devil genome. OC1 is present in all carnivorous marsupials, while having a very limited distribution among the remaining Australian marsupial orders. In contrast, hAT-1_MEu is present in all Australian marsupial orders, and has so far only been identified in a few placental mammals. We screened 158 introns for phylogenetically informative retrotransposons in the order Dasyuromorphia, and found that the youngest SINE (Short INterspersed Element), WSINE1, is no longer active in the subfamily Dasyuridae. The lack of detectable WSINE1 activity in this group may be due to a retrotransposon inactivation event approximately 30 Ma. We found that the Tasmanian devil genome contains a relatively low number of continuous full-length LINE-1 (Long INterspersed Element 1, L1) retrotransposons compared with the opossum genome. Furthermore, all L1 elements in the Tasmanian devil appeared to be nonfunctional. Hidden Markov Model approaches suggested that other potential sources of functional reverse transcriptase are absent from the genome. We discuss the issues associated with assembling long, highly similar L1 copies from short read Illumina data and describe how assembly artifacts can potentially lead to erroneous conclusions.
Collapse
Affiliation(s)
- Susanne Gallus
- Division of Medical Biotechnology, Paul Ehrlich Institute, Langen, Germany LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany
| | - Björn M Hallström
- Division of Medical Biotechnology, Paul Ehrlich Institute, Langen, Germany LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden
| | - Vikas Kumar
- LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany
| | - William G Dodt
- LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany School of Earth, Environmental and Biological Sciences, Queensland University of Technology, Brisbane, Qld, Australia
| | - Axel Janke
- LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany Institute for Ecology, Evolution & Diversity, Biologicum, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Gerald G Schumann
- Division of Medical Biotechnology, Paul Ehrlich Institute, Langen, Germany
| | - Maria A Nilsson
- LOEWE Biodiversity and Climate Research Centre (BiK-F), Senckenberg Gesellschaft für Naturforschung, Frankfurt am Main, Germany
| |
Collapse
|
5
|
Walters-Conte KB, Johnson DLE, Johnson WE, O’Brien SJ, Pecon-Slattery J. The dynamic proliferation of CanSINEs mirrors the complex evolution of Feliforms. BMC Evol Biol 2014; 14:137. [PMID: 24947429 PMCID: PMC4084570 DOI: 10.1186/1471-2148-14-137] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2014] [Accepted: 06/11/2014] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Repetitive short interspersed elements (SINEs) are retrotransposons ubiquitous in mammalian genomes and are highly informative markers to identify species and phylogenetic associations. Of these, SINEs unique to the order Carnivora (CanSINEs) yield novel insights on genome evolution in domestic dogs and cats, but less is known about their role in related carnivores. In particular, genome-wide assessment of CanSINE evolution has yet to be completed across the Feliformia (cat-like) suborder of Carnivora. Within Feliformia, the cat family Felidae is composed of 37 species and numerous subspecies organized into eight monophyletic lineages that likely arose 10 million years ago. Using the Felidae family as a reference phylogeny, along with representative taxa from other families of Feliformia, the origin, proliferation and evolution of CanSINEs within the suborder were assessed. RESULTS We identified 93 novel intergenic CanSINE loci in Feliformia. Sequence analyses separated Feliform CanSINEs into two subfamilies, each characterized by distinct RNA polymerase binding motifs and phylogenetic associations. Subfamily I CanSINEs arose early within Feliformia but are no longer under active proliferation. Subfamily II loci are more recent, exclusive to Felidae and show evidence for adaptation to extant RNA polymerase activity. Further, presence/absence distributions of CanSINE loci are largely congruent with taxonomic expectations within Feliformia and the less resolved nodes in the Felidae reference phylogeny present equally ambiguous CanSINE data. SINEs are thought to be nearly impervious to excision from the genome. However, we observed a nearly complete excision of a CanSINEs locus in puma (Puma concolor). In addition, we found that CanSINE proliferation in Felidae frequently targeted existing CanSINE loci for insertion sites, resulting in tandem arrays. CONCLUSIONS We demonstrate the existence of at least two SINE families within the Feliformia suborder, one of which is actively involved in insertional mutagenesis. We find SINEs are powerful markers of speciation and conclude that the few inconsistencies with expected patterns of speciation likely represent incomplete lineage sorting, species hybridization and SINE-mediated genome rearrangement.
Collapse
Affiliation(s)
- Kathryn B Walters-Conte
- Department of Biology, American University, 101 Hurst Hall 4440 Massachusetts Ave, Washington, DC 20016, USA
| | - Diana LE Johnson
- Department of Biological Sciences, The George Washington University, 2036 G St, Washington, DC 20009, USA
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA
| | - Stephen J O’Brien
- Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, 41 A, Sredniy Avenue St., Petersburg 199034, Russia
| | - Jill Pecon-Slattery
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA
| |
Collapse
|
6
|
Nilsson MA, Janke A, Murchison EP, Ning Z, Hallström BM. Expansion of CORE-SINEs in the genome of the Tasmanian devil. BMC Genomics 2012; 13:172. [PMID: 22559330 PMCID: PMC3403934 DOI: 10.1186/1471-2164-13-172] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 05/06/2012] [Indexed: 11/22/2022] Open
Abstract
Background The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species’ survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. Results The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. Conclusions The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome.
Collapse
Affiliation(s)
- Maria A Nilsson
- LOEWE-Biodiversity and Climate Research Center, BiK-F, Senckenberganlage 25, Frankfurt am Main D-60325, Germany.
| | | | | | | | | |
Collapse
|
7
|
Walters-Conte KB, Johnson DLE, Allard MW, Pecon-Slattery J. Carnivore-specific SINEs (Can-SINEs): distribution, evolution, and genomic impact. J Hered 2011; 102 Suppl 1:S2-10. [PMID: 21846743 DOI: 10.1093/jhered/esr051] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Short interspersed nuclear elements (SINEs) are a type of class 1 transposable element (retrotransposon) with features that allow investigators to resolve evolutionary relationships between populations and species while providing insight into genome composition and function. Characterization of a Carnivora-specific SINE family, Can-SINEs, has, has aided comparative genomic studies by providing rare genomic changes, and neutral sequence variants often needed to resolve difficult evolutionary questions. In addition, Can-SINEs constitute a significant source of functional diversity with Carnivora. Publication of the whole-genome sequence of domestic dog, domestic cat, and giant panda serves as a valuable resource in comparative genomic inferences gleaned from Can-SINEs. In anticipation of forthcoming studies bolstered by new genomic data, this review describes the discovery and characterization of Can-SINE motifs as well as describes composition, distribution, and effect on genome function. As the contribution of noncoding sequences to genomic diversity becomes more apparent, SINEs and other transposable elements will play an increasingly large role in mammalian comparative genomics.
Collapse
|
8
|
Pathak D, Ali S. RsaI repetitive DNA in Buffalo Bubalus bubalis representing retrotransposons, conserved in bovids, are part of the functional genes. BMC Genomics 2011; 12:338. [PMID: 21718551 PMCID: PMC3149587 DOI: 10.1186/1471-2164-12-338] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Accepted: 07/01/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Repetitive sequences are the major components of the eukaryotic genomes. Association of these repeats with transcribing sequences and their regulation in buffalo Bubalus bubalis has remained largely unresolved. RESULTS We cloned and sequenced RsaI repeat fragments pDp1, pDp2, pDp3, pDp4 of 1331, 651, 603 and 339 base pairs, respectively from the buffalo, Bubalus bubalis. Upon characterization, these fragments were found to represent retrotransposons and part of some functional genes. The resultant clones showed cross hybridization only with buffalo, cattle, goat and sheep genomic DNA. Real Time PCR, detected ~2 × 10(4) copies of pDp1, ~ 3000 copies of pDp2 and pDp3 and ~ 1000 of pDp4 in buffalo, cattle, goat and sheep genomes, respectively. RsaI repeats are transcriptionally active in somatic tissues and spermatozoa. Accordingly, pDp1 showed maximum expression in lung, pDp2 and pDp3 both in Kidney, and pDp4 in ovary. Fluorescence in situ hybridization showed repeats to be distributed all across the chromosomes. CONCLUSIONS The data suggest that RsaI repeats have been incorporated into the exonic regions of various transcribing genes, possibly contributing towards the architecture and evolution of the buffalo and related genomes. Prospects of our present work in the context of comparative and functional genomics are highlighted.
Collapse
Affiliation(s)
- Deepali Pathak
- Molecular Genetics Laboratory, National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi -110 067, India
| | | |
Collapse
|
9
|
The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates. Genome Biol 2010; 11:R59. [PMID: 20525173 PMCID: PMC2911107 DOI: 10.1186/gb-2010-11-6-r59] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Revised: 04/27/2010] [Accepted: 06/02/2010] [Indexed: 01/29/2023] Open
Abstract
Background Transposable elements (TEs) have played an important role in the diversification and enrichment of mammalian transcriptomes through various mechanisms such as exonization and intronization (the birth of new exons/introns from previously intronic/exonic sequences, respectively), and insertion into first and last exons. However, no extensive analysis has compared the effects of TEs on the transcriptomes of mammals, non-mammalian vertebrates and invertebrates. Results We analyzed the influence of TEs on the transcriptomes of five species, three invertebrates and two non-mammalian vertebrates. Compared to previously analyzed mammals, there were lower levels of TE introduction into introns, significantly lower numbers of exonizations originating from TEs and a lower percentage of TE insertion within the first and last exons. Although the transcriptomes of vertebrates exhibit significant levels of exonization of TEs, only anecdotal cases were found in invertebrates. In vertebrates, as in mammals, the exonized TEs are mostly alternatively spliced, indicating that selective pressure maintains the original mRNA product generated from such genes. Conclusions Exonization of TEs is widespread in mammals, less so in non-mammalian vertebrates, and very low in invertebrates. We assume that the exonization process depends on the length of introns. Vertebrates, unlike invertebrates, are characterized by long introns and short internal exons. Our results suggest that there is a direct link between the length of introns and exonization of TEs and that this process became more prevalent following the appearance of mammals.
Collapse
|
10
|
Abstract
The strategic importance of the genome sequence of the gray, short-tailed opossum, Monodelphis domestica, accrues from both the unique phylogenetic position of metatherian (marsupial) mammals and the fundamental biologic characteristics of metatherians that distinguish them from other mammalian species. Metatherian and eutherian (placental) mammals are more closely related to one another than to other vertebrate groups, and owing to this close relationship they share fundamentally similar genetic structures and molecular processes. However, during their long evolutionary separation these alternative mammals have developed distinctive anatomical, physiologic, and genetic features that hold tremendous potential for examining relationships between the molecular structures of mammalian genomes and the functional attributes of their components. Comparative analyses using the opossum genome have already provided a wealth of new evidence regarding the importance of noncoding elements in the evolution of mammalian genomes, the role of transposable elements in driving genomic innovation, and the relationships between recombination rate, nucleotide composition, and the genomic distributions of repetitive elements. The genome sequence is also beginning to enlarge our understanding of the evolution and function of the vertebrate immune system, and it provides an alternative model for investigating mechanisms of genomic imprinting. Equally important, availability of the genome sequence is fostering the development of new research tools for physical and functional genomic analyses of M. domestica that are expanding its versatility as an experimental system for a broad range of research applications in basic biology and biomedically oriented research.
Collapse
|
11
|
Chojnowski JL, Braun EL. Turtle isochore structure is intermediate between amphibians and other amniotes. Integr Comp Biol 2008; 48:454-62. [PMID: 21669806 DOI: 10.1093/icb/icn062] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Vertebrate genomes are comprised of isochores that are relatively long (>100 kb) regions with a relatively homogenous (either GC-rich or AT-rich) base composition and with rather sharp boundaries with neighboring isochores. Mammals and living archosaurs (birds and crocodilians) have heterogeneous genomes that include very GC-rich isochores. In sharp contrast, the genomes of amphibians and fishes are more homogeneous and they have a lower overall GC content. Because DNA with higher GC content is more thermostable, the elevated GC content of mammalian and archosaurian DNA has been hypothesized to be an adaptation to higher body temperatures. This hypothesis can be tested by examining structure of isochores across the reptilian clade, which includes the archosaurs, testudines (turtles), and lepidosaurs (lizards and snakes), because reptiles exhibit diverse body sizes, metabolic rates, and patterns of thermoregulation. This study focuses on a comparative analysis of a new set of expressed genes of the red-eared slider turtle and orthologs of the turtle genes in mammalian (human, mouse, dog, and opossum), archosaurian (chicken and alligator), and amphibian (western clawed frog) genomes. EST (expressed sequence tag) data from a turtle cDNA library enriched for genes that have specialized functions (developmental genes) revealed using the GC content of the third-codon-position to examine isochore structure requires careful consideration of the types of genes examined. The more highly expressed genes (e.g., housekeeping genes) are more likely to be GC-rich than are genes with specialized functions. However, the set of highly expressed turtle genes demonstrated that the turtle genome has a GC content that is intermediate between the GC-poor amphibians and the GC-rich mammals and archosaurs. There was a strong correlation between the GC content of all turtle genes and the GC content of other vertebrate genes, with the slope of the line describing this relationship also indicating that the isochore structure of turtles is intermediate between that of amphibians and other amniotes. These data are consistent with some thermal hypotheses of isochore evolution, but we believe that the credible set of models for isochore evolution still includes a variety of models. These data expand the amount of genomic data available from reptiles upon which future studies of reptilian genomics can build.
Collapse
Affiliation(s)
- Jena L Chojnowski
- Department of Zoology, University of Florida, 223 Bartram Hall, PO Box 118525, Gainesville, FL 32611, USA
| | | |
Collapse
|
12
|
Gu W, Castoe TA, Hedges DJ, Batzer MA, Pollock DD. Identification of repeat structure in large genomes using repeat probability clouds. Anal Biochem 2008; 380:77-83. [PMID: 18541131 DOI: 10.1016/j.ab.2008.05.015] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2008] [Revised: 05/01/2008] [Accepted: 05/02/2008] [Indexed: 11/28/2022]
Abstract
The identification of repeat structure in eukaryotic genomes can be time-consuming and difficult because of the large amount of information ( approximately 3 x 10(9) bp) that needs to be processed and compared. We introduce a new approach based on exact word counts to evaluate, de novo, the repeat structure present within large eukaryotic genomes. This approach avoids sequence alignment and similarity search, two of the most time-consuming components of traditional methods for repeat identification. Algorithms were implemented to efficiently calculate exact counts for any length oligonucleotide in large genomes. Based on these oligonucleotide counts, oligonucleotide excess probability clouds, or "P-clouds," were constructed. P-clouds are composed of clusters of related oligonucleotides that occur, as a group, more often than expected by chance. After construction, P-clouds were mapped back onto the genome, and regions of high P-cloud density were identified as repetitive regions based on a sliding window approach. This efficient method is capable of analyzing the repeat content of the entire human genome on a single desktop computer in less than half a day, at least 10-fold faster than current approaches. The predicted repetitive regions strongly overlap with known repeat elements as well as other repetitive regions such as gene families, pseudogenes, and segmental duplicons. This method should be extremely useful as a tool for use in de novo identification of repeat structure in large newly sequenced genomes.
Collapse
Affiliation(s)
- Wanjun Gu
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | | | | | | | | |
Collapse
|
13
|
Core-SINE blocks comprise a large fraction of monotreme genomes; implications for vertebrate chromosome evolution. Chromosome Res 2008; 15:975-84. [DOI: 10.1007/s10577-007-1187-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2007] [Revised: 10/21/2007] [Accepted: 10/21/2007] [Indexed: 10/22/2022]
|
14
|
Gentles AJ, Wakefield MJ, Kohany O, Gu W, Batzer MA, Pollock DD, Jurka J. Evolutionary dynamics of transposable elements in the short-tailed opossum Monodelphis domestica. Genome Res 2007; 17:992-1004. [PMID: 17495012 PMCID: PMC1899126 DOI: 10.1101/gr.6070707] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The genome of the gray short-tailed opossum Monodelphis domestica is notable for its large size ( approximately 3.6 Gb). We characterized nearly 500 families of interspersed repeats from the Monodelphis. They cover approximately 52% of the genome, higher than in any other amniotic lineage studied to date, and may account for the unusually large genome size. In comparison to other mammals, Monodelphis is significantly rich in non-LTR retrotransposons from the LINE-1, CR1, and RTE families, with >29% of the genome sequence comprised of copies of these elements. Monodelphis has at least four families of RTE, and we report support for horizontal transfer of this non-LTR retrotransposon. In addition to short interspersed elements (SINEs) mobilized by L1, we found several families of SINEs that appear to use RTE elements for mobilization. In contrast to L1-mobilized SINEs, the RTE-mobilized SINEs in Monodelphis appear to shift from G+C-rich to G+C-low regions with time. Endogenous retroviruses have colonized approximately 10% of the opossum genome. We found that their density is enhanced in centromeric and/or telomeric regions of most Monodelphis chromosomes. We identified 83 new families of ancient repeats that are highly conserved across amniotic lineages, including 14 LINE-derived repeats; and a novel SINE element, MER131, that may have been exapted as a highly conserved functional noncoding RNA, and whose emergence dates back to approximately 300 million years ago. Many of these conserved repeats are also present in human, and are highly over-represented in predicted cis-regulatory modules. Seventy-six of the 83 families are present in chicken in addition to mammals.
Collapse
Affiliation(s)
- Andrew J. Gentles
- Department of Radiology, School of Medicine, Stanford University, Stanford, California 94305, USA
- Genetic Information Research Institute, Mountain View, California 94043, USA
- Corresponding authors.E-mail ; fax (650) 723-5795.E-mail ; fax (650) 961-4473
| | - Matthew J. Wakefield
- ARC Centre for Kangaroo Genomics, Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria 3050, Australia
| | - Oleksiy Kohany
- Genetic Information Research Institute, Mountain View, California 94043, USA
| | - Wanjun Gu
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Aurora 80045, Colorado, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - David D. Pollock
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Aurora 80045, Colorado, USA
| | - Jerzy Jurka
- Genetic Information Research Institute, Mountain View, California 94043, USA
- Corresponding authors.E-mail ; fax (650) 723-5795.E-mail ; fax (650) 961-4473
| |
Collapse
|
15
|
Mikkelsen TS, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A, Jurka J, Kamal M, Mauceli E, Searle SMJ, Sharpe T, Baker ML, Batzer MA, Benos PV, Belov K, Clamp M, Cook A, Cuff J, Das R, Davidow L, Deakin JE, Fazzari MJ, Glass JL, Grabherr M, Greally JM, Gu W, Hore TA, Huttley GA, Kleber M, Jirtle RL, Koina E, Lee JT, Mahony S, Marra MA, Miller RD, Nicholls RD, Oda M, Papenfuss AT, Parra ZE, Pollock DD, Ray DA, Schein JE, Speed TP, Thompson K, VandeBerg JL, Wade CM, Walker JA, Waters PD, Webber C, Weidman JR, Xie X, Zody MC, Graves JAM, Ponting CP, Breen M, Samollow PB, Lander ES, Lindblad-Toh K. Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature 2007; 447:167-77. [PMID: 17495919 DOI: 10.1038/nature05805] [Citation(s) in RCA: 508] [Impact Index Per Article: 29.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 04/03/2007] [Indexed: 12/15/2022]
Abstract
We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.
Collapse
Affiliation(s)
- Tarjei S Mikkelsen
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|