1
|
Gold MP, Fresco JR. A Role for the Mutagenic DNA Self-Catalyzed Depurination Mechanism in the Evolution of 7SL-Derived RNAs. J Mol Evol 2017; 85:84-98. [PMID: 29103173 DOI: 10.1007/s00239-017-9811-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Accepted: 10/03/2017] [Indexed: 11/28/2022]
Abstract
The Alu element, the most prevalent SINE (short interspersed element) in the human genome, is one of the many RNA-encoding genes that evolved from the 7SL RNA gene. During analysis of the evolution of 7SL-derived RNAs, two distinct evolutionary intermediates capable of self-catalyzed DNA depurination (SDP) were identified. These SDP sequences spontaneously create apurinic sites that can result in increased mutagenesis due to their error-prone repair. This DNA self-depurination mechanism has been shown both in vitro and in vivo to lead to substitution and short frameshift mutations at a frequency that far exceeds their occurrence due to random errors in DNA replication. In both evolutionary intermediates, the same self-depurination sequence overlaps motifs necessary for successful transcription and SRP9/14 (signal recognition particle) binding; hence, mutations in this region could disrupt RNA activity. Yet, the 7SL-derived RNAs that arose from the elements capable of SDP show significant diversity in this region, and every new sequence retains the transcription and SRP9/14-binding motifs, even as it has lost the SDP sequence. While some (but not all) of the mutagenesis can be alternatively attributed to CpG decay, the very fact that the self-depurinating sequences are selectively discarded in all cases suggests that this was evolutionarily motivated to prevent further destructive mutagenesis by the SDP mechanism.
Collapse
Affiliation(s)
- Maxwell P Gold
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Jacques R Fresco
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA.
| |
Collapse
|
2
|
Tajaddod M, Tanzer A, Licht K, Wolfinger MT, Badelt S, Huber F, Pusch O, Schopoff S, Janisiw M, Hofacker I, Jantsch MF. Transcriptome-wide effects of inverted SINEs on gene expression and their impact on RNA polymerase II activity. Genome Biol 2016; 17:220. [PMID: 27782844 PMCID: PMC5080714 DOI: 10.1186/s13059-016-1083-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 10/10/2016] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. RESULTS Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. CONCLUSIONS Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.
Collapse
Affiliation(s)
- Mansoureh Tajaddod
- Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Dr. Bohr Gasse 9/5, Vienna, A-1030, Austria
| | - Andrea Tanzer
- Institute for Theoretical Chemistry, University of Vienna, Währinger Strasse 17, Vienna, A-1090, Austria
| | - Konstantin Licht
- Department of Cell and Developmental Biology, Medical University of Vienna, Schwarzspanierstrasse 17, Vienna, A-1090, Austria
| | - Michael T Wolfinger
- Department of Cell and Developmental Biology, Medical University of Vienna, Schwarzspanierstrasse 17, Vienna, A-1090, Austria
- Institute for Theoretical Chemistry, University of Vienna, Währinger Strasse 17, Vienna, A-1090, Austria
| | - Stefan Badelt
- Institute for Theoretical Chemistry, University of Vienna, Währinger Strasse 17, Vienna, A-1090, Austria
| | - Florian Huber
- Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Dr. Bohr Gasse 9/5, Vienna, A-1030, Austria
- Present address: Center for molecular biology of the University Heidelberg, Im Neuenheimer Feld 282, Heidelberg, D-69120, Germany
| | - Oliver Pusch
- Department of Cell and Developmental Biology, Medical University of Vienna, Schwarzspanierstrasse 17, Vienna, A-1090, Austria
| | - Sandy Schopoff
- Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Dr. Bohr Gasse 9/5, Vienna, A-1030, Austria
| | - Michael Janisiw
- Department of Cell and Developmental Biology, Medical University of Vienna, Schwarzspanierstrasse 17, Vienna, A-1090, Austria
| | - Ivo Hofacker
- Institute for Theoretical Chemistry, University of Vienna, Währinger Strasse 17, Vienna, A-1090, Austria
| | - Michael F Jantsch
- Department of Cell and Developmental Biology, Medical University of Vienna, Schwarzspanierstrasse 17, Vienna, A-1090, Austria.
- Department of Cell and Developmental Biology, Medical University of Vienna, Center of Anatomy and Cell Biology, Schwarzspanierstrasse 17, Vienna, A-1090, Austria.
| |
Collapse
|
3
|
Abstract
SINEBase (http://sines.eimb.ru) integrates the revisited body of knowledge about short interspersed elements (SINEs). A set of formal definitions concerning SINEs was introduced. All available sequence data were screened through these definitions and the genetic elements misidentified as SINEs were discarded. As a result, 175 SINE families have been recognized in animals, flowering plants and green algae. These families were classified by the modular structure of their nucleotide sequences and the frequencies of different patterns were evaluated. These data formed the basis for the database of SINEs. The SINEBase website can be used in two ways: first, to explore the database of SINE families, and second, to analyse candidate SINE sequences using specifically developed tools. This article presents an overview of the database and the process of SINE identification and analysis.
Collapse
Affiliation(s)
- Nikita S Vassetzky
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Moscow 119991, Russia
| | | |
Collapse
|
4
|
“Delayed death” phenomenon: A synergistic action of cyclophosphamide and exogenous DNA. Gene 2012; 495:134-45. [DOI: 10.1016/j.gene.2011.12.032] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Revised: 12/05/2011] [Accepted: 12/15/2011] [Indexed: 01/20/2023]
|
5
|
Abstract
Short interspersed elements (SINEs) are one of the two most prolific mobile genomic elements in most of the higher eukaryotes. Although their biology is still not thoroughly understood, unusual life cycle of these simple elements amplified as genomic parasites makes their evolution unique in many ways. In contrast to most genetic elements including other transposons, SINEs emerged de novo many times in evolution from available molecules (for example, tRNA). The involvement of reverse transcription in their amplification cycle, huge number of genomic copies and modular structure allow variation mechanisms in SINEs uncommon or rare in other genetic elements (module exchange between SINE families, dimerization, and so on.). Overall, SINE evolution includes their emergence, progressive optimization and counteraction to the cell's defense against mobile genetic elements.
Collapse
|
6
|
Chen Z, Yang G. Novel CHR-2 SINE subfamilies and t-SINEs identified in cetaceans using nonradioactive Southern blotting. Genes Genomics 2010. [DOI: 10.1007/s13258-010-0044-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
7
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
8
|
Gogolevsky KP, Vassetzky NS, Kramerov DA. 5S rRNA-derived and tRNA-derived SINEs in fruit bats. Genomics 2009; 93:494-500. [PMID: 19442632 DOI: 10.1016/j.ygeno.2009.02.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Revised: 02/04/2009] [Accepted: 02/04/2009] [Indexed: 11/24/2022]
Abstract
Most short retroposons (SINEs) descend from cellular tRNA of 7SL RNA. Here, four new SINEs were found in megabats (Megachiroptera) but neither in microbats nor in other mammals. Two of them, MEG-RS and MEG-RL, descend from another cellular RNA, 5S rRNA; one (MEG-T2) is a tRNA-derived SINE; and MEG-TR is a hybrid tRNA/5S rRNA SINE. Insertion locus analysis suggests that these SINEs were active in the recent fruit bat evolution. Analysis of MEG-RS and MEG-RL in comparison with other few 5S rRNA-derived SINEs demonstrates that the internal RNA polymerase III promoter is their most invariant region, while the secondary structure is more variable. The mechanisms underlying the modular structure of these and other SINEs as well as their variation are discussed. The scenario of evolution of MEG SINEs is proposed.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., Moscow 119991, Russia
| | | | | |
Collapse
|
9
|
Yu L, Liu J, Luan PT, Lee H, Lee M, Min MS, Ryder OA, Chemnick L, Davis H, Zhang YP. New insights into the evolution of intronic sequences of the beta-fibrinogen gene and their application in reconstructing mustelid phylogeny. Zoolog Sci 2008; 25:662-72. [PMID: 18624576 DOI: 10.2108/zsj.25.662] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Accepted: 03/13/2008] [Indexed: 11/17/2022]
Abstract
Mustelidae is the largest and most diverse family in the order Carnivora. The phylogenetic relationships among the subfamilies have especially long been a focus of study. Herein we are among the first to employ two new introns (4 and 7) of the nuclear beta-fibrinogen gene to clarify these enigmatic problems. In addition, two previously available nuclear (IRBP exon 1 and TTR intron 1) and one mt (ND2) data sets were also combined and analyzed simultaneously with the newly obtained sequence data in this study. Detailed characterizations of the two intronic regions not only reveal the remarkable occurrences of short interspersed element (SINE) insertion events, providing a new example supporting the attractive hypothesis that attrition of an earlier retroposition may offer a proper environment for successive retropositions by forming a "dimer-like" structure, but also demonstrate their utility in the resolution of mustelid phylogeny. All of our analyses confirm the assemblage of Mustelinae, Lutrinae, and Melinae with confidence; moreover, two clades within Mustelinae were clearly recognized, i.e., genera Mustela and Martes. Notably, genus Martes of Mustelinae was found to branch off first, followed by Melinae and then a clade containing Lutrinae and genus Mustela of Mustelinae, indicating paraphyly of Mustelinae. In addition, Mephitinae diverges before the other mustelids and the monophyletic Procyonidae in all cases, supporting its elevation to a separate family. Additional independent genetic markers are still in need to resolve the trichotomy among Mephitinae and the other two carnivoran clades, Ailuridae and Procyonidae/non-mephitine Mustelidae.
Collapse
Affiliation(s)
- Li Yu
- Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Veniaminova NA, Vassetzky NS, Lavrenchenko LA, Popov SV, Kramerov DA. Phylogeny of the order rodentia inferred from structural analysis of short retroposon B1. RUSS J GENET+ 2007. [DOI: 10.1134/s1022795407070071] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
11
|
Nishihara H, Kuno S, Nikaido M, Okada N. MyrSINEs: a novel SINE family in the anteater genomes. Gene 2007; 400:98-103. [PMID: 17628355 DOI: 10.1016/j.gene.2007.06.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2007] [Revised: 05/24/2007] [Accepted: 06/01/2007] [Indexed: 10/23/2022]
Abstract
Recent rapid generation of genomic sequence data has allowed many researchers to perform comparative analyses in various mammalian species. However, characterization of transposable elements, such as short interspersed repetitive elements (SINEs), has not been reported for several mammalian groups. Because SINEs occupy a large portion of the mammalian genome, they are believed to have contributed to the constitution and diversification of the host genomes during evolution. In the present study, we characterized a novel SINE family in the anteater genomes and designated it the MyrSINE family. Typical SINEs consist of a tRNA-related, a tRNA-unrelated and an AT-rich (or poly-A) region. MyrSINEs have only tRNA-related and poly-A regions; they are included in a group called t-SINE. The tRNA-related regions of the MyrSINEs were found to be derived from tRNA(Gly). We demonstrate that the MyrSINE family can be classified into three subfamilies. Two of the MyrSINE subfamilies are distributed in the genomes of both giant anteater and tamandua, while the other is present only in the giant anteater. We discuss the evolutionary history of MyrSINEs and their relationship to the evolution of anteaters. We also speculate that the simple structure of t-SINEs may be a potential evolutionary source for the generation of the typical SINE structure.
Collapse
Affiliation(s)
- Hidenori Nishihara
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama 226-8501, Japan
| | | | | | | |
Collapse
|
12
|
Farwick A, Jordan U, Fuellen G, Huchon D, Catzeflis F, Brosius J, Schmitz J. Automated scanning for phylogenetically informative transposed elements in rodents. Syst Biol 2007; 55:936-48. [PMID: 17345675 DOI: 10.1080/10635150601064806] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
Transposed elements constitute an attractive, useful source of phylogenetic markers to elucidate the evolutionary history of their hosts. Frequent and successive amplifications over evolutionary time are important requirements for utilizing their presence or absence as landmarks of evolution. Although transposed elements are well distributed in rodent taxa, the generally high degree of genomic sequence divergence among species complicates our access to presence/absence data. With this in mind we developed a novel, high-throughput computational strategy, called CPAL (Conserved Presence/Absence Locus-finder), to identify genome-wide distributed, phylogenetically informative transposed elements flanked by highly conserved regions. From a total of 232 extracted chromosomal mouse loci we randomly selected 14 of these plus 2 others from previous test screens and attempted to amplify them via PCR in representative rodent species. All loci were amplifiable and ultimately contributed 31 phylogenetically informative markers distributed throughout the major groups of Rodentia.
Collapse
Affiliation(s)
- Astrid Farwick
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | | | | | | | | | | | | |
Collapse
|
13
|
Veniaminova NA, Vassetzky NS, Kramerov DA. B1 SINEs in different rodent families. Genomics 2007; 89:678-86. [PMID: 17433864 DOI: 10.1016/j.ygeno.2007.02.007] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2006] [Revised: 02/28/2007] [Accepted: 02/28/2007] [Indexed: 11/18/2022]
Abstract
B1 SINEs were studied in 22 families covering all major rodent lineages. The number of B1 copies considerably varies, from 1 x 10(4) in Geomyidae to 1 x 10(6) in Myodonta. B1 sequences can be divided into three main structural variants: B1 with a 20-bp tandem duplication (found in Gliridae, Sciuridae, and Aplodontidae), B1 with a 29-bp duplication (found in other families), and proto-B1 without duplication (pB1). These variants can be further subdivided according to their characters, including specific 7-, 9-, or 10-bp deletions. Different B1 subfamilies predominate in different rodent families. The analysis of B1 variants allowed us to propose possible pathways for the evolution of this SINE in the context of rodent evolution.
Collapse
Affiliation(s)
- Natalia A Veniaminova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Street, Moscow 119991, Russia
| | | | | |
Collapse
|
14
|
|
15
|
Gogolevsky KP, Kramerov DA. Short interspersed elements (SINEs) of the Geomyoidea superfamily rodents. Gene 2006; 373:67-74. [PMID: 16517098 DOI: 10.1016/j.gene.2006.01.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2005] [Revised: 01/04/2006] [Accepted: 01/09/2006] [Indexed: 10/24/2022]
Abstract
A new short interspersed element (SINE) was isolated from the genome of desert kangaroo rat (Dipodomys deserti) using single-primer PCR. This SINE consists of two monomers: the left monomer (IDL) resembles rodent ID element and other tRNAAla(CGC)-derived SINEs, whereas the right one (Geo) shows no similarity with known SINE sequences. PCR and hybridization analyses demonstrated that IDL-Geo SINE is restricted to the rodent superfamily Geomyoidea (families Geomyidea and Heteromyidea). Isolation and analysis of IDL-Geo from California pocket mouse (Chaetodipus californicus) and Botta's pocket gopher (Thomomys bottae) revealed some species-specific features of this SINE family. The structure and evolution of known dimeric SINEs are discussed.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Street, Moscow 119991, Russia
| | | |
Collapse
|
16
|
Borodulina OR, Kramerov DA. PCR-based approach to SINE isolation: Simple and complex SINEs. Gene 2005; 349:197-205. [PMID: 15777739 DOI: 10.1016/j.gene.2004.12.035] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2004] [Revised: 11/19/2004] [Accepted: 12/22/2004] [Indexed: 10/25/2022]
Abstract
Highly repeated copies of short interspersed elements (SINEs) occur in eukaryotic genomes. The distribution of each SINE family is usually restricted to some genera, families, or orders. SINEs have an RNA polymerase III internal promoter, which is composed of boxes A and B. Here we propose a method for isolation of novel SINE families based on genomic DNA PCR with oligonucleotide identical to box A as a primer. Cloning of the size-heterogeneous PCR-products and sequencing of their terminal regions allow determination of SINE structure. Using this approach, two novel SINE families, Rhin-1 and Das-1, from the genomes of great horseshoe bat (Rhinolophus ferrumequinum) and nine-banded armadillo (Dasypus novemcinctus), respectively, were isolated and studied. The distribution of Rhin-1 is restricted to two of six bat families tested. Copies of this SINE are characterized by frequent internal insertions and significant length (200-270 bp). Das-1 being only 90 bp in length is one of the shortest SINEs known. Most of Das-1 nucleotide sequences demonstrate significant similarity to alanine tRNA which appears to be an evolutionary progenitor of this SINE. Together with three other known SINEs (ID, Vic-1, and CYN), Das-1 constitutes a group of simple SINEs. Interestingly, three SINE families of this group are alanine tRNA-derived. Most probably, this tRNA gave rise to short and simple but successful SINEs several times during mammalian evolution.
Collapse
Affiliation(s)
- Olga R Borodulina
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | | |
Collapse
|
17
|
Wicker T, Robertson JS, Schulze SR, Feltus FA, Magrini V, Morrison JA, Mardis ER, Wilson RK, Peterson DG, Paterson AH, Ivarie R. The repetitive landscape of the chicken genome. Genome Res 2004; 15:126-36. [PMID: 15256510 PMCID: PMC540276 DOI: 10.1101/gr.2438004] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
Collapse
Affiliation(s)
- Thomas Wicker
- Plant Genome Mapping Laboratory, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Wicker T, Robertson JS, Schulze SR, Feltus FA, Magrini V, Morrison JA, Mardis ER, Wilson RK, Peterson DG, Paterson AH, Ivarie R. The repetitive landscape of the chicken genome. Genome Res 2004. [PMID: 15256510 DOI: 10.1101/gr.2438005] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, and low-copy fractions of the chicken genome. Sequencing high-copy DNA of chicken to about 2.7 x coverage of its estimated sequence complexity led to the initial identification of several new repeat families, which were then used for a survey of the newly released first draft of the complete chicken genome. The analysis provided insight into the diversity and biology of known repeat structures such as CR1 and CNM, for which only limited sequence data had previously been available. Cot sequence data also resulted in the identification of four novel repeats (Birddawg, Hitchcock, Kronos, and Soprano), two new subfamilies of CR1 repeats, and many elements absent from the chicken genome assembly. Multiple autonomous elements were found for a novel Mariner-like transposon, Galluhop, in addition to nonautonomous deletion derivatives. Phylogenetic analysis of the high-copy repeats CR1, Galluhop, and Birddawg provided insight into two distinct genome dispersion strategies. This study also exemplifies the power of the CBCS method to create representative databases for the repetitive fractions of genomes for which only limited sequence data is available.
Collapse
Affiliation(s)
- Thomas Wicker
- Plant Genome Mapping Laboratory, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
Although B1 and Alu were the first discovered Short Interspersed Elements (SINEs), the studies of these genomic repeats were mostly limited to mice and humans and little data on their presence in other animals were available. Here we report the presence of these SINEs in a wide range of rodents (in all 15 tested families) as well as primates and tree-shrews and their absence in other mammals. Distribution pattern of these SINEs in mammals supports close relationship between rodents and primates as well as tree-shrews. Sequence analysis of these elements, apparently descending from cellular 7SL RNA indicates their rearrangements such as dimerization (Alu), quasi-dimerization (B1), acquiring a tRNA-related unit (B1-dID), extended deletions, etc., preceding their active expansion in the genomes. The revealed common pattern of microenvironment of some rearrangement hot spots in SINEs (internal duplications and deletions) suggests involvement of short direct repeats in the mechanism of such rearrangements. This hypothesis allows us to explain short rearrangements in these and other short retroposons.
Collapse
Affiliation(s)
- Nikita S Vassetzky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., 119991, Moscow, Russia
| | | | | |
Collapse
|