1
|
Deng P, Tan SQ, Yang QY, Fu L, Wu Y, Zhu HZ, Sun L, Bao Z, Lin Y, Zhang QC, Wang H, Wang J, Liu JJG. Structural RNA components supervise the sequential DNA cleavage in R2 retrotransposon. Cell 2023; 186:2865-2879.e20. [PMID: 37301196 DOI: 10.1016/j.cell.2023.05.032] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 03/14/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023]
Abstract
Retroelements are the widespread jumping elements considered as major drivers for genome evolution, which can also be repurposed as gene-editing tools. Here, we determine the cryo-EM structures of eukaryotic R2 retrotransposon with ribosomal DNA target and regulatory RNAs. Combined with biochemical and sequencing analysis, we reveal two essential DNA regions, Drr and Dcr, required for recognition and cleavage. The association of 3' regulatory RNA with R2 protein accelerates the first-strand cleavage, blocks the second-strand cleavage, and initiates the reverse transcription starting from the 3'-tail. Removing 3' regulatory RNA by reverse transcription allows the association of 5' regulatory RNA and initiates the second-strand cleavage. Taken together, our work explains the DNA recognition and RNA supervised sequential retrotransposition mechanisms by R2 machinery, providing insights into the retrotransposon and application reprogramming.
Collapse
Affiliation(s)
- Pujuan Deng
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Shun-Qing Tan
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Qi-Yu Yang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Liangzheng Fu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yachao Wu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Han-Zhou Zhu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Sun
- Shandong Provincial Key Laboratory of Animal Cell and Developmental Biology, School of Life Sciences, Shandong University, Qingdao 266237, China
| | - Zhangbin Bao
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Yi Lin
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Qiangfeng Cliff Zhang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haoyi Wang
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jia Wang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| | - Jun-Jie Gogo Liu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
2
|
Setiawan AB, Teo CH, Kikuchi S, Sassa H, Kato K, Koba T. Chromosomal Locations of a Non-LTR Retrotransposon, Menolird18, in Cucumis melo and Cucumis sativus, and Its Implication on Genome Evolution of Cucumis Species. Cytogenet Genome Res 2020; 160:554-564. [DOI: 10.1159/000511119] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Accepted: 07/07/2020] [Indexed: 11/19/2022] Open
Abstract
Mobile elements are major regulators of genome evolution through their effects on genome size and chromosome structure in higher organisms. Non-long terminal repeat (non-LTR) retrotransposons, one of the subclasses of transposons, are specifically inserted into repetitive DNA sequences. While studies on the insertion of non-LTR retrotransposons into ribosomal RNA genes and other repetitive DNA sequences have been reported in the animal kingdom, studies in the plant kingdom are limited. Here, using FISH, we confirmed that <i>Menolird18</i>, a member of LINE (long interspersed nuclear element) in non-LTR retrotransposons and found in <i>Cucumis melo</i>, was inserted into ITS and ETS (internal and external transcribed spacers) regions of 18S rDNA in melon and cucumber. Beside the 18S rDNA regions, <i>Menolird18</i> was also detected in all centromeric regions of melon, while it was located at pericentromeric and sub-telomeric regions in cucumber. The fact that FISH signals of <i>Menolird18</i> were found in centromeric and rDNA regions of mitotic chromosomes suggests that <i>Menolird18</i> is a rDNA and centromere-specific non-LTR retrotransposon in melon. Our findings are the first report on a non-LTR retrotransposon that is highly conserved in 2 different plant species, melon and cucumber. The clear distinction of chromosomal localization of <i>Menolird18</i> in melon and cucumber implies that it might have been involved in the evolutionary processes of the melon (<i>C. melo</i>) and cucumber (<i>C. sativus</i>) genomes.
Collapse
|
3
|
R2 and Non-Site-Specific R2-Like Retrotransposons of the German Cockroach, Blattella germanica. Genes (Basel) 2020; 11:genes11101202. [PMID: 33076367 PMCID: PMC7650587 DOI: 10.3390/genes11101202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 10/10/2020] [Accepted: 10/12/2020] [Indexed: 11/17/2022] Open
Abstract
The structural and functional organization of the ribosomal RNA gene cluster and the full-length R2 non-LTR retrotransposon (integrated into a specific site of 28S ribosomal RNA genes) of the German cockroach, Blattella germanica, is described. A partial sequence of the R2 retrotransposon of the cockroach Rhyparobia maderae is also analyzed. The analysis of previously published next-generation sequencing data from the B. germanica genome reveals a new type of retrotransposon closely related to R2 retrotransposons but with a random distribution in the genome. Phylogenetic analysis reveals that these newly described retrotransposons form a separate clade. It is shown that proteins corresponding to the open reading frames of newly described retrotransposons exhibit unequal structural domains. Within these retrotransposons, a recombination event is described. New mechanism of transposition activity is discussed. The essential structural features of R2 retrotransposons are conserved in cockroaches and are typical of previously described R2 retrotransposons. However, the investigation of the number and frequency of 5′-truncated R2 retrotransposon insertion variants in eight B. germanica populations suggests recent mobile element activity. It is shown that the pattern of 5′-truncated R2 retrotransposon copies can be an informative molecular genetic marker for revealing genetic distances between insect populations.
Collapse
|
4
|
Khadgi BB, Govindaraju A, Christensen SM. Completion of LINE integration involves an open '4-way' branched DNA intermediate. Nucleic Acids Res 2019; 47:8708-8719. [PMID: 31392993 PMCID: PMC6895275 DOI: 10.1093/nar/gkz673] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 06/26/2019] [Accepted: 07/29/2019] [Indexed: 12/20/2022] Open
Abstract
Long Interspersed Elements (LINEs), also known as non-LTR retrotransposons, encode a multifunctional protein that reverse transcribes its mRNA into DNA at the site of insertion by target primed reverse transcription. The second half of the integration reaction remains very poorly understood. Second-strand DNA cleavage and second-strand DNA synthesis were investigated in vitro using purified components from a site-specific restriction-like endonuclease (RLE) bearing LINE. DNA structure was shown to be a critical component of second-strand DNA cleavage. A hitherto unknown and unexplored integration intermediate, an open ‘4-way’ DNA junction, was recognized by the element protein and cleaved in a Holliday junction resolvase-like reaction. Cleavage of the 4-way junction resulted in a natural primer-template pairing used for second-strand DNA synthesis. A new model for RLE LINE integration is presented.
Collapse
Affiliation(s)
- Brijesh B Khadgi
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Shawn M Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| |
Collapse
|
5
|
Bonandin L, Scavariello C, Mingazzini V, Luchetti A, Mantovani B. Obligatory parthenogenesis and TE load: Bacillus stick insects and the R2 non-LTR retrotransposon. INSECT SCIENCE 2017; 24:409-417. [PMID: 26813995 DOI: 10.1111/1744-7917.12322] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Revised: 12/17/2015] [Accepted: 01/03/2016] [Indexed: 06/05/2023]
Abstract
Transposable elements (TEs) are selfish genetic elements whose self-replication is contrasted by the host genome. In this context, host reproductive strategies are predicted to impact on both TEs load and activity. The presence and insertion distribution of the non-LTR retrotransposon R2 was here studied in populations of the strictly bisexual Bacillus grandii maretimi and of the obligatory parthenogenetic Bacillus atticus atticus. Furthermore, data were also obtained from the offspring of selected B. a. atticus females. At the population level, the gonochoric B. g. maretimi showed a significantly higher R2 load than the obligatory parthenogenetic B. a. atticus. The comparison with bisexual and unisexual Bacillus rossius populations showed that their values were higher than those recorded for B. a. atticus and similar, or even higher, than those of B. g. maretimi. Consistently, an R2 load reduction is scored in B. a. atticus offspring even if with a great variance. On the whole, data here produced indicate that in the obligatory unisexual B. a. atticus R2 is active and that mechanisms of molecular turnover are effective. Furthermore, progeny analyses show that, at variance of the facultative parthenogenetic B. rossius, the R2 activity is held at a lower rate. Modeling parental-offspring inheritance, suggests that in B. a. atticus recombination plays a major role in eliminating insertions rather than selection, as previously suggested for unisexual B. rossius progeny, even if in both cases a high variance is observed. In addition to this, mechanisms of R2 silencing or chances of clonal selection cannot be ruled out.
Collapse
Affiliation(s)
- Livia Bonandin
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Claudia Scavariello
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Valentina Mingazzini
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| |
Collapse
|
6
|
Scavariello C, Luchetti A, Martoni F, Bonandin L, Mantovani B. Hybridogenesis and a potential case of R2 non-LTR retrotransposon horizontal transmission in Bacillus stick insects (Insecta Phasmida). Sci Rep 2017; 7:41946. [PMID: 28165062 PMCID: PMC5292737 DOI: 10.1038/srep41946] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 01/04/2017] [Indexed: 01/16/2023] Open
Abstract
Horizontal transfer (HT) is an event in which the genetic material is transferred from one species to another, even if distantly related, and it has been demonstrated as a possible essential part of the lifecycle of transposable elements (TEs). However, previous studies on the non-LTR R2 retrotransposon, a metazoan-wide distributed element, indicated its vertical transmission since the Radiata-Bilateria split. Here we present the first possible instances of R2 HT in stick insects of the genus Bacillus (Phasmida). Six R2 elements were characterized in the strictly bisexual subspecies B. grandii grandii, B. grandii benazzii and B. grandii maretimi and in the obligatory parthenogenetic taxon B. atticus. These elements were compared with those previously retrieved in the facultative parthenogenetic species B. rossius. Phylogenetic inconsistencies between element and host taxa, and age versus divergence analyses agree and support at least two HT events. These HT events can be explained by taking into consideration the complex Bacillus reproductive biology, which includes also hybridogenesis, gynogenesis and androgenesis. Through these non-canonical reproductive modes, R2 elements may have been transferred between Bacillus genomes. Our data suggest, therefore, a possible role of hybridization for TEs survival and the consequent reshaping of involved genomes.
Collapse
Affiliation(s)
- Claudia Scavariello
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Francesco Martoni
- Bio-Protection Research Centre, Lincoln University, Lincoln 7647, New Zealand
| | - Livia Bonandin
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| |
Collapse
|
7
|
Functionally conserved RNA-binding and protein-protein interaction properties of LINE-ORF1p in an ancient clade of non-LTR retrotransposons of Entamoeba histolytica. Mol Biochem Parasitol 2017; 211:84-93. [DOI: 10.1016/j.molbiopara.2016.11.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Revised: 11/17/2016] [Accepted: 11/24/2016] [Indexed: 11/23/2022]
|
8
|
Kojima KK, Seto Y, Fujiwara H. The Wide Distribution and Change of Target Specificity of R2 Non-LTR Retrotransposons in Animals. PLoS One 2016; 11:e0163496. [PMID: 27662593 PMCID: PMC5035012 DOI: 10.1371/journal.pone.0163496] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 09/09/2016] [Indexed: 12/23/2022] Open
Abstract
Transposons, or transposable elements, are the major components of genomes in most eukaryotes. Some groups of transposons have developed target specificity that limits the integration sites to a specific nonessential sequence or a genomic region to avoid gene disruption caused by insertion into an essential gene. R2 is one of the most intensively investigated groups of sequence-specific non-LTR retrotransposons and is inserted at a specific site inside of 28S ribosomal RNA (rRNA) genes. R2 is known to be distributed among at least six animal phyla even though its occurrence is reported to be patchy. Here, in order to obtain a more detailed picture of the distribution of R2, we surveyed R2 using both in silico screening and degenerate PCR, particularly focusing on actinopterygian fish. We found two families of the R2C lineage from vertebrates, although it has previously only been found in platyhelminthes. We also revealed the apparent movement of insertion sites of a lineage of actinopterygian R2, which was likely concurrent with the acquisition of a 28S rRNA-derived sequence in their 3' UTR. Outside of actinopterygian fish, we revealed the maintenance of a single R2 lineage in birds; the co-existence of four lineages of R2 in the leafcutter bee Megachile rotundata; the first examples of R2 in Ctenophora, Mollusca, and Hemichordata; and two families of R2 showing no target specificity. These findings indicate that R2 is relatively stable and universal, while differences in the distribution and maintenance of R2 lineages probably reflect characteristics of some combination of both R2 lineages and host organisms.
Collapse
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute, Mountain View, CA, 94043, United States of America
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, 277–8562, Japan
- Department of Life Sciences, National Cheng Kung University, Tainan, 701, Taiwan
- * E-mail:
| | - Yosuke Seto
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, 277–8562, Japan
| | - Haruhiko Fujiwara
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, 277–8562, Japan
| |
Collapse
|
9
|
Both the Exact Target Site Sequence and a Long Poly(A) Tail Are Required for Precise Insertion of the 18S Ribosomal DNA-Specific Non-Long Terminal Repeat Retrotransposon R7Ag. Mol Cell Biol 2016; 36:1494-508. [PMID: 26976636 DOI: 10.1128/mcb.00970-15] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 03/02/2016] [Indexed: 11/20/2022] Open
Abstract
Ribosomal elements (R elements) are site-specific non-long terminal repeat (LTR) retrotransposons that target ribosomal DNA (rDNA). To elucidate how R elements specifically access their target sites, we isolated and characterized the 18S rDNA-specific R element R7Ag from Anopheles gambiae Using an in vivo and ex vivo recombinant baculovirus retrotransposition system, we found that the exact host 18S rDNA sequence at the target site is essential for the precise insertion of R7Ag. In addition, a long poly(A) tail is necessary for the accurate initiation of R7Ag reverse transcription, a novel mechanism found in non-LTR elements. We further compared the subcellular localizations of proteins in R7Ag as well as R1Bm, another R element that targets 28S rDNA. Although the open reading frame 1 proteins (ORF1ps) of both R7Ag and R1Bm localized predominantly in the cytoplasm, ORF2 proteins (ORF2ps) colocalized in the nucleus with the nucleolar marker fibrillarin. The ORF1ps and ORF2ps of both R elements colocalized largely in the nuclear periphery and to a lesser extent within the nucleus. These results suggest that R7Ag and R1Bm proteins may access nucleolar rDNA targets in an ORF2p-dependent manner.
Collapse
|
10
|
Govindaraju A, Cortez JD, Reveal B, Christensen SM. Endonuclease domain of non-LTR retrotransposons: loss-of-function mutants and modeling of the R2Bm endonuclease. Nucleic Acids Res 2016; 44:3276-87. [PMID: 26961309 PMCID: PMC4838377 DOI: 10.1093/nar/gkw134] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Revised: 02/22/2016] [Accepted: 02/23/2016] [Indexed: 01/07/2023] Open
Abstract
Non-LTR retrotransposons are an important class of mobile elements that insert into host DNA by target-primed reverse transcription (TPRT). Non-LTR retrotransposons must bind to their mRNA, recognize and cleave their target DNA, and perform TPRT at the site of DNA cleavage. As DNA binding and cleavage are such central parts of the integration reaction, a better understanding of the endonuclease encoded by non-LTR retrotransposons is needed. This paper explores the R2 endonuclease domain from Bombyx mori using in vitro studies and in silico modeling. Mutations in conserved sequences located across the putative PD-(D/E)XK endonuclease domain reduced DNA cleavage, DNA binding and TPRT. A mutation at the beginning of the first α-helix of the modeled endonuclease obliterated DNA cleavage and greatly reduced DNA binding. It also reduced TPRT when tested on pre-cleaved DNA substrates. The catalytic K was located to a non-canonical position within the second α-helix. A mutation located after the fourth β-strand reduced DNA binding and cleavage. The motifs that showed impaired activity form an extensive basic region. The R2 biochemical and structural data are compared and contrasted with that of two other well characterized PD-(D/E)XK endonucleases, restriction endonucleases and archaeal Holliday junction resolvases.
Collapse
Affiliation(s)
- Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Jeremy D. Cortez
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Brad Reveal
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Shawn M. Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| |
Collapse
|
11
|
Abstract
Although most of non-long terminal repeat (non-LTR) retrotransposons are incorporated in the host genome almost randomly, some non-LTR retrotransposons are incorporated into specific sequences within a target site. On the basis of structural and phylogenetic features, non-LTR retrotransposons are classified into two large groups, restriction enzyme-like endonuclease (RLE)-encoding elements and apurinic/apyrimidinic endonuclease (APE)-encoding elements. All clades of RLE-encoding non-LTR retrotransposons include site-specific elements. However, only two of more than 20 APE-encoding clades, Tx1 and R1, contain site-specific non-LTR elements. Site-specific non-LTR retrotransposons usually target within multi-copy RNA genes, such as rRNA gene (rDNA) clusters, or repetitive genomic sequences, such as telomeric repeats; this behavior may be a symbiotic strategy to reduce the damage to the host genome. Site- and sequence-specificity are variable even among closely related non-LTR elements and appeared to have changed during evolution. In the APE-encoding elements, the primary determinant of the sequence- specific integration is APE itself, which nicks one strand of the target DNA during the initiation of target primed reverse transcription (TPRT). However, other factors, such as interaction between mRNA and the target DNA, and access to the target region in the nuclei also affect the sequence-specificity. In contrast, in the RLE-encoding elements, DNA-binding motifs appear to affect their sequence-specificity, rather than the RLE domain itself. Highly specific integration properties of these site-specific non-LTR elements make them ideal alternative tools for sequence-specific gene delivery, particularly for therapeutic purposes in human diseases.
Collapse
|
12
|
Abstract
R2 elements are sequence specific non-LTR retrotransposons that exclusively insert in the 28S rRNA genes of animals. R2s encode an endonuclease that cleaves the insertion site and a reverse transcriptase that uses the cleaved DNA to prime reverse transcription of the R2 transcript, a process termed target primed reverse transcription. Additional unusual properties of the reverse transcriptase as well as DNA and RNA binding domains of the R2 encoded protein have been characterized. R2 expression is through co-transcription with the 28S gene and self-cleavage by a ribozyme encoded at the R2 5' end. Studies in laboratory stocks and natural populations of Drosophila suggest that R2 expression is tied to the distribution of R2-inserted units within the rDNA locus. Most individuals have no R2 expression because only a small fraction of their rRNA genes need to be active, and a contiguous region of the locus free of R2 insertions can be selected for activation. However, if the R2-free region is not large enough to produce sufficient rRNA, flanking units - including those inserted with R2 - must be activated. Finally, R2 copies rapidly turnover within the rDNA locus, yet R2 has been vertically maintained in animal lineages for hundreds of millions of years. The key to this stability is R2's ability to remain dormant in rDNA units outside the transcribed regions for generations until the stochastic nature of the crossovers that drive the concerted evolution of the rDNA locus inevitably reshuffle the inserted and uninserted units, resulting in transcription of the R2-inserted units.
Collapse
|
13
|
Kojima KK, Jurka J. Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia. PLoS One 2015; 10:e0140084. [PMID: 26556480 PMCID: PMC4640811 DOI: 10.1371/journal.pone.0140084] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 09/21/2015] [Indexed: 11/22/2022] Open
Abstract
Most non-long terminal repeat (non-LTR) retrotransposons encoding a restriction-like endonuclease show target-specific integration into repetitive sequences such as ribosomal RNA genes and microsatellites. However, only a few target-specific lineages of non-LTR retrotransposons are distributed widely and no lineage is found across the eukaryotic kingdoms. Here we report the most widely distributed lineage of target sequence-specific non-LTR retrotransposons, designated Utopia. Utopia is found in three supergroups of eukaryotes: Amoebozoa, SAR, and Opisthokonta. Utopia is inserted into a specific site of U2 small nuclear RNA genes with different strength of specificity for each family. Utopia families from oomycetes and wasps show strong target specificity while only a small number of Utopia copies from reptiles are flanked with U2 snRNA genes. Oomycete Utopia families contain an “archaeal” RNase H domain upstream of reverse transcriptase (RT), which likely originated from a plant RNase H gene. Analysis of Utopia from oomycetes indicates that multiple lineages of Utopia have been maintained inside of U2 genes with few copy numbers. Phylogenetic analysis of RT suggests the monophyly of Utopia, and it likely dates back to the early evolution of eukaryotes.
Collapse
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute, Los Altos, California, United States of America
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Minato-ku, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Minato-ku, Tokyo, Japan
- * E-mail:
| | - Jerzy Jurka
- Genetic Information Research Institute, Los Altos, California, United States of America
| |
Collapse
|
14
|
Eickbush DG, Burke WD, Eickbush TH. Evolution of the R2 retrotransposon ribozyme and its self-cleavage site. PLoS One 2013; 8:e66441. [PMID: 24066021 PMCID: PMC3774820 DOI: 10.1371/journal.pone.0066441] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 05/07/2013] [Indexed: 12/23/2022] Open
Abstract
R2 is a non-long terminal repeat retrotransposon that inserts site-specifically in the tandem 28S rRNA genes of many animals. Previously, R2 RNA from various species of Drosophila was shown to self-cleave from the 28S rRNA/R2 co-transcript by a hepatitis D virus (HDV)-like ribozyme encoded at its 5' end. RNA cleavage was at the precise 5' junction of the element with the 28S gene. Here we report that RNAs encompassing the 5' ends of R2 elements from throughout its species range fold into HDV-like ribozymes. In vitro assays of RNA self-cleavage conducted in many R2 lineages confirmed activity. For many R2s, RNA self-cleavage was not at the 5' end of the element but at 28S rRNA sequences up to 36 nucleotides upstream of the junction. The location of cleavage correlated well with the types of endogenous R2 5' junctions from different species. R2 5' junctions were uniform for most R2s in which RNA cleavage was upstream in the rRNA sequences. The 28S sequences remaining on the first DNA strand synthesized during retrotransposition are postulated to anneal to the target site and uniformly prime second strand DNA synthesis. In species where RNA cleavage occurred at the R2 5' end, the 5' junctions were variable. This junction variation is postulated to result from the priming of second strand DNA synthesis by chance microhomologies between the target site and the first DNA strand. Finally, features of R2 ribozyme evolution, especially changes in cleavage site and convergence on the same active site sequences, are discussed.
Collapse
Affiliation(s)
- Danna G. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - William D. Burke
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Thomas H. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
- * E-mail:
| |
Collapse
|
15
|
A superfamily of DNA transposons targeting multicopy small RNA genes. PLoS One 2013; 8:e68260. [PMID: 23874566 PMCID: PMC3706591 DOI: 10.1371/journal.pone.0068260] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Accepted: 05/29/2013] [Indexed: 01/29/2023] Open
Abstract
Target-specific integration of transposable elements for multicopy genes, such as ribosomal RNA and small nuclear RNA (snRNA) genes, is of great interest because of the relatively harmless nature, stable inheritance and possible application for targeted gene delivery of target-specific transposable elements. To date, such strict target specificity has been observed only among non-LTR retrotransposons. We here report a new superfamily of sequence-specific DNA transposons, designated Dada. Dada encodes a DDE-type transposase that shows a distant similarity to transposases encoded by eukaryotic MuDR, hAT, P and Kolobok transposons, as well as the prokaryotic IS256 insertion element. Dada generates 6-7 bp target site duplications upon insertion. One family of Dada DNA transposons targets a specific site inside the U6 snRNA genes and are found in various fish species, water flea, oyster and polycheate worm. Other target sequences of the Dada transposons are U1 snRNA genes and different tRNA genes. The targets are well conserved in multicopy genes, indicating that copy number and sequence conservation are the primary constraints on the target choice of Dada transposons. Dada also opens a new frontier for target-specific gene delivery application.
Collapse
|
16
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
17
|
Non-LTR R2 element evolutionary patterns: phylogenetic incongruences, rapid radiation and the maintenance of multiple lineages. PLoS One 2013; 8:e57076. [PMID: 23451148 PMCID: PMC3581529 DOI: 10.1371/journal.pone.0057076] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Accepted: 01/17/2013] [Indexed: 01/02/2023] Open
Abstract
Retrotransposons of the R2 superclade specifically insert within the 28S ribosomal gene. They have been isolated from a variety of metazoan genomes and were found vertically inherited even if their phylogeny does not always agree with that of the host species. This was explained with the diversification/extinction of paralogous lineages, being proved the absence of horizontal transfer. We here analyze the widest available collection of R2 sequences, either newly isolated from recently sequenced genomes or drawn from public databases, in a phylogenetic framework. Results are congruent with previous analyses, but new important issues emerge. First, the N-terminal end of the R2-B clade protein, so far unknown, presents a new zinc fingers configuration. Second, the phylogenetic pattern is consistent with an ancient, rapid radiation of R2 lineages: being the estimated time of R2 origin (850–600 Million years ago) placed just before the metazoan Cambrian explosion, the wide element diversity and the incongruence with the host phylogeny could be attributable to the sudden expansion of available niches represented by host’s 28S ribosomal genes. Finally, we detect instances of coexisting multiple R2 lineages showing a non-random phylogenetic pattern, strongly similar to that of the “library” model known for tandem repeats: a collection of R2s were present in the ancestral genome and then differentially activated/repressed in the derived species. Models for activation/repression as well as mechanisms for sequence maintenance are also discussed within this framework.
Collapse
|
18
|
A population genetic model for the maintenance of R2 retrotransposons in rRNA gene loci. PLoS Genet 2013; 9:e1003179. [PMID: 23326244 PMCID: PMC3542110 DOI: 10.1371/journal.pgen.1003179] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 11/02/2012] [Indexed: 01/03/2023] Open
Abstract
R2 retrotransposable elements exclusively insert into the tandemly repeated rRNA genes, the rDNA loci, of their animal hosts. R2 elements form stable long-term associations with their host, in which all individuals in a population contain many potentially active copies, but only a fraction of these individuals show active R2 retrotransposition. Previous studies have found that R2 RNA transcripts are processed from a 28S co-transcript and that the likelihood of R2-inserted units being transcribed is dependent upon their distribution within the rDNA locus. Here we analyze the rDNA locus and R2 elements from nearly 100 R2-active and R2-inactive individuals from natural populations of Drosophila simulans. Along with previous findings concerning the structure and expression of the rDNA loci, these data were incorporated into computer simulations to model the crossover events that give rise to the concerted evolution of the rRNA genes. The simulations that best reproduce the population data assume that only about 40 rDNA units out of the over 200 total units are actively transcribed and that these transcribed units are clustered in a single region of the locus. In the model, the host establishes this transcription domain at each generation in the region with the fewest R2 insertions. Only if the host cannot avoid R2 insertions within this 40-unit domain are R2 elements active in that generation. The simulations also require that most crossover events in the locus occur in the transcription domain in order to explain the empirical observation that R2 elements are seldom duplicated by crossover events. Thus the key to the long-term stability of R2 elements is the stochastic nature of the crossover events within the rDNA locus, and the inevitable expansions and contractions that introduce and remove R2-inserted units from the transcriptionally active domain. Selfish transposable elements survive in eukaryotic genomes despite the elaborate mechanisms developed by the hosts to limit their activity. One accessible system that simplifies the complex interactions between element and host involves the R2 elements, which exclusively insert in the tandemly arranged rRNA genes. R2 exhibits remarkable stability in animal lineages even though each insertion inactivates one rRNA gene. Here we determine the size of the rDNA locus and R2 number in natural isolates of Drosophila simulans. Combined with previous data concerning the expression and regulation of R2, we develop a detailed population genetic model for rRNA gene and R2 evolution that duplicates all properties of the rRNA loci in natural populations. Critical components of the model are that only a contiguous 40 unit array of rRNA gene units are needed for transcription, that R2 elements are active only when present in this transcription domain, and that most of the crossovers in the rDNA loci occur in this domain. These results suggest that the key to the long-term survival of R2 is the redistribution of rDNA units in the locus brought about by the crossovers that maintain sequence identity in all rDNA units.
Collapse
|
19
|
Luchetti A, Mingazzini V, Mantovani B. 28S junctions and chimeric elements of the rDNA targeting non-LTR retrotransposon R2 in crustacean living fossils (Branchiopoda, Notostraca). Genomics 2012; 100:51-6. [PMID: 22564473 DOI: 10.1016/j.ygeno.2012.04.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2012] [Revised: 04/17/2012] [Accepted: 04/23/2012] [Indexed: 11/29/2022]
Abstract
The 28S rRNA genes of several metazoans are interrupted by site-specific targeting non-LTR retrotransposons, such as R2. R2 elements have been deeply analyzed but aspects of their retrotransposition mechanism and the origin of the wide diversity observed are still debated. We characterized six new R2 lineages in four tadpole shrimp species (Notostraca), samples deriving from a parthenogenetic population of Triops cancriformis (R2Tc_it) and from bisexual Lepidurus populations of L. lubbocki (R2Ll), L. couesii (R2LcA, R2LcB, R2LcC) and L. arcticus (R2La). All elements fit the canonical R2 structure but R2Ll which turned out to be a chimera with an additional ORF originating from another R2. Consistently with data on LINEs, R2Ll could be the result of recombination due to reverse transcriptase template jump. The analysis of 28S/R2 5' end junctions further suggests aberrant homologous recombination, as observed in RNA viruses.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dip. Biologia Evoluzionistica Sperimentale, Università di Bologna, via Selmi 3, 40126 Bologna, Italy.
| | | | | |
Collapse
|
20
|
Eagle SH, Crease TJ. Copy number variation of ribosomal DNA and Pokey transposons in natural populations of Daphnia. Mob DNA 2012; 3:4. [PMID: 22390386 PMCID: PMC3315735 DOI: 10.1186/1759-8753-3-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2011] [Accepted: 03/05/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Despite their ubiquity and high diversity in eukaryotic genomes, DNA transposons are rarely encountered in ribosomal DNA (rDNA). In contrast, R-elements, a diverse group of non-LTR retrotransposons, specifically target rDNA. Pokey is a DNA transposon that targets a specific rDNA site, but also occurs in many other genomic locations, unlike R-elements. However, unlike most DNA transposons, Pokey has been a stable component of Daphnia genomes for over 100 million years. Here we use qPCR to estimate the number of 18S and 28S ribosomal RNA genes and Pokey elements in rDNA (rPokey), as well as other genomic locations (gPokey) in two species of Daphnia. Our goals are to estimate the correlation between (1) the number of 18S and 28S rRNA genes, (2) the number of 28S genes and rPokey, and (3) the number of rPokey and gPokey. In addition, we ask whether Pokey number and distribution in both genomic compartments are affected by differences in life history between D. pulex and D. pulicaria. RESULTS We found differences in 18S and 28S gene number within isolates that are too large to be explained by experimental variation. In general, Pokey number within isolates is modest (< 20), and most are gPokey. There is no correlation between the number of rRNA genes and rPokey, or between rPokey and gPokey. However, we identified three isolates with unusually high numbers of both rPokey and gPokey, which we infer is a consequence of recent transposition. We also detected other rDNA insertions (rInserts) that could be degraded Pokey elements, R- elements or the divergent PokeyB lineage recently detected in the Daphnia genome sequence. Unlike rPokey, rInserts are positively correlated with rRNA genes, suggesting that they are amplified by the same mechanisms that amplify rDNA units even though rPokey is not. Overall, Pokey frequency and distribution are similar in D. pulex and D. pulicaria suggesting that differences in life history have no impact on Pokey. CONCLUSIONS The possibility that many rDNA units do not contain a copy of both 18S and 28S genes suggests that rDNA is much more complicated than once thought, and warrants further study. In addition, the lack of correlation between rPokey, gPokey and rDNA unit numbers suggests that Pokey transposition rate is generally very low, and that recombination, in combination with natural selection, eliminates rPokey much faster than gPokey. Our results suggest that further research to determine the mechanisms by which Pokey has escaped complete inactivation by its host (the usual fate of DNA transposons), would provide important insights into transposon biology.
Collapse
Affiliation(s)
- Shannon Hc Eagle
- Department of Integrative Biology, University of Guelph, Guelph, ON N1G 2W1, Canada.
| | | |
Collapse
|
21
|
Shivram H, Cawley D, Christensen SM. Targeting novel sites: The N-terminal DNA binding domain of non-LTR retrotransposons is an adaptable module that is implicated in changing site specificities. Mob Genet Elements 2011; 1:169-178. [PMID: 22479684 PMCID: PMC3312299 DOI: 10.4161/mge.1.3.18453] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Revised: 10/16/2011] [Accepted: 10/17/2011] [Indexed: 02/07/2023] Open
Abstract
Restriction-like endonuclease (RLE) bearing non-LTR retrotransposons are site-specific elements that integrate into the genome through target primed reverse transcription (TPRT). RLE-bearing elements have been used as a model system for investigating non-LTR retrotransposon integration. R2 elements target a specific site in the 28S rDNA gene. We previously demonstrated that the two major sub-classes of R2 (R2-A and R2-D) target the R2 insertion site in an opposing manner with regard to the pairing of known DNA binding domains and bound sequences-indicating that the A- and D-clades represent independently derived modes of targeting that site. Elements have been discovered that group phylogenetically with R2 but do not target the canonical R2 site. Here we extend our earlier studies to show that a separate R2-A clade element, which targets a site other than the canonical R2 site, does so by using the N-terminal zinc fingers and Myb motifs. We further extend our targeting studies beyond R2 clade elements by investigating the ability of the N-terminal zinc fingers from the nematode NeSL-1 element to target its integration site. Our data are consistent with the use of an N-terminal DNA binding domain as one of the major targeting determinants used by RLE-bearing non-LTR retrotransposons to secure a protein subunit near the insertion site. This N-terminal DNA binding domain can undergo modifications, allowing the element to target novel sites. The binding orientation of the N-terminal domain relative to the insertion site is quite variable.
Collapse
Affiliation(s)
- Haridha Shivram
- Department of Biology; University of Texas at Arlington; Arlington, TX USA
| | | | | |
Collapse
|
22
|
Moss WN, Eickbush DG, Lopez MJ, Eickbush TH, Turner DH. The R2 retrotransposon RNA families. RNA Biol 2011; 8:714-8. [PMID: 21734471 DOI: 10.4161/rna.8.5.16033] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Analysis of the R2 retrotransposons from multiple silkmoth and fruitfly species have revealed three segments that contain conserved RNA secondary structures. These conserved structures play important roles in the propagation of the R2 element, including R2 RNA processing and transposon integration into the host genome as well as a likely role in translation. Two of the structured regions comprise protein binding sites: one is located in the 3' UTR and the other is in the 5' UTR close to the putative start of the R2 open reading frame (ORF). The 3' structure was deduced from chemical mapping and sequence comparison. The 5' structure was determined using a combination of chemical mapping, oligonucleotide binding, NMR and sequence analysis and contains an unusual pseudoknot structure. The third structure occurs at the 5' end of the R2 RNA and is responsible for self-cleavage of the 5' end of the element from a 28S ribosomal RNA co-transcript. A structure for this fragment was proposed based on motif searching and sequence comparison. There is remarkable similarity in sequence and structure to the hepatitis delta virus (HDV) ribozyme. Seed alignments for the 5' structure and the R2 ribozyme, containing representative sequences and consensus structures, have been submitted to the Rfam database.
Collapse
Affiliation(s)
- Walter N Moss
- Department of Chemistry, University of Rochester, Rochester, NY, USA.
| | | | | | | | | |
Collapse
|
23
|
Thompson BK, Christensen SM. Independently derived targeting of 28S rDNA by A- and D-clade R2 retrotransposons: Plasticity of integration mechanism. Mob Genet Elements 2011; 1:29-37. [PMID: 22016843 PMCID: PMC3190273 DOI: 10.4161/mge.1.1.16485] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2011] [Revised: 05/16/2011] [Accepted: 05/16/2011] [Indexed: 12/24/2022] Open
Abstract
Restriction-like endonuclease (RLE) bearing non-LTR retrotransposons are site-specific elements that integrate into the genome through a target primed reverse transcription mechanism (TPRT). R2 elements have been used as a model system for investigating non-LTR retrotransposon integration. We previously demonstrated that R2 retrotransposons require two subunits of the element-encoded multifunctional protein to integrate-one subunit bound upstream of the insertion site and one bound downstream. R2 elements have been phylogenetically categorized into four clades: R2-A, B, C and D, that diverged from a common ancestor more than 850 million years ago. All R2 elements target the same sequence within 28S rDNA. The amino-terminal domain of R2Bm, an R2-D clade element, contains a single zinc finger and a Myb motif that are responsible for binding R2 protein downstream of the insertion site. Target site recognition is of interest as it is the first step in the integration reaction and may help elucidate evolutionary history and integration mechanism. The amino-terminal domain of R2-A clade members contains three zinc fingers and a Myb motif. We show here that R2Lp, an R2-A clade member, uses its amino-terminal DNA binding motifs to bind upstream of the insertion site. Because the R2-A and R2-D clade elements recognize 28S rDNA differently, we conclude the A- and D-clades represent independent targeting events to the 28S site. Our results also indicate a certain plasticity of insertional mechanics exists between the two clades.
Collapse
Affiliation(s)
- Blaine K Thompson
- Department of Biology; University of Texas at Arlington; Arlington, TX USA
| | | |
Collapse
|
24
|
Ghesini S, Luchetti A, Marini M, Mantovani B. The Non-LTR Retrotransposon R2 in Termites (Insecta, Isoptera): Characterization and Dynamics. J Mol Evol 2011; 72:296-305. [DOI: 10.1007/s00239-011-9430-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Accepted: 01/03/2011] [Indexed: 01/07/2023]
|
25
|
R2 dynamics in Triops cancriformis (Bosc, 1801) (Crustacea, Branchiopoda, Notostraca): turnover rate and 28S concerted evolution. Heredity (Edinb) 2010; 106:567-75. [PMID: 20628416 DOI: 10.1038/hdy.2010.86] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The R2 retrotransposon is here characterized in bisexual populations of the European crustacean Triops cancriformis. The isolated element matches well with the general aspects of the R2 family and it is highly differentiated from that of the congeneric North American Triops longicaudatus. The analysis of 5' truncations indicates that R2 dynamics in T. cancriformis populations show a high turnover rate as observed in Drosophila simulans. For the first time in the literature, though, individuals harboring truncation variants, but lacking the complete element, are found. Present results suggest that transposition-mediated deletion mechanisms, possibly involving genomic turnover processes acting on rDNAs, can dramatically decrease the copy number or even delete R2 from the ribosomal locus. The presence of R2 does not seem to impact on the nucleotide variation of inserted 28S rDNA with respect to the uninserted genes. On the other hand, a low level of polymorphism characterizes rDNA units because new 28S variants continuously spread across the ribosomal array. Again, the interplay between transposition-mediated deletion and molecular drive may explain this pattern.
Collapse
|
26
|
R2 retrotransposons encode a self-cleaving ribozyme for processing from an rRNA cotranscript. Mol Cell Biol 2010; 30:3142-50. [PMID: 20421411 DOI: 10.1128/mcb.00300-10] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The non-long terminal repeat (non-LTR) retrotransposon R2 is inserted into the 28S rRNA genes of many animals. Expression of the element appears to be by cotranscription with the rRNA gene unit. We show here that processing of the rRNA cotranscript at the 5' end of the R2 element in Drosophila simulans is rapid and utilizes an unexpected mechanism. Using RNA synthesized in vitro, the 5' untranslated region of R2 was shown capable of rapid and efficient self-cleavage of the 28S-R2 cotranscript. The 5' end generated in vitro by the R2 ribozyme was at the position identical to that found for in vivo R2 transcripts. The RNA segment corresponding to the R2 ribozyme could be folded into a double pseudoknot structure similar to that of the hepatitis delta virus (HDV) ribozyme. Remarkably, 21 of the nucleotide positions in and around the active site of the HDV ribozyme were identical in R2. R2 elements from other Drosophila species were also shown to encode HDV-like ribozymes capable of self-cleavage. Tracing their sequence evolution in the Drosophila lineage suggests that the extensive similarity of the R2 ribozyme from D. simulans to that of HDV was a result of convergent evolution, not common descent.
Collapse
|
27
|
Stage DE, Eickbush TH. Maintenance of multiple lineages of R1 and R2 retrotransposable elements in the ribosomal RNA gene loci of Nasonia. INSECT MOLECULAR BIOLOGY 2010; 19 Suppl 1:37-48. [PMID: 20167016 DOI: 10.1111/j.1365-2583.2009.00949.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Sequencing reads from the Nasonia genome project were used to study the ribosomal RNA gene loci and the retrotransposons R1 and R2 that insert specifically into the 28S genes. Five highly divergent R1 and five highly divergent R2 families were identified in the three sequenced species, as well as a non-autonomous element that appears to use the retrotransposition machinery of R1. A duplication of the R1 target site within the spacer region of the rDNA units was also found to be extensively utilized by R1 elements. We document numerous instances where the R1 and R2 families appropriated parts of the retrotransposition machinery of other lineages and speculate that this enables rapid adaptation and the maintenance of multiple R1 and R2 families.
Collapse
Affiliation(s)
- D E Stage
- Biology Department, University of Rochester, Rochester, NY 14627, USA
| | | |
Collapse
|
28
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
29
|
Gladyshev EA, Arkhipova IR. Rotifer rDNA-specific R9 retrotransposable elements generate an exceptionally long target site duplication upon insertion. Gene 2009; 448:145-50. [DOI: 10.1016/j.gene.2009.08.016] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2009] [Revised: 07/19/2009] [Accepted: 08/27/2009] [Indexed: 11/27/2022]
|
30
|
Kapitonov VV, Tempel S, Jurka J. Simple and fast classification of non-LTR retrotransposons based on phylogeny of their RT domain protein sequences. Gene 2009; 448:207-13. [PMID: 19651192 DOI: 10.1016/j.gene.2009.07.019] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 07/19/2009] [Accepted: 07/22/2009] [Indexed: 11/29/2022]
Abstract
Rapidly growing number of sequenced genomes requires fast and accurate computational tools for analysis of different transposable elements (TEs). In this paper we focus on a rapid and reliable procedure for classification of autonomous non-LTR retrotransposons based on alignment and clustering of their reverse transcriptase (RT) domains. Typically, the RT domain protein sequences encoded by different non-LTR retrotransposons are similar to each other in terms of significant BLASTP E-values. Therefore, they can be easily detected by the routine BLASTP searches of genomic DNA sequences coding for proteins similar to the RT domains of known non-LTR retrotransposons. However, detailed classification of non-LTR retrotransposons, i.e. their assignment to specific clades, is a slow and complex procedure that is not formalized or integrated as a standard set of computational methods and data. Here we describe a tool (RTclass1) designed for the fast and accurate automated assignment of novel non-LTR retrotransposons to known or novel clades using phylogenetic analysis of the RT domain protein sequences. RTclass1 classifies a particular non-LTR retrotransposon based on its RT domain in less than 10 min on a standard desktop computer and achieves 99.5% accuracy. RT1class1 works either as a stand-alone program installed locally or as a web-server that can be accessed distantly by uploading sequence data through the internet (http://www.girinst.org/RTphylogeny/RTclass1).
Collapse
Affiliation(s)
- Vladimir V Kapitonov
- Genetic Information Research Institute, 1925 Landings Dr, Mountain View, CA 94041, USA.
| | | | | |
Collapse
|
31
|
Stage DE, Eickbush TH. Origin of nascent lineages and the mechanisms used to prime second-strand DNA synthesis in the R1 and R2 retrotransposons of Drosophila. Genome Biol 2009; 10:R49. [PMID: 19416522 PMCID: PMC2718515 DOI: 10.1186/gb-2009-10-5-r49] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Revised: 03/27/2009] [Accepted: 05/05/2009] [Indexed: 01/31/2023] Open
Abstract
Comparative analysis of 12 Drosophila genomes reveals insights into the evolution and mechanism of integration of R1 and R2 retrotransposons. Background Most arthropods contain R1 and R2 retrotransposons that specifically insert into the 28S rRNA genes. Here, the sequencing reads from 12 Drosophila genomes have been used to address two questions concerning these elements. First, to what extent is the evolution of these elements subject to the concerted evolution process that is responsible for sequence homogeneity among the different copies of rRNA genes? Second, how precise are the target DNA cleavages and priming of DNA synthesis used by these elements? Results Most copies of R1 and R2 in each species were found to exhibit less than 0.2% sequence divergence. However, in many species evidence was obtained for the formation of distinct sublineages of elements, particularly in the case of R1. Analysis of the hundreds of R1 and R2 junctions with the 28S gene revealed that cleavage of the first DNA strand was precise both in location and the priming of reverse transcription. Cleavage of the second DNA strand was less precise within a species, differed between species, and gave rise to variable priming mechanisms for second strand synthesis. Conclusions These findings suggest that the high sequence identity amongst R1 and R2 copies is because all copies are relatively new. However, each active element generates its own independent lineage that can eventually populate the locus. Independent lineages occur more often with R1, possibly because these elements contain their own promoter. Finally, both R1 and R2 use imprecise, rapidly evolving mechanisms to cleave the second strand and prime second strand synthesis.
Collapse
Affiliation(s)
- Deborah E Stage
- Biology Department, University of Rochester, Rochester NY 14627-0211, USA.
| | | |
Collapse
|
32
|
Jo SH, Koo DH, Kim JF, Hur CG, Lee S, Yang TJ, Kwon SY, Choi D. Evolution of ribosomal DNA-derived satellite repeat in tomato genome. BMC PLANT BIOLOGY 2009; 9:42. [PMID: 19351415 PMCID: PMC2679016 DOI: 10.1186/1471-2229-9-42] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2008] [Accepted: 04/08/2009] [Indexed: 05/03/2023]
Abstract
BACKGROUND Tandemly repeated DNA, also called as satellite DNA, is a common feature of eukaryotic genomes. Satellite repeats can expand and contract dramatically, which may cause genome size variation among genetically-related species. However, the origin and expansion mechanism are not clear yet and needed to be elucidated. RESULTS FISH analysis revealed that the satellite repeat showing homology with intergenic spacer (IGS) of rDNA present in the tomato genome. By comparing the sequences representing distinct stages in the divergence of rDNA repeat with those of canonical rDNA arrays, the molecular mechanism of the evolution of satellite repeat is described. Comprehensive sequence analysis and phylogenetic analysis demonstrated that a long terminal repeat retrotransposon was interrupted into each copy of the 18S rDNA and polymerized by recombination rather than transposition via an RNA intermediate. The repeat was expanded through doubling the number of IGS into the 25S rRNA gene, and also greatly increasing the copy number of type I subrepeat in the IGS of 25-18S rDNA by segmental duplication. Homogenization to a single type of subrepeat in the satellite repeat was achieved as the result of amplifying copy number of the type I subrepeat but eliminating neighboring sequences including the type II subrepeat and rRNA coding sequence from the array. FISH analysis revealed that the satellite repeats are commonly present in closely-related Solanum species, but vary in their distribution and abundance among species. CONCLUSION These results represent that the dynamic satellite repeats were originated from intergenic spacer of rDNA unit in the tomato genome. This result could serve as an example towards understanding the initiation and the expansion of the satellite repeats in complex eukaryotic genome.
Collapse
MESH Headings
- DNA, Plant/genetics
- DNA, Ribosomal Spacer/genetics
- DNA, Satellite/genetics
- Evolution, Molecular
- Genome, Plant
- In Situ Hybridization, Fluorescence
- Solanum lycopersicum/genetics
- Phylogeny
- RNA, Ribosomal/genetics
- RNA, Ribosomal, 18S/genetics
- Retroelements
- Sequence Analysis, DNA
Collapse
Affiliation(s)
- Sung-Hwan Jo
- Plant Genome Research Center, KRIBB, Daejeon, 305-806, Korea
- Department of Functional Genomics, University of Science & Technology, Daejeon 305-333, Korea
- Omics and Integration Research Center, KRIBB, Daejeon, 305-806, Korea
| | - Dal-Hoe Koo
- Plant Genome Research Center, KRIBB, Daejeon, 305-806, Korea
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jihyun F Kim
- Systems Microbiology Research Center, KRIBB, Daejeon, 305-806, Korea
- Department of Functional Genomics, University of Science & Technology, Daejeon 305-333, Korea
| | - Cheol-Goo Hur
- Plant Genome Research Center, KRIBB, Daejeon, 305-806, Korea
- Omics and Integration Research Center, KRIBB, Daejeon, 305-806, Korea
| | - Sanghyeob Lee
- Plant Genome Research Center, KRIBB, Daejeon, 305-806, Korea
- Dongbu Advanced Research Institute, Dongbu HiTek Co, Ltd Daejeon 305-708, Korea
| | - Tae-jin Yang
- Department of Plant Science and Plant Genomics and Breeding Institute, Seoul National University, Seoul, 151-742, Korea
| | - Suk-Yoon Kwon
- Plant Genome Research Center, KRIBB, Daejeon, 305-806, Korea
| | - Doil Choi
- Department of Plant Science and Plant Genomics and Breeding Institute, Seoul National University, Seoul, 151-742, Korea
| |
Collapse
|
33
|
|
34
|
Epigenetic regulation of retrotransposons within the nucleolus of Drosophila. Mol Cell Biol 2008; 28:6452-61. [PMID: 18678644 DOI: 10.1128/mcb.01015-08] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
R2 retrotransposable elements exclusively insert into a conserved region of the tandemly organized 28S rRNA genes. Despite inactivating a subset of these genes, R2 elements have persisted in the ribosomal DNA (rDNA) loci of insects for hundreds of millions of years. Controlling R2 proliferation was addressed in this study using lines of Drosophila simulans previously shown to have either active or inactive R2 retrotransposition. Lines with active retrotransposition were shown to have high R2 transcript levels, which nuclear run-on transcription experiments revealed were due to increased transcription of R2-inserted genes. Crosses between R2 active and inactive lines indicated that an important component of this transcriptional control is linked to or near the rDNA locus, with the R2 transcription level of the inactive parent being dominant. Pulsed-field gel analysis suggested that the R2 active and inactive states were determined by R2 distribution within the locus. Molecular and cytological analyses further suggested that the entire rDNA locus from the active line can be silenced in favor of the locus from the inactive line. This silencing of entire rDNA loci represents an example of the large-scale epigenetic control of transposable elements and shares features with the nucleolar dominance frequently seen in interspecies hybrids.
Collapse
|
35
|
Eickbush TH, Jamburuthugoda VK. The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res 2008; 134:221-34. [PMID: 18261821 DOI: 10.1016/j.virusres.2007.12.010] [Citation(s) in RCA: 171] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2007] [Revised: 12/14/2007] [Accepted: 12/14/2007] [Indexed: 11/30/2022]
Abstract
A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Four major classes of retrotransposons are described here. First, the long-terminal-repeat (LTR) retrotransposons have similar structures and mechanisms to those of the vertebrate retroviruses. Genes that may enable these retrotransposons to leave a cell have been acquired by these elements in a number of animal and plant lineages. Second, the tyrosine recombinase retrotransposons are similar to the LTR retrotransposons except that they have substituted a recombinase for the integrase and recombine into the host chromosomes. Third, the non-LTR retrotransposons use a cleaved chromosomal target site generated by an encoded endonuclease to prime reverse transcription. Finally, the Penelope-like retrotransposons are not well understood but appear to also use cleaved DNA or the ends of chromosomes as primer for reverse transcription. Described in the second part of this review are the enzymatic properties of the reverse transcriptases (RTs) encoded by retrotransposons. The RTs of the LTR retrotransposons are highly divergent in sequence but have similar enzymatic activities to those of retroviruses. The RTs of the non-LTR retrotransposons have several unique properties reflecting their adaptation to a different mechanism of retrotransposition.
Collapse
Affiliation(s)
- Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, NY 14627, USA.
| | | |
Collapse
|
36
|
DNA-directed DNA polymerase and strand displacement activity of the reverse transcriptase encoded by the R2 retrotransposon. J Mol Biol 2007; 374:322-33. [PMID: 17936300 DOI: 10.1016/j.jmb.2007.09.047] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2007] [Revised: 09/10/2007] [Accepted: 09/17/2007] [Indexed: 11/24/2022]
Abstract
R2 elements are non-long terminal repeat (non-LTR) retrotransposons with a single open reading-frame encoding reverse transcriptase, DNA endonuclease and nucleic acid-binding domains. The elements are specialized for insertion into the 28 S rRNA genes of many animal phyla. The R2-encoded activities initiate retrotransposition by sequence-specific cleavage of the 28 S gene target site and the utilization of the released DNA 3' end to prime reverse transcription (target primed reverse transcription). The activity of the R2 polymerase on RNA templates has been shown to differ from retroviral reverse transcriptases (RTs) in a number of properties. We demonstrate that the R2-RT is capable of efficiently utilizing single-stranded DNA (ssDNA) as a template. The processivity of the enzyme on ssDNA templates is higher than its processivity on RNA templates. This finding suggests that R2-RT is also capable of synthesizing the second DNA strand during retrotransposition. However, R2-RT lacks the RNAse H activity that is typically used by retroviral and LTR-retrotransposon RTs to remove the RNA strand before the first DNA strand is used as template. Remarkably, R2-RT can displace RNA strands that are annealed to ssDNA templates with essentially no loss of processivity. Such strand displacement activity is highly unusual for a DNA polymerase. Thus the single R2 protein contains all the activities needed to make a double-stranded DNA product from an RNA transcript. Finally, during these studies we found an unexpected property of the highly sequence-specific R2 endonuclease domain. The endonuclease can non-specifically cleave ssDNA at a junction with double-stranded DNA. This activity suggests that second-strand cleavage of the target site may not be sequence specific, but rather is specified by a single-stranded region generated when the first DNA strand is used to prime reverse transcription.
Collapse
|
37
|
Johansen SD, Haugen P, Nielsen H. Expression of protein-coding genes embedded in ribosomal DNA. Biol Chem 2007; 388:679-86. [PMID: 17570819 DOI: 10.1515/bc.2007.089] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Ribosomal DNA (rDNA) is a specialised chromosomal location that is dedicated to high-level transcription of ribosomal RNA genes. Interestingly, rDNAs are frequently interrupted by parasitic elements, some of which carry protein genes. These are non-LTR retrotransposons and group II introns that encode reverse transcriptase-like genes, and group I introns and archaeal introns that encode homing endonuclease genes (HEGs). Although rDNA-embedded protein genes are widespread in nuclei, organelles and bacteria, there is surprisingly little information available on how these genes are expressed. Exceptions include a handful of HEGs from group I introns. Recent studies have revealed unusual and essential roles of group I and group I-like ribozymes in the endogenous expression of HEGs. Here we discuss general aspects of rDNA-embedded protein genes and focus on HEG expression from group I introns in the nucleolus.
Collapse
Affiliation(s)
- Steinar D Johansen
- Department of Molecular Biotechnology, Institute of Medical Biology, University of Tromsø, Tromsø, Norway.
| | | | | |
Collapse
|
38
|
Abstract
Evolution of the tandemly repeated ribosomal RNA (rRNA) genes is intriguing because in each species all units within the array are highly uniform in sequence but that sequence differs between species. In this review we summarize the origins of the current models to explain this process of concerted evolution, emphasizing early studies of recombination in yeast and more recent studies in Drosophila and mammalian systems. These studies suggest that unequal crossover is the major driving force in the evolution of the rRNA genes with sister chromatid exchange occurring more often than exchange between homologs. Gene conversion is also believed to play a role; however, direct evidence for its involvement has not been obtained. Remarkably, concerted evolution is so well orchestrated that even transposable elements that insert into a large fraction of the rRNA genes appear to have little effect on the process. Finally, we summarize data that suggest that recombination in the rDNA locus of higher eukaryotes is sufficiently frequent to monitor changes within a few generations.
Collapse
Affiliation(s)
- Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, New York 14627, USA.
| | | |
Collapse
|