1
|
Rodríguez-Vargas A, Collins K. Distinct and overlapping RNA determinants for binding and target-primed reverse transcription by Bombyx mori R2 retrotransposon protein. Nucleic Acids Res 2024; 52:6571-6585. [PMID: 38499488 PMCID: PMC11194090 DOI: 10.1093/nar/gkae194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/08/2024] [Accepted: 03/09/2024] [Indexed: 03/20/2024] Open
Abstract
Eukaryotic retrotransposons encode a reverse transcriptase that binds RNA to template DNA synthesis. The ancestral non-long terminal repeat (non-LTR) retrotransposons encode a protein that performs target-primed reverse transcription (TPRT), in which the nicked genomic target site initiates complementary DNA (cDNA) synthesis directly into the genome. The best understood model system for biochemical studies of TPRT is the R2 protein from the silk moth Bombyx mori. The R2 protein selectively binds the 3' untranslated region of its encoding RNA as template for DNA insertion to its target site in 28S ribosomal DNA. Here, binding and TPRT assays define RNA contributions to RNA-protein interaction, template use for TPRT and the fidelity of template positioning for TPRT cDNA synthesis. We quantify both sequence and structure contributions to protein-RNA interaction. RNA determinants of binding affinity overlap but are not equivalent to RNA features required for TPRT and its fidelity of template positioning for full-length TPRT cDNA synthesis. Additionally, we show that a previously implicated RNA-binding protein surface of R2 protein makes RNA binding affinity dependent on the presence of two stem-loops. Our findings inform evolutionary relationships across R2 retrotransposon RNAs and are a step toward understanding the mechanism and template specificity of non-LTR retrotransposon mobility.
Collapse
Affiliation(s)
- Anthony Rodríguez-Vargas
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
2
|
Deng P, Tan SQ, Yang QY, Fu L, Wu Y, Zhu HZ, Sun L, Bao Z, Lin Y, Zhang QC, Wang H, Wang J, Liu JJG. Structural RNA components supervise the sequential DNA cleavage in R2 retrotransposon. Cell 2023; 186:2865-2879.e20. [PMID: 37301196 DOI: 10.1016/j.cell.2023.05.032] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 03/14/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023]
Abstract
Retroelements are the widespread jumping elements considered as major drivers for genome evolution, which can also be repurposed as gene-editing tools. Here, we determine the cryo-EM structures of eukaryotic R2 retrotransposon with ribosomal DNA target and regulatory RNAs. Combined with biochemical and sequencing analysis, we reveal two essential DNA regions, Drr and Dcr, required for recognition and cleavage. The association of 3' regulatory RNA with R2 protein accelerates the first-strand cleavage, blocks the second-strand cleavage, and initiates the reverse transcription starting from the 3'-tail. Removing 3' regulatory RNA by reverse transcription allows the association of 5' regulatory RNA and initiates the second-strand cleavage. Taken together, our work explains the DNA recognition and RNA supervised sequential retrotransposition mechanisms by R2 machinery, providing insights into the retrotransposon and application reprogramming.
Collapse
Affiliation(s)
- Pujuan Deng
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Shun-Qing Tan
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Qi-Yu Yang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Liangzheng Fu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yachao Wu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Han-Zhou Zhu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Sun
- Shandong Provincial Key Laboratory of Animal Cell and Developmental Biology, School of Life Sciences, Shandong University, Qingdao 266237, China
| | - Zhangbin Bao
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Yi Lin
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Qiangfeng Cliff Zhang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haoyi Wang
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jia Wang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| | - Jun-Jie Gogo Liu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
3
|
Khadgi BB, Govindaraju A, Christensen SM. Completion of LINE integration involves an open '4-way' branched DNA intermediate. Nucleic Acids Res 2019; 47:8708-8719. [PMID: 31392993 PMCID: PMC6895275 DOI: 10.1093/nar/gkz673] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 06/26/2019] [Accepted: 07/29/2019] [Indexed: 12/20/2022] Open
Abstract
Long Interspersed Elements (LINEs), also known as non-LTR retrotransposons, encode a multifunctional protein that reverse transcribes its mRNA into DNA at the site of insertion by target primed reverse transcription. The second half of the integration reaction remains very poorly understood. Second-strand DNA cleavage and second-strand DNA synthesis were investigated in vitro using purified components from a site-specific restriction-like endonuclease (RLE) bearing LINE. DNA structure was shown to be a critical component of second-strand DNA cleavage. A hitherto unknown and unexplored integration intermediate, an open ‘4-way’ DNA junction, was recognized by the element protein and cleaved in a Holliday junction resolvase-like reaction. Cleavage of the 4-way junction resulted in a natural primer-template pairing used for second-strand DNA synthesis. A new model for RLE LINE integration is presented.
Collapse
Affiliation(s)
- Brijesh B Khadgi
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Shawn M Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| |
Collapse
|
4
|
Su Y, Nichuguti N, Kuroki-Kami A, Fujiwara H. Sequence-specific retrotransposition of 28S rDNA-specific LINE R2Ol in human cells. RNA (NEW YORK, N.Y.) 2019; 25:1432-1438. [PMID: 31434792 PMCID: PMC6795142 DOI: 10.1261/rna.072512.119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Accepted: 08/12/2019] [Indexed: 06/10/2023]
Abstract
R2 is a long interspersed element (LINE) found in a specific sequence of the 28S rDNA among a wide variety of animals. Recently, we observed that R2Ol isolated from medaka fish, Oryzias latipes, retrotransposes sequence specifically into the target sequence of zebrafish. Because the 28S target and flanking regions are widely conserved among vertebrates, we examined whether R2Ol can also integrate in a sequence-specific manner in human cells. Using adenovirus-mediated expression of R2Ol constructs, we confirmed an accurate insertion of R2Ol into the 28S target of human 293T cells. However, the R2Ol mutant devoid of endonuclease (EN) activity showed no retrotransposition ability, suggesting that the sequence-specific integration of R2Ol into 28S rDNA occurs via the cleavage activity of EN. By introducing both R2Ol helper virus and donor plasmid in human cells, we succeeded in retrotransposing an exogenous EGFP gene into the 28S target site by the trans-complementation system, which enabled simplification of specific gene knock-in in a time-efficient manner. We believe that R2Ol may provide an alternative targeted gene knock-in method for practical applications such as gene therapy in future.
Collapse
Affiliation(s)
- Yuting Su
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Narisu Nichuguti
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Azusa Kuroki-Kami
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Haruhiko Fujiwara
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| |
Collapse
|
5
|
Kuroki-Kami A, Nichuguti N, Yatabe H, Mizuno S, Kawamura S, Fujiwara H. Targeted gene knockin in zebrafish using the 28S rDNA-specific non-LTR-retrotransposon R2Ol. Mob DNA 2019; 10:23. [PMID: 31139267 PMCID: PMC6530143 DOI: 10.1186/s13100-019-0167-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 05/13/2019] [Indexed: 02/07/2023] Open
Abstract
Background Although most of long interspersed elements (LINEs), one class of non-LTR-retrotransposons, are integrated into the host genome randomely, some elements are retrotransposed into the specific sequences of the genomic regions, such as rRNA gene (rDNA) clusters, telomeric repeats and other repetitive sequenes. Most of the sequence-specific LINEs have been reported mainly among invertebrate species and shown to retrotranspose into the specific sequences in vivo and in vitro systems. Recenlty, 28S rDNA-specific LINE R2 elements are shown to be distributed among widespread vertebrate species, but the sequence-specific retrotransposition of R2 has never been demonstrated in vertebrates. Results Here we cloned a full length unit of R2 from medaka fish Oryzias latipes, named R2Ol, and engineered it to a targeted gene integration tool in zebrafish. By injecting R2Ol-encoding mRNA into zebrafish embryos, R2Ol retrotransposed precisely into the target site at high efficiency (98%) and was transmitted to the next generation at high frequency (50%). We also generated transgenic zebrafish carrying the enhanced green fluorescent protein (EGFP) reporter gene in 28S rDNA target by the R2Ol retrotransposition system. Conclusions Sequence-specific LINE retrotransposes into the precise sequence using target primed reverse transcription (TPRT), possibly providing an alternative and effective targeted gene knockin method in vertebrates. Electronic supplementary material The online version of this article (10.1186/s13100-019-0167-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Azusa Kuroki-Kami
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| | - Narisu Nichuguti
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| | - Haruka Yatabe
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| | - Sayaka Mizuno
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| | - Shoji Kawamura
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| | - Haruhiko Fujiwara
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Bioscience Bldg., Kashiwanoha 5-1-5, Kashiwa, Chiba, 277-8562 Japan
| |
Collapse
|
6
|
Abstract
Retrotransposons have generated about 40 % of the human genome. This review examines the strategies the cell has evolved to coexist with these genomic "parasites", focussing on the non-long terminal repeat retrotransposons of humans and mice. Some of the restriction factors for retrotransposition, including the APOBECs, MOV10, RNASEL, SAMHD1, TREX1, and ZAP, also limit replication of retroviruses, including HIV, and are part of the intrinsic immune system of the cell. Many of these proteins act in the cytoplasm to degrade retroelement RNA or inhibit its translation. Some factors act in the nucleus and involve DNA repair enzymes or epigenetic processes of DNA methylation and histone modification. RISC and piRNA pathway proteins protect the germline. Retrotransposon control is relaxed in some cell types, such as neurons in the brain, stem cells, and in certain types of disease and cancer, with implications for human health and disease. This review also considers potential pitfalls in interpreting retrotransposon-related data, as well as issues to consider for future research.
Collapse
Affiliation(s)
- John L. Goodier
- McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD USA 212051
| |
Collapse
|
7
|
Abstract
R2 elements are sequence specific non-LTR retrotransposons that exclusively insert in the 28S rRNA genes of animals. R2s encode an endonuclease that cleaves the insertion site and a reverse transcriptase that uses the cleaved DNA to prime reverse transcription of the R2 transcript, a process termed target primed reverse transcription. Additional unusual properties of the reverse transcriptase as well as DNA and RNA binding domains of the R2 encoded protein have been characterized. R2 expression is through co-transcription with the 28S gene and self-cleavage by a ribozyme encoded at the R2 5' end. Studies in laboratory stocks and natural populations of Drosophila suggest that R2 expression is tied to the distribution of R2-inserted units within the rDNA locus. Most individuals have no R2 expression because only a small fraction of their rRNA genes need to be active, and a contiguous region of the locus free of R2 insertions can be selected for activation. However, if the R2-free region is not large enough to produce sufficient rRNA, flanking units - including those inserted with R2 - must be activated. Finally, R2 copies rapidly turnover within the rDNA locus, yet R2 has been vertically maintained in animal lineages for hundreds of millions of years. The key to this stability is R2's ability to remain dormant in rDNA units outside the transcribed regions for generations until the stochastic nature of the crossovers that drive the concerted evolution of the rDNA locus inevitably reshuffle the inserted and uninserted units, resulting in transcription of the R2-inserted units.
Collapse
|
8
|
Abstract
Eukaryotic genomes are colonized by various transposons including short interspersed elements (SINEs). The 5' region (head) of the majority of SINEs is derived from one of the three types of RNA genes--7SL RNA, transfer RNA (tRNA), or 5S ribosomal RNA (rRNA)--and the internal promoter inside the head promotes the transcription of the entire SINEs. Here I report a new group of SINEs whose heads originate from either the U1 or U2 small nuclear RNA gene. These SINEs, named SINEU, are distributed among crocodilians and classified into three families. The structures of the SINEU-1 subfamilies indicate the recurrent addition of a U1- or U2-derived sequence onto the 5' end of SINEU-1 elements. SINEU-1 and SINEU-3 are ancient and shared among alligators, crocodiles, and gharials, while SINEU-2 is absent in the alligator genome. SINEU-2 is the only SINE family that was active after the split of crocodiles and gharials. All SINEU families, especially SINEU-3, are preferentially inserted into a family of Mariner DNA transposon, Mariner-N4_AMi. A group of Tx1 non-long terminal repeat retrotransposons designated Tx1-Mar also show target preference for Mariner-N4_AMi, indicating that SINEU was mobilized by Tx1-Mar.
Collapse
|
9
|
Eickbush DG, Burke WD, Eickbush TH. Evolution of the R2 retrotransposon ribozyme and its self-cleavage site. PLoS One 2013; 8:e66441. [PMID: 24066021 PMCID: PMC3774820 DOI: 10.1371/journal.pone.0066441] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 05/07/2013] [Indexed: 12/23/2022] Open
Abstract
R2 is a non-long terminal repeat retrotransposon that inserts site-specifically in the tandem 28S rRNA genes of many animals. Previously, R2 RNA from various species of Drosophila was shown to self-cleave from the 28S rRNA/R2 co-transcript by a hepatitis D virus (HDV)-like ribozyme encoded at its 5' end. RNA cleavage was at the precise 5' junction of the element with the 28S gene. Here we report that RNAs encompassing the 5' ends of R2 elements from throughout its species range fold into HDV-like ribozymes. In vitro assays of RNA self-cleavage conducted in many R2 lineages confirmed activity. For many R2s, RNA self-cleavage was not at the 5' end of the element but at 28S rRNA sequences up to 36 nucleotides upstream of the junction. The location of cleavage correlated well with the types of endogenous R2 5' junctions from different species. R2 5' junctions were uniform for most R2s in which RNA cleavage was upstream in the rRNA sequences. The 28S sequences remaining on the first DNA strand synthesized during retrotransposition are postulated to anneal to the target site and uniformly prime second strand DNA synthesis. In species where RNA cleavage occurred at the R2 5' end, the 5' junctions were variable. This junction variation is postulated to result from the priming of second strand DNA synthesis by chance microhomologies between the target site and the first DNA strand. Finally, features of R2 ribozyme evolution, especially changes in cleavage site and convergence on the same active site sequences, are discussed.
Collapse
Affiliation(s)
- Danna G. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - William D. Burke
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Thomas H. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
- * E-mail:
| |
Collapse
|
10
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
11
|
Singh DK, Rath PC. Long interspersed nuclear elements (LINEs) show tissue-specific, mosaic genome and methylation-unrestricted, widespread expression of noncoding RNAs in somatic tissues of the rat. RNA Biol 2012; 9:1380-96. [PMID: 23064113 DOI: 10.4161/rna.22402] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
We report strong somatic and germ line expression of LINE RNAs in eight different tissues of rat by using a novel ~2.8 kb genomic PstI-LINE DNA (P1-LINE) isolated from the rat brain. P1-LINE is present in a 93 kb LINE-SINE-cluster in sub-telomeric region of chromosome 12 (12p12) and as multiple truncated copies interspersed in all rat chromosomes. P1-LINEs occur as inverted repeats at multiple genomic loci in tissue-specific and mosaic patterns. P1-LINE RNAs are strongly expressed in brain, liver, lungs, heart, kidney, testes, spleen and thymus into large to small heterogeneous RNAs (~5.0 to 0.2 kb) in tissue-specific and dynamic patterns in individual rats. P1-LINE DNA is strongly methylated at CpG-dinucleotides in most genomic copies in all the tissues and weakly hypomethylated in few copies in some tissues. Small (700-75 nt) P1-LINE RNAs expressed in all tissues may be possible precursors for small regulatory RNAs (PIWI-interacting/piRNAs) bioinformatically derived from P1-LINE. The strong and dynamic expression of LINE RNAs from multiple chromosomal loci and the putative piRNAs in somatic tissues of rat under normal physiological conditions may define functional chromosomal domains marked by LINE RNAs as long noncoding RNAs (lncRNAs) unrestricted by DNA methylation. The tissue-specific, dynamic RNA expression and mosaic genomic distribution of LINEs representing a steady-state genomic flux of retrotransposon RNAs suggest for biological role of LINE RNAs as long ncRNAs and small piRNAs in mammalian tissues independent of their cellular fate for translation, reverse-transcription and retrotransposition. This may provide evolutionary advantages to LINEs and mammalian genomes.
Collapse
Affiliation(s)
- Deepak K Singh
- Molecular Biology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi, India
| | | |
Collapse
|
12
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
13
|
The R2 mobile element of Rhynchosciara americana: molecular, cytological and dynamic aspects. Chromosome Res 2009; 17:455-67. [PMID: 19350401 DOI: 10.1007/s10577-009-9038-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2008] [Revised: 03/03/2009] [Accepted: 03/03/2009] [Indexed: 10/20/2022]
Abstract
Ribosomal RNA genes are encoded by large units clustered (18S, 5S, and 28S) in the nucleolar organizer region in several organisms. Sometimes additional insertions are present in the coding region for the 28S rDNA. These insertions are specific non-long terminal repeat retrotransposons that have very restricted integration targets within the genome. The retrotransposon present in the genome of Rhynchosciara americana, RaR2, was isolated by the screening of a genomic library. Sequence analysis showed the presence of conserved regions, such as a reverse transcriptase domain and a zinc finger motif in the amino terminal region. The insertion site was highly conserved in R. americana and a phylogenetic analysis showed that this element belongs to the R2 clade. The chromosomal localization confirmed that the RaR2 mobile element was inserted into a specific site in the rDNA gene. The expression level of RaR2 in salivary glands during larval development was determined by quantitative RT-PCR, and the increase of relative expression in the 3P of the fourth instar larval could be related to intense gene activity characteristic of this stage. 5'-Truncated elements were identified in different DNA samples. Additionally, in three other Rhynchosciara species, the R2 element was present as a full-length element.
Collapse
|
14
|
Guo H, Tse LV, Barbalat R, Sivaamnuaiphorn S, Xu M, Doulatov S, Miller JF. Diversity-generating retroelement homing regenerates target sequences for repeated rounds of codon rewriting and protein diversification. Mol Cell 2008; 31:813-23. [PMID: 18922465 DOI: 10.1016/j.molcel.2008.07.022] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Revised: 06/19/2008] [Accepted: 07/01/2008] [Indexed: 10/21/2022]
Abstract
Diversity-generating retroelements (DGRs) introduce vast amounts of sequence diversity into target genes. During mutagenic homing, adenine residues are converted to random nucleotides in a unidirectional, reverse transcriptase-dependent transposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using a Bordetella bacteriophage DGR as a model, we demonstrate that homing occurs through a TR-containing RNA intermediate and is RecA independent. Marker transfer studies show that cDNA integration at the 3' end of VR occurs within a (G/C)(14) element, and deletion analysis demonstrates that the reaction is independent of 5' end cDNA integration. cDNA integration at the 5' end of VR requires only short stretches of sequence homology. We propose that homing occurs through a unique target DNA-primed reverse transcription mechanism that precisely regenerates target sequences. This nonproliferative "copy and replace" mechanism enables repeated rounds of protein diversification and optimization of ligand-receptor interactions.
Collapse
Affiliation(s)
- Huatao Guo
- Department of Microbiology, Immunology, and Molecular Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | | | | | | | | | | | | |
Collapse
|
15
|
Hart JM, Kennedy SD, Mathews DH, Turner DH. NMR-assisted prediction of RNA secondary structure: identification of a probable pseudoknot in the coding region of an R2 retrotransposon. J Am Chem Soc 2008; 130:10233-9. [PMID: 18613678 PMCID: PMC2646634 DOI: 10.1021/ja8026696] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2008] [Indexed: 12/30/2022]
Abstract
As the rate of functional RNA sequence discovery escalates, high-throughput techniques for reliable structural determination are becoming crucial for revealing the essential features of these RNAs in a timely fashion. Computational predictions of RNA secondary structure quickly generate reasonable models but suffer from several approximations, including overly simplified models and incomplete knowledge of significant interactions. Similar problems limit the accuracy of predictions for other self-folding polymers, including DNA and peptide nucleic acid (PNA). The work presented here demonstrates that incorporating unassigned data from simple nuclear magnetic resonance (NMR) experiments into a dynamic folding algorithm greatly reduces the potential folding space of a given RNA and therefore increases the confidence and accuracy of modeling. This procedure has been packaged into an NMR-assisted prediction of secondary structure (NAPSS) algorithm that can produce pseudoknotted as well as non-pseudoknotted secondary structures. The method reveals a probable pseudoknot in the part of the coding region of the R2 retrotransposon from Bombyx mori that orchestrates second-strand DNA cleavage during insertion into the genome.
Collapse
Affiliation(s)
- James M Hart
- Department of Chemistry, University of Rochester, RC Box 270216, Rochester, New York 14627, USA
| | | | | | | |
Collapse
|
16
|
Epigenetic regulation of retrotransposons within the nucleolus of Drosophila. Mol Cell Biol 2008; 28:6452-61. [PMID: 18678644 DOI: 10.1128/mcb.01015-08] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
R2 retrotransposable elements exclusively insert into a conserved region of the tandemly organized 28S rRNA genes. Despite inactivating a subset of these genes, R2 elements have persisted in the ribosomal DNA (rDNA) loci of insects for hundreds of millions of years. Controlling R2 proliferation was addressed in this study using lines of Drosophila simulans previously shown to have either active or inactive R2 retrotransposition. Lines with active retrotransposition were shown to have high R2 transcript levels, which nuclear run-on transcription experiments revealed were due to increased transcription of R2-inserted genes. Crosses between R2 active and inactive lines indicated that an important component of this transcriptional control is linked to or near the rDNA locus, with the R2 transcription level of the inactive parent being dominant. Pulsed-field gel analysis suggested that the R2 active and inactive states were determined by R2 distribution within the locus. Molecular and cytological analyses further suggested that the entire rDNA locus from the active line can be silenced in favor of the locus from the inactive line. This silencing of entire rDNA loci represents an example of the large-scale epigenetic control of transposable elements and shares features with the nucleolar dominance frequently seen in interspecies hybrids.
Collapse
|
17
|
Patrick KL, Luz PM, Ruan JP, Shi H, Ullu E, Tschudi C. Genomic rearrangements and transcriptional analysis of the spliced leader-associated retrotransposon in RNA interference-deficient Trypanosoma brucei. Mol Microbiol 2007; 67:435-47. [PMID: 18067542 DOI: 10.1111/j.1365-2958.2007.06057.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Trypanosoma brucei genome is colonized by the site-specific non-LTR retrotransposon SLACS, or spliced leader-associated conserved sequence, which integrates exclusively into the spliced leader (SL) RNA genes. Although there is evidence that the RNA interference (RNAi) machinery regulates SLACS transcript levels, we do not know whether RNAi deficiency affects the genomic stability of SLACS, nor do we understand the mechanism of SLACS transcription. Here, we report that prolonged culturing of RNAi-deficient T. brucei cells, but not wild-type cells, results in genomic rearrangements of SLACS. Furthermore, two populations of SLACS transcripts persist in RNAi-deficient cells: a full-length transcript of approximately 7 kb and a heterogeneous population of small SLACS transcripts ranging in size from 450 to 550 nt. We provide evidence that SLACS transcription initiates at the +1 of the interrupted SL RNA gene and proceeds into the 5' UTR and open reading frame 1 (ORF1). This transcription is carried out by an RNA polymerase with alpha-amanitin sensitivity reminiscent of SL RNA synthesis and is dependent on the SL RNA promoter. Additionally, we show that both sense and antisense small SLACS transcripts originate from ORF1 and that they are associated with proteins in vivo. We speculate that the small SLACS transcripts serve as substrates for the production of siRNAs to regulate SLACS expression.
Collapse
Affiliation(s)
- Kristin L Patrick
- Department of Epidemiology and Public Health, Yale University Medical School, 295 Congress Avenue, New Haven, CT 06536-0812, USA
| | | | | | | | | | | |
Collapse
|
18
|
Abstract
Long interspersed nucleotide element (LINE)-1 retrotransposon (L1) has emerged as the largest contributor to mammalian genome mass, responsible for over 35% of the human genome. Differences in the number and activity levels of L1s contribute to interindividual variation in humans, both by affecting an individual's likelihood of acquiring new L1-mediated mutations, as well as by differentially modifying gene expression. Here, we report on recent progress in understanding L1 biology, with a focus on mechanisms of L1-mediated disease. We discuss known details of L1 life cycle, including L1 structure, transcriptional regulation, and the mechanisms of translation and retrotransposition. Current views on cell type specificity, timing, and control of retrotransposition are put forth. Finally, we discuss the role of L1 as a mutagen, using the latest findings in L1 biology to illuminate molecular mechanisms of L1-mediated gene disruption.
Collapse
Affiliation(s)
- Daria V Babushok
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104-6145, USA
| | | |
Collapse
|
19
|
Honda H, Ichiyanagi K, Suzuki J, Ono T, Koyama H, Kajikawa M, Okada N. A new system for analyzing LINE retrotransposition in the chicken DT40 cell line widely used for reverse genetics. Gene 2007; 395:116-24. [PMID: 17434692 DOI: 10.1016/j.gene.2007.02.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2006] [Revised: 02/14/2007] [Accepted: 02/19/2007] [Indexed: 11/15/2022]
Abstract
Long interspersed elements (LINEs) are autonomous transposable elements that proliferate via retrotransposition, which involves reverse transcription of LINE RNAs. It is anticipated that LINE retrotransposition requires both LINE-encoded proteins and host-encoded proteins. However, identification of the host factors, their roles, and the steps at which they act on retrotransposition are poorly understood because of the lack of an appropriate genetic system to study LINE retrotransposition in a series of mutant hosts. To construct such a genetic system, we applied the retrotransposition-indicative cassette method to DT40 cells, a chicken cell line for which a variety of isogenic mutants have been established by gene targeting. Because DT40 cells are non-adherent, we utilized a selective soft agarose medium to allow the formation of colonies of cells that had undergone LINE retrotransposition. Colony formation was completely dependent on the activities of the LINE-encoded proteins and on the presence of the essential 3' region of the LINE RNA. Moreover, the selected colonies indeed carried retrotransposed LINE copies in their chromosomes, with integration features similar to those of genomic (native) LINE copies. This method thus allows the authentic selection of LINE-retrotransposed cells and the approximate recapitulation of retrotransposition events that occur in nature. Therefore, the DT40 cell system established here provides a powerful tool for the elucidation of LINE retrotransposition pathways, the host factors involved, and their roles.
Collapse
Affiliation(s)
- Hiroshi Honda
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | | | | | | | |
Collapse
|
20
|
Targeted expression of vascular endothelial growth factor 165 in the hrDNA locus mediated by hrDNA targeting vector. Chin Med J (Engl) 2007. [DOI: 10.1097/00029330-200703010-00016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
|
21
|
Babushok DV, Ostertag EM, Courtney CE, Choi JM, Kazazian HH. L1 integration in a transgenic mouse model. Genome Res 2005; 16:240-50. [PMID: 16365384 PMCID: PMC1361720 DOI: 10.1101/gr.4571606] [Citation(s) in RCA: 90] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
To study integration of the human LINE-1 retrotransposon (L1) in vivo, we developed a transgenic mouse model of L1 retrotransposition that displays de novo somatic L1 insertions at a high frequency, occasionally several insertions per mouse. We mapped 3' integration sites of 51 insertions by Thermal Asymmetric Interlaced PCR (TAIL-PCR). Analysis of integration locations revealed a broad genomic distribution with a modest preference for intergenic regions. We characterized the complete structures of 33 de novo retrotransposition events. Our results highlight the large number of highly truncated L1s, as over 52% (27/51) of total integrants were <1/3 the length of a full-length element. New integrants carry all structural characteristics typical of genomic L1s, including a number with inversions, deletions, and 5'-end microhomologies to the target DNA sequence. Notably, at least 13% (7/51) of all insertions contain a short stretch of extra nucleotides at their 5' end, which we postulate result from template-jumping by the L1-encoded reverse transcriptase. We propose a unified model of L1 integration that explains all of the characteristic features of L1 retrotransposition, such as 5' truncations, inversions, extra nucleotide additions, and 5' boundary and inversion point microhomologies.
Collapse
Affiliation(s)
- Daria V Babushok
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | | | | | | |
Collapse
|
22
|
Ye J, Pérez-González CE, Eickbush DG, Eickbush TH. Competition between R1 and R2 transposable elements in the 28S rRNA genes of insects. Cytogenet Genome Res 2005; 110:299-306. [PMID: 16093682 DOI: 10.1159/000084962] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2003] [Accepted: 01/13/2004] [Indexed: 11/19/2022] Open
Abstract
R1 and R2 are non-LTR retrotransposons that insert in the 28S rRNA genes of arthropods. R1 elements insert into a site that is 74 bp downstream of the R2 insertion site, thus the presence of an R2 in the same 28S gene may inhibit the expression of R1. Consistent with such a suggestion, the R1 elements of Drosophila melanogaster have a strong bias against inserting into 28S genes already containing an R2 element. R2 elements, on the other hand, are only 2-3 fold inhibited from inserting into a 28S gene already containing an R1. D. melanogaster R1 elements are unusual in that they generate a 23-bp deletion of the target site upstream of the insertion. Using in vitro assays developed to study R2 integration, we show that the presence of R1 sequences 51 bp downstream of the R2 insertion site changes the nucleosomal structure that can be formed by the R2 target site. The R2 endonuclease is inhibited from cleaving these altered nucleosomes. We suggest that R1 elements have been selected to make this large deletion of the 28S gene to block the insertion of an upstream R2 element. These findings are consistent with the model that R1 and R2 are in competition for the limited number of insertion sites available within their host's genome.
Collapse
Affiliation(s)
- J Ye
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| | | | | | | |
Collapse
|
23
|
Bunikis J, Barbour AG. Ticks have R2 retrotransposons but not the consensus transposon target site of other arthropods. INSECT MOLECULAR BIOLOGY 2005; 14:465-74. [PMID: 16164602 DOI: 10.1111/j.1365-2583.2005.00577.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Some copies of the large subunit rRNA genes (LSU rDNA) of most arthropods studied to date are inactivated by R-element retrotransposons at a specific target region that is highly conserved in sequence across all kingdoms of organisms. Here we report finding R2 elements in low copy numbers in the LSU rDNA of hard and soft ticks. Although the elements were inserted at the same LSU rDNA location as in insects, there were substitutions in the consensus R2 endonuclease cleavage site in the ticks and some other parasitiform mites. The substituted region comprises a critical contact point with small subunit rRNA, but in vitro structure probing analysis revealed novel, presumably stabilizing base-pairing.
Collapse
Affiliation(s)
- J Bunikis
- Department of Microbiology, University of California Irvine, Irvine, CA 92697-4025, USA.
| | | |
Collapse
|
24
|
Kojima KK, Fujiwara H. Long-term inheritance of the 28S rDNA-specific retrotransposon R2. Mol Biol Evol 2005; 22:2157-65. [PMID: 16014872 DOI: 10.1093/molbev/msi210] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
R2 is a non-long-terminal-repeat (LTR) retrotransposon that inserts specifically into 28S rDNA. R2 has been identified in many species of arthropods and three species of chordates. R2 may be even more widely distributed in animals, and its origin may be traceable to early animal evolution. In this study, we identified R2 elements in medaka fish, White Cloud Mountain minnow, Reeves' turtle, hagfish, sea lilies, and some arthropod species, using degenerate polymerase chain reaction methods. We also identified two R2 elements from the public genomic sequence database of the bloodfluke Schistosoma mansoni. One of the two bloodfluke R2 elements has two zinc-finger motifs at the N-terminus; this differs from other known R2 elements, which have one or three zinc-finger motifs. Phylogenetic analysis revealed that the whole phylogeny of R2 can be divided into 11 parts (subclades), in which the local R2 phylogeny and the corresponding host phylogeny are consistent. Divergence-versus-age analysis revealed that there is no reliable evidence for the horizontal transfer of R2 but supports the proposition that R2 has been vertically transferred since before the divergence of the deuterostomes and protostomes. The seeming inconsistency between the R2 phylogeny and the phylogeny of their hosts is due to the existence of paralogous lineages. The number of N-terminal zinc-finger motifs is consistent with the deep phylogeny of R2 and indicates that the common ancestor of R2 had three zinc-finger motifs at the N-terminus. This study revealed the long-term vertical inheritance and the ancient origin of sequence specificity of R2, both of which seem applicable to some other non-LTR retrotransposons.
Collapse
Affiliation(s)
- Kenji K Kojima
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Chiba, Japan
| | | |
Collapse
|
25
|
Anzai T, Osanai M, Hamada M, Fujiwara H. Functional roles of 3'-terminal structures of template RNA during in vivo retrotransposition of non-LTR retrotransposon, R1Bm. Nucleic Acids Res 2005; 33:1993-2002. [PMID: 15814816 PMCID: PMC1074724 DOI: 10.1093/nar/gki347] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
R1Bm is a non-LTR retrotransposon found specifically within 28S rRNA genes of the silkworm. Different from other non-LTR retrotransposons encoding two open reading frames (ORFs), R1Bm structurally lacks a poly (A) tract at its 3' end. To study how R1Bm initiates reverse transcription from the poly (A)-less template RNA, we established an in vivo retrotransposition system using recombinant baculovirus, and characterized retrotransposition activities of R1Bm. Target-primed reverse transcription (TPRT) of R1Bm occurred from the cleavage site generated by endonuclease (EN). The 147 bp of 3'-untranslated region (3'UTR) was essential for efficient retrotransposition of R1Bm. Even using the complete R1Bm element, however, reverse transcription started from various sites of the template RNA mostly with 5'-UG-3' or 5'-UGU-3' at their 3' ends, which are presumably base-paired with 3' end of the EN-digested 28S rDNA target sequence, 5'-AGTAGATAGGGACA-3'. When the downstream sequence of 28S rDNA target was added to the 3' end of R1 unit, reverse transcription started exactly from the 3' end of 3'UTR and retrotransposition efficiency increased. These results indicate that 3'-terminal structure of template RNA including read-through region interacts with its target rDNA sequences of R1Bm, which plays important roles in initial process of TPRT in vivo.
Collapse
Affiliation(s)
| | | | | | - Haruhiko Fujiwara
- To whom correspondence should be addressed. Tel: +81 4 7136 3659; Fax: +81 4 7136 3660;
| |
Collapse
|
26
|
Pérez-González CE, Burke WD, Eickbush TH. R1 and R2 retrotransposition and deletion in the rDNA loci on the X and Y chromosomes of Drosophila melanogaster. Genetics 2004; 165:675-85. [PMID: 14573479 PMCID: PMC1462780 DOI: 10.1093/genetics/165.2.675] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The non-LTR retrotransposons R1 and R2 insert into the 28S rRNA genes of arthropods. Comparisons among Drosophila lineages have shown that these elements are vertically inherited, while studies within species have indicated a rapid turnover of individual copies (elimination of old copies and the insertion of new copies). To better understand the turnover of R1 and R2, 200 retrotranspositions and nearly 100 eliminations have been scored in the Harwich mutation-accumulation lines of Drosophila melanogaster. Because the rDNA arrays in D. melanogaster are present on the X and Y chromosomes and no exchanges were detected in these lines, it was possible to show that R1 retrotranspositions occur predominantly in the male germ line, while R2 retrotranspositions were more evenly divided between the germ lines of both sexes. The rate of elimination of elements from the Y rDNA array was twice that of the X rDNA array with both chromosomal loci containing regions where the rate of elimination was on average eight times higher. Most R1 and R2 eliminations appear to occur by large intrachromosomal events (i.e., loop-out events) that involve multiple rDNA units. These findings are interpreted in light of the known abundance of R1 and R2 elements in the X and Y rDNA loci of D. melanogaster.
Collapse
|
27
|
Ruschak AM, Mathews DH, Bibillo A, Spinelli SL, Childs JL, Eickbush TH, Turner DH. Secondary structure models of the 3' untranslated regions of diverse R2 RNAs. RNA (NEW YORK, N.Y.) 2004; 10:978-87. [PMID: 15146081 PMCID: PMC1370589 DOI: 10.1261/rna.5216204] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2003] [Accepted: 03/10/2004] [Indexed: 05/19/2023]
Abstract
The RNA structure of the 3' untranslated region (UTR) of the R2 retrotransposable element is recognized by the R2-encoded reverse transcriptase in a reaction called target primed reverse transcription (TPRT). To provide insight into structure-function relationships important for TPRT, we have created alignments that reveal the secondary structure for 22 Drosophila and five silkmoth 3' UTR R2 sequences. In addition, free energy minimization has been used to predict the secondary structure for the 3' UTR R2 RNA of Forficula auricularia. The predicted structures for Bombyx mori and F. auricularia are consistent with chemical modification data obtained with beta-ethoxy-alpha-ketobutyraldehyde (kethoxal), dimethyl sulfate, and 1-cyclohexyl-3-(2-morpholinoethyl)carbodiimide metho-p-toluene sulfonate. The structures appear to have common helices that are likely important for function.
Collapse
Affiliation(s)
- Amy M Ruschak
- Department of Chemistry, University of Rochester, Rochester, New York 14627-0216, USA
| | | | | | | | | | | | | |
Collapse
|
28
|
Christensen S, Eickbush TH. Footprint of the retrotransposon R2Bm protein on its target site before and after cleavage. J Mol Biol 2004; 336:1035-45. [PMID: 15037067 DOI: 10.1016/j.jmb.2003.12.077] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2003] [Revised: 12/23/2003] [Accepted: 12/24/2003] [Indexed: 10/26/2022]
Abstract
R2 elements are non-long terminal repeat (non-LTR) retrotransposons that specifically integrate into the 28 S rRNA genes of their host. These elements encode a single open reading frame with a genome-specific endonuclease and a reverse transcriptase that uses the cleaved chromosomal target site to prime reverse transcription. Cleavage of the DNA strand that is used to prime reverse transcription is an efficient process that occurs in the presence or absence of RNA. Cleavage of the second DNA strand is much less efficient and requires RNA. Reverse transcription occurs before second strand cleavage and only if the RNA bound to the protein contains the 3' untranslated region of the R2 element. Thus a complex series of protein interactions with the DNA and conformational changes in the protein are likely to occur during this retrotransposition reaction. Here, we conduct electrophoretic mobility-shift assays and DNase I footprint studies on the binding of the R2 protein to the DNA target in the presence and absence of RNA both before and after first strand cleavage. While the total expanse of the protein footprint on the DNA eventually covers five helical turns, before cleavage the footprint only extends from 17 bp to 40 bp upstream of the cleavage site. This footprint is the same in the presence and absence of RNA. We hypothesize that the active site of the endonuclease domain is analogous to type IIS restriction enzymes in that it is located on a flexible domain that is not tightly bound to the cleavage site. After first strand cleavage the protein footprint extends beyond the cleavage site. We suggest that this increased protection after cleavage is the RT domain that is positioned over the free DNA end to begin reverse transcription on the nicked DNA substrate.
Collapse
Affiliation(s)
- Shawn Christensen
- Department of Biology, University of Rochester, Hutchinson Hall 334, Rochester, NY 14627-0211, USA
| | | |
Collapse
|
29
|
Fujimoto H, Hirukawa Y, Tani H, Matsuura Y, Hashido K, Tsuchida K, Takada N, Kobayashi M, Maekawa H. Integration of the 5' end of the retrotransposon, R2Bm, can be complemented by homologous recombination. Nucleic Acids Res 2004; 32:1555-65. [PMID: 14999096 PMCID: PMC390292 DOI: 10.1093/nar/gkh304] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
R2Bm is a non-long-terminal-repeat (non-LTR) retrotransposon that was identified at a specific target site in the 28S rRNA genes of the silkworm, Bombyx mori. Although in vitro analysis has revealed that the 3' end of R2Bm is integrated into the target site by means of target-primed reverse transcription (TPRT), the mechanism of the 5' end integration is not well understood. We established a novel in vivo system to assay the insertion mechanism of R2Bm using a cultured cell line, C65, and a baculovirus, AcNPV, as host and vector, respectively. The 3' end of R2Bm integrated at the target site in the rRNA genes of C65 cells when an AcNPV containing both the full-length 3' UTR and the entire open reading frame (ORF) of R2Bm was introduced while the 5' end integration was incorrect. The 5' end of R2Bm was integrated, however, when the 28S gene sequence upstream of the R2Bm target site was added to the R2Bm sequence. Thus, in our assay, homologous sequences were likely essential for the successful integration of the entire R2Bm into the host cell genome. We also demonstrated that the failure to integrate caused by a frame-shifted ORF was rescued by co-infection with a helper virus that contained only the R2Bm ORF. This indicates that R2 retrotransposition can be complemented in trans. These findings suggest that the host's mechanism for DNA repair may be necessary for the integration of the 5' end of R2Bm and that R2Bm protein may only have the ability to integrate the 3' end of the element by TPRT.
Collapse
Affiliation(s)
- Hirofumi Fujimoto
- Division of Radiological Protection and Biology, National Institute of Infectious Diseases, Toyama 1-23-1, Shinjuku-ku, Tokyo 162-8640, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Eickbush DG, Eickbush TH. Transcription of endogenous and exogenous R2 elements in the rRNA gene locus of Drosophila melanogaster. Mol Cell Biol 2003; 23:3825-36. [PMID: 12748285 PMCID: PMC155226 DOI: 10.1128/mcb.23.11.3825-3836.2003] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
R2 retrotransposons insert into the rRNA-encoding units (rDNA units) that form the nucleoli of insects. We have utilized an R2 integration system in Drosophila melanogaster to study transcription of foreign sequences integrated into the R2 target site of the 28S rRNA genes. The exogenous sequences were cotranscribed at dramatically different levels which closely paralleled the level of transcription of the endogenous R1 and R2 elements. Transcription levels were inversely correlated with the number of uninserted rDNA units, variation in this number having been brought about by the R2 integration system itself. Females with as few as 20 uninserted rDNA units per X chromosome had expression levels of endogenous and exogenous insertion sequences that were 2 orders of magnitude higher than lines that contained over 80 uninserted rDNA units per chromosome. R2 insertions only 167 bp in length exhibited this range of transcriptional regulation. Analysis of transcript levels in males suggested R2 insertions on the Y chromosome are not down-regulated to the same extent as insertions on the X chromosome. These results suggest that transcription of the rDNA units can be tightly regulated, but this regulation gradually breaks down as the cell approaches the minimum number of uninserted genes needed for survival.
Collapse
Affiliation(s)
- Danna G Eickbush
- Department of Biology, University of Rochester, Rochester, New York 146270-0211, USA
| | | |
Collapse
|
31
|
Abstract
Most active non-LTR (long terminal repeat) retrotransposons carry two open reading frames (ORFs) encoding ORF1p and ORF2p proteins. The ORF2p proteins are relatively well studied and are known to contain endonuclease/reverse transcriptase domains. At the same time, the biological function of ORF1p proteins remains poorly understood, except in that they nonspecifically bind single-stranded mRNA/DNA molecules. CR1-like elements form the most widely distributed clade/superfamily of non-LTR retrotransposons. We found that ORF1p proteins encoded by diverse CR1-like elements contain conserved esterase domain (ES) or plant homeodomain (PHD). This indicates that CR1-like ORF1p proteins are either lipolytic enzymes or are involved in protein-protein interactions related to chromatin remodeling. Sequence conservation of ES suggests that interaction with cellular membranes is an important phase in life circles of CR1-like elements. Presumably such interaction helps in penetrating host cells. As a consequence, the presence of multiple young CR1 families characterized by approximately 10% intrafamily and 40% interfamily identities may be explained by a relatively frequent horizontal transfer of these CR1-like elements. Unexpectedly, ES links together non-LTR retrotransposons and single-stranded RNA viruses like influenza C and coronaviruses, which are known to depend on their own ES.
Collapse
|
32
|
Ye J, Yang Z, Hayes JJ, Eickbush TH. R2 retrotransposition on assembled nucleosomes depends on the translational position of the target site. EMBO J 2002; 21:6853-64. [PMID: 12486006 PMCID: PMC139086 DOI: 10.1093/emboj/cdf665] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
R2 retrotransposons insert into the 28S rRNA genes of insects. Integration occurs by specific cleavage of the target site and utilization of the released DNA end to prime reverse transcription of the RNA transcript. Specificity of the protein to the target site is dependent upon nucleotide sequence recognition extending from 35 bp upstream to 15 bp downstream of the cleavage site. In this report, we show that sequence recognition and cleavage by the R2 protein can occur while the target site is assembled into nucleosomes. Reconstitution of DNA fragments containing the 28S gene sequence into a set of nucleosomes with different translational frames revealed that the R2 site adopted the same rotational orientation with respect to the histone octamer. Binding and cleavage by the R2 protein were most efficient when the upstream binding site for the R2 protein was near a nucleosome end. Interaction of the R2 protein with the nucleosome disrupted the histone:DNA contacts in the 50 bp region directly bound by R2, but did not modify the remainder of the nucleosome structure.
Collapse
Affiliation(s)
| | - Zungyoon Yang
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| | - Jeffrey J. Hayes
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| | - Thomas H. Eickbush
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| |
Collapse
|
33
|
Pyatkov KI, Shostak NG, Zelentsova ES, Lyozin GT, Melekhin MI, Finnegan DJ, Kidwell MG, Evgen'ev MB. Penelope retroelements from Drosophila virilis are active after transformation of Drosophila melanogaster. Proc Natl Acad Sci U S A 2002; 99:16150-5. [PMID: 12451171 PMCID: PMC138580 DOI: 10.1073/pnas.252641799] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/22/2002] [Indexed: 11/18/2022] Open
Abstract
The Penelope family of retroelements was first described in species of the Drosophila virilis group. Intact elements encode a reverse transcriptase and an endonuclease of the UvrC type, which may play a role in Penelope integration. Penelope is a key element in the induction of D. virilis hybrid dysgenesis, which involves the mobilization of several unrelated families of transposable elements. We here report the successful introduction of Penelope into the germ line of Drosophila melanogaster by P element-mediated transformation with three different constructs. Penelope is actively transcribed in the D. melanogaster genome only in lines transformed with a construct containing a full-length Penelope clone. The transcript is identical to that detected in D. virilis dysgenic hybrids. Most newly transposed Penelope elements have a very complex organization. Significant proliferation of Penelope copy number occurred in some lines during the 24-month period after transformation. The absence of copy number increase with two other constructs suggests that the 5' andor 3' UTRs of Penelope are required for successful transposition in D. melanogaster. No insect retroelement has previously been reported to be actively transcribed and to increase in copy number after interspecific transformation.
Collapse
|
34
|
Chambeyron S, Brun C, Robin S, Bucheton A, Busseau I. Chimeric RNA transposition intermediates of the I factor produce precise retrotransposed copies. Nucleic Acids Res 2002; 30:3387-94. [PMID: 12140323 PMCID: PMC137080 DOI: 10.1093/nar/gkf456] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
I elements in Drosophila melanogaster are non-long terminal repeat (LTR) retrotransposons of particular interest because high levels of transposition can be induced by appropriate crosses. They use a full-length RNA transposition intermediate as a template for reverse transcription. Detailed molecular characterization of this intermediate is rendered difficult because of the many transcripts produced by defective elements. The use of an active I element marked with a sequence encoding the HA epitope solves this problem. We used an RNA circularization procedure followed by RT-PCR to analyze the transcripts produced by actively transposing tagged I elements. Most start at the 5' end at the second nucleotide of the I element and all are polyadenylated at a site located in genomic sequences downstream of the 3' end. One of the tagged I elements, inserted in locus 88A, produces chimeric transcripts that carry sequences from both 5'- and 3'-flanking genomic DNA. We show that synthesis of these chimeric transcripts is controlled by the I element itself. Analysis of full-length transposed copies of this element shows that the extra sequences at the 5' and 3' ends are not integrated during retrotransposition. This suggests that initiation and arrest of reverse transcription during retrotransposition are precise processes.
Collapse
Affiliation(s)
- Séverine Chambeyron
- Institut de Génétique Humaine, CNRS, 141 rue de la Cardonille, 34396 Montpellier cedex 5, France
| | | | | | | | | |
Collapse
|
35
|
Bibiłło A, Eickbush TH. The reverse transcriptase of the R2 non-LTR retrotransposon: continuous synthesis of cDNA on non-continuous RNA templates. J Mol Biol 2002; 316:459-73. [PMID: 11866511 DOI: 10.1006/jmbi.2001.5369] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
R2 is a non-long terminal repeat (non-LTR) retrotransposon that inserts into the 28 S rRNA genes of arthropods. The element encodes two enzymatic activities: an endonuclease that specifically cleaves the 28 S gene target site, and a reverse transcriptase (RT) that can use the 3' end of the cleaved DNA to prime reverse transcription. R2 RT only utilizes RNA templates that contain the 3' untranslated region of the R2 element as templates in this target primed reverse transcription (TPRT) reaction. Here, detailed biochemical characterization of the R2 RT indicates that the enzyme is capable of making multiple, consecutive jumps between RNA templates. The terminal 3' nucleotide of the "acceptor" RNA and the 5' nucleotide of the "donor" RNA are frequently reverse transcribed in these jumps, indicating that the acceptor RNA does not anneal to the cDNA derived from the donor RNA template. These template jumps occur during TPRT as well as in non-specific extension reactions in which reverse transcription is primed by an oligonucleotide annealed to the RNA template. Analysis of these RT assays done in the absence of the target DNA also revealed that the R2 RT can initiate reverse transcription near the 3' end of any RNA molecule using the 3' end of a second RNA molecule as primer. Again there is no requirement for sequence complementarity between the RNA used as template and the RNA used as primer. These properties of the R2 RT differ substantially from those of retroviral RTs but have similarities to the RT of the Mauriceville retroplasmid of Neurospora crassa. We present a model which relates these unusual properties of the R2 RT to structural differences from retroviral RTs as well as correlates these properties to the likely retrotransposition mechanism of R2 and other non-LTR retrotransposons.
Collapse
Affiliation(s)
- Arkadiusz Bibiłło
- Department of Biology, University of Rochester, Rochester, NY 4627-0211, USA
| | | |
Collapse
|
36
|
Pérez-González CE, Eickbush TH. Dynamics of R1 and R2 elements in the rDNA locus of Drosophila simulans. Genetics 2001; 158:1557-67. [PMID: 11514447 PMCID: PMC1461747 DOI: 10.1093/genetics/158.4.1557] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The mobile elements R1 and R2 insert specifically into the rRNA gene locus (rDNA locus) of arthropods, a locus known to undergo concerted evolution, the recombinational processes that preserve the sequence homogeneity of all repeats. To monitor how rapidly individual R1 and R2 insertions are turned over in the rDNA locus by these processes, we have taken advantage of the many 5' truncation variants that are generated during the target-primed reverse transcription mechanism used by these non-LTR retrotransposons for their integration. A simple PCR assay was designed to reveal the pattern of the 5' variants present in the rDNA loci of individual X chromosomes in a population of Drosophila simulans. Each rDNA locus in this population was found to have a large, unique collection of 5' variants. Each variant was present at low copy number, usually one copy per chromosome, and was seldom distributed to other chromosomes in the population. The failure of these variants to spread to other units in the same rDNA locus suggests a strong recombinational bias against R1 and R2 that results in the individual copies of these elements being rapidly lost from the rDNA locus. This bias suggests a significantly higher frequency of R1 and R2 retrotransposition than we have previously suggested.
Collapse
Affiliation(s)
- C E Pérez-González
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| | | |
Collapse
|
37
|
Gentile KL, Burke WD, Eickbush TH. Multiple lineages of R1 retrotransposable elements can coexist in the rDNA loci of Drosophila. Mol Biol Evol 2001; 18:235-45. [PMID: 11158382 DOI: 10.1093/oxfordjournals.molbev.a003797] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
R1 non-long terminal repeat retrotransposable elements insert specifically into the 28S rRNA genes of arthropods. One aspect of R1 evolution that has been difficult to explain is the presence of divergent lineages of R1 in the rDNA loci of the same species. Multiple lineages should compete for a limited number of insertion sites, in addition to being subject to the concerted evolution processes homogenizing the rRNA genes. The presence of multiple lineages suggests either the ability of the elements to overcome these factors and diverge within rDNA loci, or the introduction of new lineages by horizontal transmission. To address this issue, we attempted to characterize the complete set of R1 elements in the rDNA locus from five Drosophila species groups (melanogaster, obscura, testacea, quinaria, and repleta). Two major R1 lineages, A and B, that diverged about 100 MYA were found to exist in Drosophila. Elements of the A lineage were found in all 35 Drosophila species tested, while elements of the B lineage were found in only 11 species from three species groups. Phylogenetic analysis of the R1 elements, supported by comparison of their rates of nucleotide sequence substitution, revealed that both the A and the B lineages have been maintained by vertical descent. The B lineage was less stable and has undergone numerous, independent elimination events, while the A lineage has diverged into three sublineages, which were, in turn, differentially stable. We conclude that while the differential retention of multiple lineages greatly complicates its phylogenetic history, the available R1 data continue to be consistent with the strict vertical descent of these elements.
Collapse
Affiliation(s)
- K L Gentile
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| | | | | |
Collapse
|
38
|
Chaboissier MC, Finnegan D, Bucheton A. Retrotransposition of the I factor, a non-long terminal repeat retrotransposon of Drosophila, generates tandem repeats at the 3' end. Nucleic Acids Res 2000; 28:2467-72. [PMID: 10871395 PMCID: PMC102713 DOI: 10.1093/nar/28.13.2467] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2000] [Revised: 05/17/2000] [Accepted: 05/17/2000] [Indexed: 11/14/2022] Open
Abstract
Non-long terminal repeat (LTR) retrotransposons or LINEs transpose by reverse transcription of an RNA intermediate and are thought to use the 3' hydroxyl of a chromosomal cleavage to initiate synthesis of the first strand of the cDNA. Many of them terminate in a poly(dA) sequence at the 3' end of the coding strand although some, like the I factor of Drosophila melanogaster, have 3' ends formed by repeats of the trinucleotide TAA. We report results showing that I factor transcripts end a few nucleotides downstream of the TAA repeats and that these extra nucleotides are not integrated into chromosomal DNA during retrotransposition. We also show that the TAA repeats are not required for transposition and that I elements containing mutations affecting the TAA sequences generate transposed copies ending with tandem repeats of various types. Our results suggest that during integration the 3' end of the I factor RNA template can pair with nucleotides at the target site and that tandem duplications are generated by the reverse transcriptase of the I factor in a manner that is reminiscent of the activity of the reverse transcriptases of telomerases. Reverse transcriptases of other non-LTR retrotransposons may function in a similar way.
Collapse
Affiliation(s)
- M C Chaboissier
- Institut de Génétique Humaine, CNRS, Montpellier cedex 5, France
| | | | | |
Collapse
|
39
|
Malik HS, Eickbush TH. NeSL-1, an ancient lineage of site-specific non-LTR retrotransposons from Caenorhabditis elegans. Genetics 2000; 154:193-203. [PMID: 10628980 PMCID: PMC1460889 DOI: 10.1093/genetics/154.1.193] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Phylogenetic analyses of non-LTR retrotransposons suggest that all elements can be divided into 11 lineages. The 3 oldest lineages show target site specificity for unique locations in the genome and encode an endonuclease with an active site similar to certain restriction enzymes. The more "modern" non-LTR lineages possess an apurinic endonuclease-like domain and generally lack site specificity. The genome sequence of Caenorhabditis elegans reveals the presence of a non-LTR retrotransposon that resembles the older elements, in that it contains a single open reading frame with a carboxyl-terminal restriction-like endonuclease domain. Located near the N-terminal end of the ORF is a cysteine protease domain not found in any other non-LTR element. The N2 strain of C. elegans appears to contain only one full-length and several 5' truncated copies of this element. The elements specifically insert in the Spliced leader-1 genes; hence the element has been named NeSL-1 (Nematode Spliced Leader-1). Phylogenetic analysis confirms that NeSL-1 branches very early in the non-LTR lineage and that it represents a 12th lineage of non-LTR elements. The target specificity of NeSL-1 for the spliced leader exons and the similarity of its structure to that of R2 elements leads to a simple model for its expression and retrotransposition.
Collapse
Affiliation(s)
- H S Malik
- Department of Biology, University of Rochester, Rochester, New York 14627-0211, USA
| | | |
Collapse
|