1
|
Rodríguez-Vargas A, Collins K. Distinct and overlapping RNA determinants for binding and target-primed reverse transcription by Bombyx mori R2 retrotransposon protein. Nucleic Acids Res 2024; 52:6571-6585. [PMID: 38499488 PMCID: PMC11194090 DOI: 10.1093/nar/gkae194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/08/2024] [Accepted: 03/09/2024] [Indexed: 03/20/2024] Open
Abstract
Eukaryotic retrotransposons encode a reverse transcriptase that binds RNA to template DNA synthesis. The ancestral non-long terminal repeat (non-LTR) retrotransposons encode a protein that performs target-primed reverse transcription (TPRT), in which the nicked genomic target site initiates complementary DNA (cDNA) synthesis directly into the genome. The best understood model system for biochemical studies of TPRT is the R2 protein from the silk moth Bombyx mori. The R2 protein selectively binds the 3' untranslated region of its encoding RNA as template for DNA insertion to its target site in 28S ribosomal DNA. Here, binding and TPRT assays define RNA contributions to RNA-protein interaction, template use for TPRT and the fidelity of template positioning for TPRT cDNA synthesis. We quantify both sequence and structure contributions to protein-RNA interaction. RNA determinants of binding affinity overlap but are not equivalent to RNA features required for TPRT and its fidelity of template positioning for full-length TPRT cDNA synthesis. Additionally, we show that a previously implicated RNA-binding protein surface of R2 protein makes RNA binding affinity dependent on the presence of two stem-loops. Our findings inform evolutionary relationships across R2 retrotransposon RNAs and are a step toward understanding the mechanism and template specificity of non-LTR retrotransposon mobility.
Collapse
Affiliation(s)
- Anthony Rodríguez-Vargas
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
2
|
Lee RJ, Horton CA, Van Treeck B, McIntyre JJR, Collins K. Conserved and divergent DNA recognition specificities and functions of R2 retrotransposon N-terminal domains. Cell Rep 2024; 43:114239. [PMID: 38753487 PMCID: PMC11204384 DOI: 10.1016/j.celrep.2024.114239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/04/2024] [Accepted: 05/01/2024] [Indexed: 05/18/2024] Open
Abstract
R2 non-long terminal repeat (non-LTR) retrotransposons are among the most extensively distributed mobile genetic elements in multicellular eukaryotes and show promise for applications in transgene supplementation of the human genome. They insert new gene copies into a conserved site in 28S ribosomal DNA with exquisite specificity. R2 clades are defined by the number of zinc fingers (ZFs) at the N terminus of the retrotransposon-encoded protein, postulated to additively confer DNA site specificity. Here, we illuminate general principles of DNA recognition by R2 N-terminal domains across and between clades, with extensive, specific recognition requiring only one or two compact domains. DNA-binding and protection assays demonstrate broadly shared as well as clade-specific DNA interactions. Gene insertion assays in cells identify the N-terminal domains sufficient for target-site insertion and reveal roles in second-strand cleavage or synthesis for clade-specific ZFs. Our results have implications for understanding evolutionary diversification of non-LTR retrotransposon insertion mechanisms and the design of retrotransposon-based gene therapies.
Collapse
Affiliation(s)
- Rosa Jooyoung Lee
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Connor A Horton
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Briana Van Treeck
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Jeremy J R McIntyre
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
3
|
Deng P, Tan SQ, Yang QY, Fu L, Wu Y, Zhu HZ, Sun L, Bao Z, Lin Y, Zhang QC, Wang H, Wang J, Liu JJG. Structural RNA components supervise the sequential DNA cleavage in R2 retrotransposon. Cell 2023; 186:2865-2879.e20. [PMID: 37301196 DOI: 10.1016/j.cell.2023.05.032] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 03/14/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023]
Abstract
Retroelements are the widespread jumping elements considered as major drivers for genome evolution, which can also be repurposed as gene-editing tools. Here, we determine the cryo-EM structures of eukaryotic R2 retrotransposon with ribosomal DNA target and regulatory RNAs. Combined with biochemical and sequencing analysis, we reveal two essential DNA regions, Drr and Dcr, required for recognition and cleavage. The association of 3' regulatory RNA with R2 protein accelerates the first-strand cleavage, blocks the second-strand cleavage, and initiates the reverse transcription starting from the 3'-tail. Removing 3' regulatory RNA by reverse transcription allows the association of 5' regulatory RNA and initiates the second-strand cleavage. Taken together, our work explains the DNA recognition and RNA supervised sequential retrotransposition mechanisms by R2 machinery, providing insights into the retrotransposon and application reprogramming.
Collapse
Affiliation(s)
- Pujuan Deng
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Shun-Qing Tan
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Qi-Yu Yang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Liangzheng Fu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yachao Wu
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Han-Zhou Zhu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Sun
- Shandong Provincial Key Laboratory of Animal Cell and Developmental Biology, School of Life Sciences, Shandong University, Qingdao 266237, China
| | - Zhangbin Bao
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Yi Lin
- Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; IDG/McGovern Institute for Brain Research, Tsinghua University, Beijing 100084, China
| | - Qiangfeng Cliff Zhang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haoyi Wang
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jia Wang
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| | - Jun-Jie Gogo Liu
- Beijing Advanced Innovation Center for Structural Biology & Frontier Research Center for Biological Structure, School of Life Sciences, Tsinghua University, Beijing 100084, China; Tsinghua-Peking Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
4
|
Hartig N, Seibt KM, Heitkam T. How to start a LINE: 5' switching rejuvenates LINE retrotransposons in tobacco and related Nicotiana species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023. [PMID: 36965091 DOI: 10.1111/tpj.16208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 02/10/2023] [Accepted: 02/19/2023] [Indexed: 06/18/2023]
Abstract
By contrast to their conserved mammalian counterparts, plant long interspersed nuclear elements (LINEs) are highly variable, splitting into many low-copy families. Curiously, LINE families from the retrotransposable element (RTE) clade retain a stronger sequence conservation and hence reach higher copy numbers. The cause of this RTE-typical property is not yet understood, but would help clarify why some transposable elements are removed quickly, whereas others persist in plant genomes. Here, we bring forward a detailed study of RTE LINE structure, diversity and evolution in plants. For this, we argue that the nightshade family is the ideal taxon to follow the evolutionary trajectories of RTE LINEs, given their high abundance, recent activity and partnership to non-autonomous elements. Using bioinformatic, cytogenetic and molecular approaches, we detect 4029 full-length RTE LINEs across the Solanaceae. We finely characterize and manually curate a core group of 458 full-length LINEs in allotetraploid tobacco, show an integration event after polyploidization and trace hybridization by RTE LINE composition of parental genomes. Finally, we reveal the role of the untranslated regions (UTRs) as causes for the unique RTE LINE amplification and evolution pattern in plants. On the one hand, we detected a highly conserved motif at the 3' UTR, suggesting strong selective constraints acting on the RTE terminus. On the other hand, we observed successive rounds of 5' UTR cycling, constantly rejuvenating the promoter sequences. This interplay between exchangeable promoters and conserved LINE bodies and 3' UTR likely allows RTE LINEs to persist and thrive in plant genomes.
Collapse
Affiliation(s)
- Nora Hartig
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Tony Heitkam
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| |
Collapse
|
5
|
Park SK, Mohr G, Yao J, Russell R, Lambowitz AM. Group II intron-like reverse transcriptases function in double-strand break repair. Cell 2022; 185:3671-3688.e23. [PMID: 36113466 PMCID: PMC9530004 DOI: 10.1016/j.cell.2022.08.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 06/16/2022] [Accepted: 08/14/2022] [Indexed: 01/26/2023]
Abstract
Bacteria encode reverse transcriptases (RTs) of unknown function that are closely related to group II intron-encoded RTs. We found that a Pseudomonas aeruginosa group II intron-like RT (G2L4 RT) with YIDD instead of YADD at its active site functions in DNA repair in its native host and when expressed in Escherichia coli. G2L4 RT has biochemical activities strikingly similar to those of human DNA repair polymerase θ and uses them for translesion DNA synthesis and double-strand break repair (DSBR) via microhomology-mediated end-joining (MMEJ). We also found that a group II intron RT can function similarly in DNA repair, with reciprocal active-site substitutions showing isoleucine favors MMEJ and alanine favors primer extension in both enzymes. These DNA repair functions utilize conserved structural features of non-LTR-retroelement RTs, including human LINE-1 and other eukaryotic non-LTR-retrotransposon RTs, suggesting such enzymes may have inherent ability to function in DSBR in a wide range of organisms.
Collapse
Affiliation(s)
- Seung Kuk Park
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Georg Mohr
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Jun Yao
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Rick Russell
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Alan M Lambowitz
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA.
| |
Collapse
|
6
|
Pimentel SC, Upton HE, Collins K. Separable structural requirements for cDNA synthesis, nontemplated extension, and template jumping by a non-LTR retroelement reverse transcriptase. J Biol Chem 2022; 298:101624. [PMID: 35065960 PMCID: PMC8857657 DOI: 10.1016/j.jbc.2022.101624] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 01/12/2022] [Accepted: 01/13/2022] [Indexed: 11/16/2022] Open
Abstract
Broad evolutionary expansion of polymerase families has enabled specialization of their activities for distinct cellular roles. In addition to template-complementary synthesis, many polymerases extend their duplex products by nontemplated nucleotide addition (NTA). This activity is exploited for laboratory strategies of cloning and sequencing nucleic acids and could have important biological function, although the latter has been challenging to test without separation-of-function mutations. Several retroelement and retroviral reverse transcriptases (RTs) support NTA and also template jumping, by which the RT performs continuous complementary DNA (cDNA) synthesis using physically separate templates. Previous studies that aimed to dissect the relationship between NTA and template jumping leave open questions about structural requirements for each activity and their interdependence. Here, we characterize the structural requirements for cDNA synthesis, NTA, template jumping, and the unique terminal transferase activity of Bombyx mori R2 non-long terminal repeat retroelement RT. With sequence alignments and structure modeling to guide mutagenesis, we generated enzyme variants across motifs generally conserved or specific to RT subgroups. Enzyme variants had diverse NTA profiles not correlated with other changes in cDNA synthesis activity or template jumping. Using these enzyme variants and panels of activity assay conditions, we show that template jumping requires NTA. However, template jumping by NTA-deficient enzymes can be rescued using primer duplex with a specific length of 3′ overhang. Our findings clarify the relationship between NTA and template jumping as well as additional activities of non-long terminal repeat RTs, with implications for the specialization of RT biological functions and laboratory applications.
Collapse
Affiliation(s)
- Sydney C Pimentel
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, USA
| | - Heather E Upton
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, USA.
| |
Collapse
|
7
|
Kaur D, Agrahari M, Bhattacharya A, Bhattacharya S. The non-LTR retrotransposons of Entamoeba histolytica: genomic organization and biology. Mol Genet Genomics 2022; 297:1-18. [PMID: 34999963 DOI: 10.1007/s00438-021-01843-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 11/26/2021] [Indexed: 11/24/2022]
Abstract
Genome sequence analysis of Entamoeba species revealed various classes of transposable elements. While E. histolytica and E. dispar are rich in non-long terminal repeat (LTR) retrotransposons, E. invadens contains predominantly DNA transposons. Non-LTR retrotransposons of E. histolytica constitute three families of long interspersed nuclear elements (LINEs), and their short, nonautonomous partners, SINEs. They occupy ~ 11% of the genome. The EhLINE1/EhSINE1 family is the most abundant and best studied. EhLINE1 is 4.8 kb, with two ORFs that encode functions needed for retrotransposition. ORF1 codes for the nucleic acid-binding protein, and ORF2 has domains for reverse transcriptase (RT) and endonuclease (EN). Most copies of EhLINEs lack complete ORFs. ORF1p is expressed constitutively, but ORF2p is not detected. Retrotransposition could be demonstrated upon ectopic over expression of ORF2p, showing that retrotransposition machinery is functional. The newly retrotransposed sequences showed a high degree of recombination. In transcriptomic analysis, RNA-Seq reads were mapped to individual EhLINE1 copies. Although full-length copies were transcribed, no full-length 4.8 kb transcripts were seen. Rather, sense transcripts mapped to ORF1, RT and EN domains. Intriguingly, there was strong antisense transcription almost exclusively from the RT domain. These unique features of EhLINE1 could serve to attenuate retrotransposition in E. histolytica.
Collapse
|
8
|
Low-bias ncRNA libraries using ordered two-template relay: Serial template jumping by a modified retroelement reverse transcriptase. Proc Natl Acad Sci U S A 2021; 118:2107900118. [PMID: 34649994 PMCID: PMC8594491 DOI: 10.1073/pnas.2107900118] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/01/2021] [Indexed: 02/07/2023] Open
Abstract
Retrotransposons are noninfectious, mobile genetic elements that proliferate in host genomes via an RNA intermediate that is copied into DNA by a reverse transcriptase (RT) enzyme. RTs are important for biotechnological applications involving information capture from RNA since RNA is first converted into complementary DNA for detection or sequencing. Here, we biochemically characterized RTs from two retroelements and uncovered several activities that allowed us to design a streamlined, efficient workflow for determining the inventory of RNA sequences in processed RNA pools. The unique properties of nonretroviral RT activities obviate many technical issues associated with current methods of RNA sequence analysis, with wide applications in research, biotechnology, and diagnostics. Selfish, non-long terminal repeat (non-LTR) retroelements and mobile group II introns encode reverse transcriptases (RTs) that can initiate DNA synthesis without substantial base pairing of primer and template. Biochemical characterization of these enzymes has been limited by recombinant expression challenges, hampering understanding of their properties and the possible exploitation of their properties for research and biotechnology. We investigated the activities of representative RTs using a modified non-LTR RT from Bombyx mori and a group II intron RT from Eubacterium rectale. Only the non-LTR RT supported robust and serial template jumping, producing one complementary DNA (cDNA) from several templates each copied end to end. We also discovered an unexpected terminal deoxynucleotidyl transferase activity of the RTs that adds nucleotide(s) of choice to 3′ ends of single- and/or double-stranded RNA or DNA. Combining these two types of activity with additional insights about nontemplated nucleotide additions to duplexed cDNA product, we developed a streamlined protocol for fusion of next-generation sequencing adaptors to both cDNA ends in a single RT reaction. When benchmarked using a reference pool of microRNAs (miRNAs), library production by Ordered Two-Template Relay (OTTR) using recombinant non-LTR retroelement RT outperformed all commercially available kits and rivaled the low bias of technically demanding home-brew protocols. We applied OTTR to inventory RNAs purified from extracellular vesicles, identifying miRNAs as well as myriad other noncoding RNAs (ncRNAs) and ncRNA fragments. Our results establish the utility of OTTR for automation-friendly, low-bias, end-to-end RNA sequence inventories of complex ncRNA samples.
Collapse
|
9
|
Structural basis for template switching by a group II intron-encoded non-LTR-retroelement reverse transcriptase. J Biol Chem 2021; 297:100971. [PMID: 34280434 PMCID: PMC8363836 DOI: 10.1016/j.jbc.2021.100971] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Revised: 06/30/2021] [Accepted: 07/15/2021] [Indexed: 12/02/2022] Open
Abstract
Reverse transcriptases (RTs) can switch template strands during complementary DNA synthesis, enabling them to join discontinuous nucleic acid sequences. Template switching (TS) plays crucial roles in retroviral replication and recombination, is used for adapter addition in RNA-Seq, and may contribute to retroelement fitness by increasing evolutionary diversity and enabling continuous complementary DNA synthesis on damaged templates. Here, we determined an X-ray crystal structure of a TS complex of a group II intron RT bound simultaneously to an acceptor RNA and donor RNA template–DNA primer heteroduplex with a 1-nt 3′-DNA overhang. The structure showed that the 3′ end of the acceptor RNA binds in a pocket formed by an N-terminal extension present in non–long terminal repeat–retroelement RTs and the RT fingertips loop, with the 3′ nucleotide of the acceptor base paired to the 1-nt 3′-DNA overhang and its penultimate nucleotide base paired to the incoming dNTP at the RT active site. Analysis of structure-guided mutations identified amino acids that contribute to acceptor RNA binding and a phenylalanine residue near the RT active site that mediates nontemplated nucleotide addition. Mutation of the latter residue decreased multiple sequential template switches in RNA-Seq. Our results provide new insights into the mechanisms of TS and nontemplated nucleotide addition by RTs, suggest how these reactions could be improved for RNA-Seq, and reveal common structural features for TS by non–long terminal repeat–retroelement RTs and viral RNA–dependent RNA polymerases.
Collapse
|
10
|
Liu S, Wu I, Yu YP, Balamotis M, Ren B, Ben Yehezkel T, Luo JH. Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer. Commun Biol 2021; 4:506. [PMID: 33907296 PMCID: PMC8079361 DOI: 10.1038/s42003-021-02024-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 03/09/2021] [Indexed: 02/02/2023] Open
Abstract
The characterization of human gene expression is limited by short read lengths, high error rates and large input requirements. Here, we used a synthetic long read (SLR) sequencing approach, LoopSeq, to generate accurate sequencing reads that span full length transcripts using standard short read data. LoopSeq identified isoforms from control samples with 99.4% accuracy and a 0.01% per-base error rate, exceeding the accuracy reported for other long-read technologies. Applied to targeted transcriptome sequencing from colon cancers and their metastatic counterparts, LoopSeq revealed large scale isoform redistributions from benign colon mucosa to primary colon cancer and metastatic cancer and identified several previously unknown fusion isoforms. Strikingly, single nucleotide variants (SNVs) occurred dominantly in specific isoforms and some SNVs underwent isoform switching in cancer progression. The ability to use short reads to generate accurate long-read data as the raw unit of information holds promise as a widely accessible approach in transcriptome sequencing.
Collapse
Affiliation(s)
- Silvia Liu
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | - Indira Wu
- Loop Genomics, Inc., San Jose, CA, 95138, USA
| | - Yan-Ping Yu
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | | | - Baoguo Ren
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | | | - Jian-Hua Luo
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
| |
Collapse
|
11
|
R2 and Non-Site-Specific R2-Like Retrotransposons of the German Cockroach, Blattella germanica. Genes (Basel) 2020; 11:genes11101202. [PMID: 33076367 PMCID: PMC7650587 DOI: 10.3390/genes11101202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 10/10/2020] [Accepted: 10/12/2020] [Indexed: 11/17/2022] Open
Abstract
The structural and functional organization of the ribosomal RNA gene cluster and the full-length R2 non-LTR retrotransposon (integrated into a specific site of 28S ribosomal RNA genes) of the German cockroach, Blattella germanica, is described. A partial sequence of the R2 retrotransposon of the cockroach Rhyparobia maderae is also analyzed. The analysis of previously published next-generation sequencing data from the B. germanica genome reveals a new type of retrotransposon closely related to R2 retrotransposons but with a random distribution in the genome. Phylogenetic analysis reveals that these newly described retrotransposons form a separate clade. It is shown that proteins corresponding to the open reading frames of newly described retrotransposons exhibit unequal structural domains. Within these retrotransposons, a recombination event is described. New mechanism of transposition activity is discussed. The essential structural features of R2 retrotransposons are conserved in cockroaches and are typical of previously described R2 retrotransposons. However, the investigation of the number and frequency of 5′-truncated R2 retrotransposon insertion variants in eight B. germanica populations suggests recent mobile element activity. It is shown that the pattern of 5′-truncated R2 retrotransposon copies can be an informative molecular genetic marker for revealing genetic distances between insect populations.
Collapse
|
12
|
Khadgi BB, Govindaraju A, Christensen SM. Completion of LINE integration involves an open '4-way' branched DNA intermediate. Nucleic Acids Res 2019; 47:8708-8719. [PMID: 31392993 PMCID: PMC6895275 DOI: 10.1093/nar/gkz673] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 06/26/2019] [Accepted: 07/29/2019] [Indexed: 12/20/2022] Open
Abstract
Long Interspersed Elements (LINEs), also known as non-LTR retrotransposons, encode a multifunctional protein that reverse transcribes its mRNA into DNA at the site of insertion by target primed reverse transcription. The second half of the integration reaction remains very poorly understood. Second-strand DNA cleavage and second-strand DNA synthesis were investigated in vitro using purified components from a site-specific restriction-like endonuclease (RLE) bearing LINE. DNA structure was shown to be a critical component of second-strand DNA cleavage. A hitherto unknown and unexplored integration intermediate, an open ‘4-way’ DNA junction, was recognized by the element protein and cleaved in a Holliday junction resolvase-like reaction. Cleavage of the 4-way junction resulted in a natural primer-template pairing used for second-strand DNA synthesis. A new model for RLE LINE integration is presented.
Collapse
Affiliation(s)
- Brijesh B Khadgi
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Shawn M Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| |
Collapse
|
13
|
Lentzsch AM, Yao J, Russell R, Lambowitz AM. Template-switching mechanism of a group II intron-encoded reverse transcriptase and its implications for biological function and RNA-Seq. J Biol Chem 2019; 294:19764-19784. [PMID: 31712313 DOI: 10.1074/jbc.ra119.011337] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 11/01/2019] [Indexed: 12/18/2022] Open
Abstract
The reverse transcriptases (RTs) encoded by mobile group II introns and other non-LTR retroelements differ from retroviral RTs in being able to template-switch efficiently from the 5' end of one template to the 3' end of another with little or no complementarity between the donor and acceptor templates. Here, to establish a complete kinetic framework for the reaction and to identify conditions that more efficiently capture acceptor RNAs or DNAs, we used a thermostable group II intron RT (TGIRT; GsI-IIC RT) that can template switch directly from synthetic RNA template/DNA primer duplexes having either a blunt end or a 3'-DNA overhang end. We found that the rate and amplitude of template switching are optimal from starter duplexes with a single nucleotide 3'-DNA overhang complementary to the 3' nucleotide of the acceptor RNA, suggesting a role for nontemplated nucleotide addition of a complementary nucleotide to the 3' end of cDNAs synthesized from natural templates. Longer 3'-DNA overhangs progressively decreased the template-switching rate, even when complementary to the 3' end of the acceptor template. The reliance on only a single bp with the 3' nucleotide of the acceptor together with discrimination against mismatches and the high processivity of group II intron RTs enable synthesis of full-length DNA copies of nucleic acids beginning directly at their 3' end. We discuss the possible biological functions of the template-switching activity of group II intron- and other non-LTR retroelement-encoded RTs, as well as the optimization of this activity for adapter addition in RNA- and DNA-Seq protocols.
Collapse
Affiliation(s)
- Alfred M Lentzsch
- Institute for Cellular and Molecular Biology, Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, Texas 78712
| | - Jun Yao
- Institute for Cellular and Molecular Biology, Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, Texas 78712
| | - Rick Russell
- Institute for Cellular and Molecular Biology, Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, Texas 78712
| | - Alan M Lambowitz
- Institute for Cellular and Molecular Biology, Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, Texas 78712
| |
Collapse
|
14
|
Su Y, Nichuguti N, Kuroki-Kami A, Fujiwara H. Sequence-specific retrotransposition of 28S rDNA-specific LINE R2Ol in human cells. RNA (NEW YORK, N.Y.) 2019; 25:1432-1438. [PMID: 31434792 PMCID: PMC6795142 DOI: 10.1261/rna.072512.119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Accepted: 08/12/2019] [Indexed: 06/10/2023]
Abstract
R2 is a long interspersed element (LINE) found in a specific sequence of the 28S rDNA among a wide variety of animals. Recently, we observed that R2Ol isolated from medaka fish, Oryzias latipes, retrotransposes sequence specifically into the target sequence of zebrafish. Because the 28S target and flanking regions are widely conserved among vertebrates, we examined whether R2Ol can also integrate in a sequence-specific manner in human cells. Using adenovirus-mediated expression of R2Ol constructs, we confirmed an accurate insertion of R2Ol into the 28S target of human 293T cells. However, the R2Ol mutant devoid of endonuclease (EN) activity showed no retrotransposition ability, suggesting that the sequence-specific integration of R2Ol into 28S rDNA occurs via the cleavage activity of EN. By introducing both R2Ol helper virus and donor plasmid in human cells, we succeeded in retrotransposing an exogenous EGFP gene into the 28S target site by the trans-complementation system, which enabled simplification of specific gene knock-in in a time-efficient manner. We believe that R2Ol may provide an alternative targeted gene knock-in method for practical applications such as gene therapy in future.
Collapse
Affiliation(s)
- Yuting Su
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Narisu Nichuguti
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Azusa Kuroki-Kami
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Haruhiko Fujiwara
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| |
Collapse
|
15
|
Wang PL, Luchetti A, Alberto Ruggieri A, Xiong XM, Xu MRX, Zhang XG, Zhang HH. Successful Invasions of Short Internally Deleted Elements (SIDEs) and Its Partner CR1 in Lepidoptera Insects. Genome Biol Evol 2019; 11:2505-2516. [PMID: 31384954 PMCID: PMC6740152 DOI: 10.1093/gbe/evz174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2019] [Indexed: 11/28/2022] Open
Abstract
Although DNA transposons often generated internal deleted derivatives such as miniature inverted-repeat transposable elements, short internally deleted elements (SIDEs) derived from nonlong terminal-repeat retrotransposons are rare. Here, we found a novel SIDE, named Persaeus, that originated from the chicken repeat 1 (CR1) retrotransposon Zenon and it has been found widespread in Lepidoptera insects. Our findings suggested that Persaeus and the partner Zenon have experienced a transposition burst in their host genomes and the copy number of Persaeus and Zenon in assayed genomes are significantly correlated. Accordingly, the activity though age analysis indicated that the replication wave of Persaeus coincided with that of Zenon. Phylogenetic analyses suggested that Persaeus may have evolved at least four times independently, and that it has been vertically transferred into its host genomes. Together, our results provide new insights into the evolution dynamics of SIDEs and its partner non-LTRs.
Collapse
Affiliation(s)
- Ping-Lan Wang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Italy
| | | | | | - Min-Rui-Xuan Xu
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Xiao-Gu Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Hua-Hao Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| |
Collapse
|
16
|
Kögler A, Schmidt T, Wenke T. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:676-695. [PMID: 28857316 DOI: 10.1111/tpj.13676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2017] [Revised: 08/18/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families.
Collapse
Affiliation(s)
- Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| |
Collapse
|
17
|
Alenko A, Fleming AM, Burrows CJ. Reverse Transcription Past Products of Guanine Oxidation in RNA Leads to Insertion of A and C opposite 8-Oxo-7,8-dihydroguanine and A and G opposite 5-Guanidinohydantoin and Spiroiminodihydantoin Diastereomers. Biochemistry 2017; 56:5053-5064. [PMID: 28845978 DOI: 10.1021/acs.biochem.7b00730] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Reactive oxygen species, both endogenous and exogenous, can damage nucleobases of RNA and DNA. Among the nucleobases, guanine has the lowest redox potential, making it a major target of oxidation. Although RNA is more prone to oxidation than DNA is, oxidation of guanine in RNA has been studied to a significantly lesser extent. One of the reasons for this is that many tools that were previously developed to study oxidation of DNA cannot be used on RNA. In the study presented here, the lack of a method for seeking sites of modification in RNA where oxidation occurs is addressed. For this purpose, reverse transcription of RNA containing major products of guanine oxidation was used. Extension of a DNA primer annealed to an RNA template containing 8-oxo-7,8-dihydroguanine (OG), 5-guanidinohydantoin (Gh), or the R and S diastereomers of spiroiminodihydantoin (Sp) was studied under standing start conditions. SuperScript III reverse transcriptase is capable of bypassing these lesions in RNA inserting predominantly A opposite OG, predominantly G opposite Gh, and almost an equal mixture of A and G opposite the Sp diastereomers. These data should allow RNA sequencing of guanine oxidation products by following characteristic mutation signatures formed by the reverse transcriptase during primer elongation past G oxidation sites in the template RNA strand.
Collapse
Affiliation(s)
- Anton Alenko
- Department of Chemistry, University of Utah , 315 South 1400 East, Salt Lake City, Utah 84112-0850, United States
| | - Aaron M Fleming
- Department of Chemistry, University of Utah , 315 South 1400 East, Salt Lake City, Utah 84112-0850, United States
| | - Cynthia J Burrows
- Department of Chemistry, University of Utah , 315 South 1400 East, Salt Lake City, Utah 84112-0850, United States
| |
Collapse
|
18
|
Abstract
R2 elements are sequence specific non-LTR retrotransposons that exclusively insert in the 28S rRNA genes of animals. R2s encode an endonuclease that cleaves the insertion site and a reverse transcriptase that uses the cleaved DNA to prime reverse transcription of the R2 transcript, a process termed target primed reverse transcription. Additional unusual properties of the reverse transcriptase as well as DNA and RNA binding domains of the R2 encoded protein have been characterized. R2 expression is through co-transcription with the 28S gene and self-cleavage by a ribozyme encoded at the R2 5' end. Studies in laboratory stocks and natural populations of Drosophila suggest that R2 expression is tied to the distribution of R2-inserted units within the rDNA locus. Most individuals have no R2 expression because only a small fraction of their rRNA genes need to be active, and a contiguous region of the locus free of R2 insertions can be selected for activation. However, if the R2-free region is not large enough to produce sufficient rRNA, flanking units - including those inserted with R2 - must be activated. Finally, R2 copies rapidly turnover within the rDNA locus, yet R2 has been vertically maintained in animal lineages for hundreds of millions of years. The key to this stability is R2's ability to remain dormant in rDNA units outside the transcribed regions for generations until the stochastic nature of the crossovers that drive the concerted evolution of the rDNA locus inevitably reshuffle the inserted and uninserted units, resulting in transcription of the R2-inserted units.
Collapse
|
19
|
Abstract
This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome's small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns.
Collapse
|
20
|
Abstract
Diversity-generating retroelements (DGRs) are DNA diversification machines found in diverse bacterial and bacteriophage genomes that accelerate the evolution of ligand-receptor interactions. Diversification results from a unidirectional transfer of sequence information from an invariant template repeat (TR) to a variable repeat (VR) located in a protein-encoding gene. Information transfer is coupled to site-specific mutagenesis in a process called mutagenic homing, which occurs through an RNA intermediate and is catalyzed by a unique, DGR-encoded reverse transcriptase that converts adenine residues in the TR into random nucleotides in the VR. In the prototype DGR found in the Bordetella bacteriophage BPP-1, the variable protein Mtd is responsible for phage receptor recognition. VR diversification enables progeny phage to switch tropism, accelerating their adaptation to changes in sequence or availability of host cell-surface molecules for infection. Since their discovery, hundreds of DGRs have been identified, and their functions are just beginning to be understood. VR-encoded residues of many DGR-diversified proteins are displayed in the context of a C-type lectin fold, although other scaffolds, including the immunoglobulin fold, may also be used. DGR homing is postulated to occur through a specialized target DNA-primed reverse transcription mechanism that allows repeated rounds of diversification and selection, and the ability to engineer DGRs to target heterologous genes suggests applications for bioengineering. This chapter provides a comprehensive review of our current understanding of this newly discovered family of beneficial retroelements.
Collapse
|
21
|
Jimenez RM, Polanco JA, Lupták A. Chemistry and Biology of Self-Cleaving Ribozymes. Trends Biochem Sci 2015; 40:648-661. [PMID: 26481500 DOI: 10.1016/j.tibs.2015.09.001] [Citation(s) in RCA: 127] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 08/28/2015] [Accepted: 09/01/2015] [Indexed: 11/26/2022]
Abstract
Self-cleaving ribozymes were discovered 30 years ago, but their biological distribution and catalytic mechanisms are only beginning to be defined. Each ribozyme family is defined by a distinct structure, with unique active sites accelerating the same transesterification reaction across the families. Biochemical studies show that general acid-base catalysis is the most common mechanism of self-cleavage, but metal ions and metabolites can be used as cofactors. Ribozymes have been discovered in highly diverse genomic contexts throughout nature, from viroids to vertebrates. Their biological roles include self-scission during rolling-circle replication of RNA genomes, co-transcriptional processing of retrotransposons, and metabolite-dependent gene expression regulation in bacteria. Other examples, including highly conserved mammalian ribozymes, suggest that many new biological roles are yet to be discovered.
Collapse
Affiliation(s)
- Randi M Jimenez
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA
| | - Julio A Polanco
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA
| | - Andrej Lupták
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA; Department of Pharmaceutical Sciences, University of California, Irvine, CA, USA; Department of Chemistry, University of California, Irvine, CA, USA.
| |
Collapse
|
22
|
Abstract
Eukaryotic genomes are colonized by various transposons including short interspersed elements (SINEs). The 5' region (head) of the majority of SINEs is derived from one of the three types of RNA genes--7SL RNA, transfer RNA (tRNA), or 5S ribosomal RNA (rRNA)--and the internal promoter inside the head promotes the transcription of the entire SINEs. Here I report a new group of SINEs whose heads originate from either the U1 or U2 small nuclear RNA gene. These SINEs, named SINEU, are distributed among crocodilians and classified into three families. The structures of the SINEU-1 subfamilies indicate the recurrent addition of a U1- or U2-derived sequence onto the 5' end of SINEU-1 elements. SINEU-1 and SINEU-3 are ancient and shared among alligators, crocodiles, and gharials, while SINEU-2 is absent in the alligator genome. SINEU-2 is the only SINE family that was active after the split of crocodiles and gharials. All SINEU families, especially SINEU-3, are preferentially inserted into a family of Mariner DNA transposon, Mariner-N4_AMi. A group of Tx1 non-long terminal repeat retrotransposons designated Tx1-Mar also show target preference for Mariner-N4_AMi, indicating that SINEU was mobilized by Tx1-Mar.
Collapse
|
23
|
Martoni F, Eickbush DG, Scavariello C, Luchetti A, Mantovani B. Dead element replicating: degenerate R2 element replication and rDNA genomic turnover in the Bacillus rossius stick insect (Insecta: Phasmida). PLoS One 2015; 10:e0121831. [PMID: 25799008 PMCID: PMC4370867 DOI: 10.1371/journal.pone.0121831] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Accepted: 02/04/2015] [Indexed: 11/18/2022] Open
Abstract
R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution.
Collapse
Affiliation(s)
- Francesco Martoni
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Danna G. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Claudia Scavariello
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
- * E-mail:
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| |
Collapse
|
24
|
Jamburuthugoda VK, Eickbush TH. Identification of RNA binding motifs in the R2 retrotransposon-encoded reverse transcriptase. Nucleic Acids Res 2014; 42:8405-15. [PMID: 24957604 PMCID: PMC4117753 DOI: 10.1093/nar/gku514] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
R2 non-LTR retrotransposons insert at a specific site in the 28S rRNA genes of many animal phyla. R2 elements encode a single polypeptide with reverse transcriptase, endonuclease and nucleic acid binding domains. Integration involves separate cleavage of the two DNA strands at the target site and utilization of the released 3' ends to prime DNA synthesis. Critical to this integration is the ability of the protein to specifically bind 3' and 5' regions of the R2 RNA. In this report, alanine mutations in two conserved motifs N-terminal to the reverse transcriptase domain were generated and shown to result in proteins that retained the ability to cleave the first strand of the DNA target, to reverse transcribe RNA from an annealed primer and to displace annealed RNA when using DNA as a template. However, the mutant proteins had greatly reduced ability to bind 3' and 5' RNA in mobility shift assays, use the DNA target to prime reverse transcription and conduct second-strand DNA cleavage. These motifs thus appear to participate in all activities of the R2 protein known to require specific RNA binding. The similarity of these R2 RNA binding motifs to those of telomerase and group II introns is discussed.
Collapse
Affiliation(s)
| | - Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| |
Collapse
|
25
|
Eickbush DG, Burke WD, Eickbush TH. Evolution of the R2 retrotransposon ribozyme and its self-cleavage site. PLoS One 2013; 8:e66441. [PMID: 24066021 PMCID: PMC3774820 DOI: 10.1371/journal.pone.0066441] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 05/07/2013] [Indexed: 12/23/2022] Open
Abstract
R2 is a non-long terminal repeat retrotransposon that inserts site-specifically in the tandem 28S rRNA genes of many animals. Previously, R2 RNA from various species of Drosophila was shown to self-cleave from the 28S rRNA/R2 co-transcript by a hepatitis D virus (HDV)-like ribozyme encoded at its 5' end. RNA cleavage was at the precise 5' junction of the element with the 28S gene. Here we report that RNAs encompassing the 5' ends of R2 elements from throughout its species range fold into HDV-like ribozymes. In vitro assays of RNA self-cleavage conducted in many R2 lineages confirmed activity. For many R2s, RNA self-cleavage was not at the 5' end of the element but at 28S rRNA sequences up to 36 nucleotides upstream of the junction. The location of cleavage correlated well with the types of endogenous R2 5' junctions from different species. R2 5' junctions were uniform for most R2s in which RNA cleavage was upstream in the rRNA sequences. The 28S sequences remaining on the first DNA strand synthesized during retrotransposition are postulated to anneal to the target site and uniformly prime second strand DNA synthesis. In species where RNA cleavage occurred at the R2 5' end, the 5' junctions were variable. This junction variation is postulated to result from the priming of second strand DNA synthesis by chance microhomologies between the target site and the first DNA strand. Finally, features of R2 ribozyme evolution, especially changes in cleavage site and convergence on the same active site sequences, are discussed.
Collapse
Affiliation(s)
- Danna G. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - William D. Burke
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Thomas H. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
- * E-mail:
| |
Collapse
|
26
|
Mohr S, Ghanem E, Smith W, Sheeter D, Qin Y, King O, Polioudakis D, Iyer VR, Hunicke-Smith S, Swamy S, Kuersten S, Lambowitz AM. Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing. RNA (NEW YORK, N.Y.) 2013; 19:958-70. [PMID: 23697550 PMCID: PMC3683930 DOI: 10.1261/rna.039743.113] [Citation(s) in RCA: 140] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications.
Collapse
Affiliation(s)
- Sabine Mohr
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Eman Ghanem
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Whitney Smith
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Dennis Sheeter
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Yidan Qin
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Olga King
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Damon Polioudakis
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Vishwanath R. Iyer
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | - Scott Hunicke-Smith
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
| | | | - Scott Kuersten
- Epicentre—An Illumina Company, Madison, Wisconsin 53713, USA
| | - Alan M. Lambowitz
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712, USA
- Corresponding authorE-mail
| |
Collapse
|
27
|
Monot C, Kuciak M, Viollet S, Mir AA, Gabus C, Darlix JL, Cristofari G. The specificity and flexibility of l1 reverse transcription priming at imperfect T-tracts. PLoS Genet 2013; 9:e1003499. [PMID: 23675310 PMCID: PMC3649969 DOI: 10.1371/journal.pgen.1003499] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Accepted: 03/22/2013] [Indexed: 01/18/2023] Open
Abstract
L1 retrotransposons have a prominent role in reshaping mammalian genomes. To replicate, the L1 ribonucleoprotein particle (RNP) first uses its endonuclease (EN) to nick the genomic DNA. The newly generated DNA end is subsequently used as a primer to initiate reverse transcription within the L1 RNA poly(A) tail, a process known as target-primed reverse transcription (TPRT). Prior studies demonstrated that most L1 insertions occur into sequences related to the L1 EN consensus sequence (degenerate 5′-TTTT/A-3′ sites) and frequently preceded by imperfect T-tracts. However, it is currently unclear whether—and to which degree—the liberated 3′-hydroxyl extremity on the genomic DNA needs to be accessible and complementary to the poly(A) tail of the L1 RNA for efficient priming of reverse transcription. Here, we employed a direct assay for the initiation of L1 reverse transcription to define the molecular rules that guide this process. First, efficient priming is detected with as few as 4 matching nucleotides at the primer 3′ end. Second, L1 RNP can tolerate terminal mismatches if they are compensated within the 10 last bases of the primer by an increased number of matching nucleotides. All terminal mismatches are not equally detrimental to DNA extension, a C being extended at higher levels than an A or a G. Third, efficient priming in the context of duplex DNA requires a 3′ overhang. This suggests the possible existence of additional DNA processing steps, which generate a single-stranded 3′ end to allow L1 reverse transcription. Based on these data we propose that the specificity of L1 reverse transcription initiation contributes, together with the specificity of the initial EN cleavage, to the distribution of new L1 insertions within the human genome. Jumping genes are DNA sequences present in the genome of most living organisms. They contribute to genome dynamics and occasionally result in hereditary genetic diseases or cancer. L1 elements are the only autonomously active jumping genes in the human genome. They replicate through an RNA–mediated copy-and-paste mechanism by cleaving the host genome and then using this new DNA end as a primer to reverse transcribe its own RNA, generating a new L1 DNA copy. The molecular determinants that influence L1 target site choice are not fully understood. Here we present a quantitative assay to measure the influence of DNA target site sequence and structure on the reverse transcription step. By testing more than 65 potential DNA primers, we observe that not all sites are equally extended by the L1 machinery, and we define the rules guiding this process. In particular, we highlight the importance of partial sequence complementarity between the target site and the L1 RNA extremity, but also the high level of flexibility of this process, since detrimental terminal mismatches can be compensated by an increasing number of interacting nucleotides. We propose that this mechanism contributes to the distribution of new L1 insertions within the human genome.
Collapse
Affiliation(s)
- Clément Monot
- INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
| | - Monika Kuciak
- INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
| | - Sébastien Viollet
- INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
| | - Ashfaq Ali Mir
- INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
| | - Caroline Gabus
- Ecole Normale Supérieure de Lyon, Human Virology Department, INSERM U758, Lyon, France
| | - Jean-Luc Darlix
- Ecole Normale Supérieure de Lyon, Human Virology Department, INSERM U758, Lyon, France
| | - Gaël Cristofari
- INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
- University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
- * E-mail:
| |
Collapse
|
28
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
29
|
Han JS, Shao S. Circular retrotransposition products generated by a LINE retrotransposon. Nucleic Acids Res 2012; 40:10866-77. [PMID: 22977178 PMCID: PMC3510499 DOI: 10.1093/nar/gks859] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Non-long terminal repeat (non-LTR) retrotransposons are highly abundant elements that are present in chromosomes throughout the eukaryotic domain of life. The long interspersed nuclear element (LINE-1) (L1) clade of non-LTR retrotransposons has been particularly successful in mammals, accounting for 30–40% of human genome sequence. The current model of LINE retrotransposition, target-primed reverse transcription, culminates in a chromosomally integrated end product. Using a budding yeast model of non-LTR retrotransposition, we show that in addition to producing these ‘classical’, chromosomally integrated products, a fungal L1 clade member (Zorro3) can generate abundant, RNA-derived episomal products. Genetic evidence suggests that these products are likely to be formed via a variation of target-primed reverse transcription. These episomal products are a previously unseen alternative fate of LINE retrotransposition, and may represent an unexpected source for de novo retrotransposition.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD 21218, USA.
| | | |
Collapse
|
30
|
Starnes JH, Thornbury DW, Novikova OS, Rehmeyer CJ, Farman ML. Telomere-targeted retrotransposons in the rice blast fungus Magnaporthe oryzae: agents of telomere instability. Genetics 2012; 191:389-406. [PMID: 22446319 PMCID: PMC3374306 DOI: 10.1534/genetics.111.137950] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2011] [Accepted: 03/11/2012] [Indexed: 02/07/2023] Open
Abstract
The fungus Magnaporthe oryzae is a serious pathogen of rice and other grasses. Telomeric restriction fragments in Magnaporthe isolates that infect perennial ryegrass (prg) are hotspots for genomic rearrangement and undergo frequent, spontaneous alterations during fungal culture. The telomeres of rice-infecting isolates are very stable by comparison. Sequencing of chromosome ends from a number of prg-infecting isolates revealed two related non-LTR retrotransposons (M. oryzae Telomeric Retrotransposons or MoTeRs) inserted in the telomere repeats. This contrasts with rice pathogen telomeres that are uninterrupted by other sequences. Genetic evidence indicates that the MoTeR elements are responsible for the observed instability. MoTeRs represent a new family of telomere-targeted transposons whose members are found exclusively in fungi.
Collapse
Affiliation(s)
| | - David W. Thornbury
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky 40546
| | - Olga S. Novikova
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky 40546
| | | | - Mark L. Farman
- Department of Plant Pathology, University of Kentucky, Lexington, Kentucky 40546
| |
Collapse
|
31
|
Eickbush DG, Eickbush TH. R2 and R2/R1 hybrid non-autonomous retrotransposons derived by internal deletions of full-length elements. Mob DNA 2012; 3:10. [PMID: 22621441 PMCID: PMC3414825 DOI: 10.1186/1759-8753-3-10] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2012] [Accepted: 05/23/2012] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND R2 is a non-long terminal repeat (non-LTR) retrotransposable element that inserts site specifically into the 28S genes of the ribosomal (r)RNA gene loci. Encoded at the 5' end is a ribozyme that generates the precise 5' end by self-cleavage of a 28S gene cotranscript. Sequences at the 3' end are necessary for the R2 protein to bind RNA and initiate the target primed reverse transcription (TPRT) reaction. These minimal RNA requirements suggested that if recombination/DNA repair conjoined the 5' and 3' ends of R2, the result would be a non-autonomous element that could survive as long as autonomous R2 elements supplied the TPRT activity. RESULTS A PCR-based survey of 39 Drosophila species aided by genomic sequences from 12 of these species revealed two types of non-autonomous elements. We call these elements SIDEs (for 'Short Internally Deleted Elements'). The first consisted of a 5' ribozyme and a 3' end of an R2 element as predicted. Variation at the 5' junctions of the R2 SIDE copies was typical for R2 insertions suggesting their propagation by TPRT. The second class of SIDE contained sequences from R1 elements, another non-LTR retrotransposon that inserts into rRNA gene loci. These insertions had an R2 ribozyme immediately upstream of R1 3' end sequences. These hybrid SIDEs were inserted at the R1 site with 14 bp target site duplications typical of R1 insertions suggesting they used the R1 machinery for retrotransposition. Finally, the survey revealed examples of U12 small nuclear (sn)RNA and tRNA sequences at the 5' end of R2 elements suggesting the R2 reverse transcriptase can template jump from the R2 transcript to a second RNA during TPRT. CONCLUSIONS The R2 SIDE and R2/R1 hybrid SIDEs are rare examples of non-autonomous retrotransposons in the Drosophila genome. Associated non-autonomous elements and in vivo template jumps are two additional characteristics R2 shares with other non-LTR retrotransposons such as mammalian L1s. Analysis of the hybrid SIDEs provides supporting evidence that R1 elements, like R2 elements, recognize their 3' untranslated region (UTR) sequences and, thus, belong to the stringent class of non-LTR elements.
Collapse
Affiliation(s)
- Danna G Eickbush
- Department of Biology, University of Rochester, Rochester, NY, 14627, USA.
| | | |
Collapse
|
32
|
Yadav VP, Mandal PK, Bhattacharya A, Bhattacharya S. Recombinant SINEs are formed at high frequency during induced retrotransposition in vivo. Nat Commun 2012; 3:854. [PMID: 22617294 DOI: 10.1038/ncomms1855] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2012] [Accepted: 04/19/2012] [Indexed: 11/09/2022] Open
Abstract
Non-long terminal repeat Retrotransposons are referred to as long interspersed nuclear elements (LINEs) and their non-autonomous partners are short interspersed nuclear elements (SINEs). It is believed that an active SINE copy, upon retrotransposition, generates near identical copies of itself, which subsequently accumulate mutations resulting in sequence polymorphism. Here we show that when a retrotransposition-competent cell line of the parasitic protist Entamoeba histolytica, transfected with a marked SINE copy, is induced to retrotranspose, >20% of the newly retrotransposed copies are neither identical to the marked SINE nor to the mobilized resident SINEs. Rather they are recombinants of resident SINEs and the marked SINE. They are a consequence of retrotransposition and not DNA recombination, as they are absent in cells not expressing the retrotransposition functions. This high-frequency recombination provides a new explanation for the existence of mosaic SINEs, which may impact on genetic analysis of SINE lineages, and measurement of phylogenetic distances.
Collapse
Affiliation(s)
- Vijay Pal Yadav
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | | | | | | |
Collapse
|
33
|
Luchetti A, Mingazzini V, Mantovani B. 28S junctions and chimeric elements of the rDNA targeting non-LTR retrotransposon R2 in crustacean living fossils (Branchiopoda, Notostraca). Genomics 2012; 100:51-6. [PMID: 22564473 DOI: 10.1016/j.ygeno.2012.04.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2012] [Revised: 04/17/2012] [Accepted: 04/23/2012] [Indexed: 11/29/2022]
Abstract
The 28S rRNA genes of several metazoans are interrupted by site-specific targeting non-LTR retrotransposons, such as R2. R2 elements have been deeply analyzed but aspects of their retrotransposition mechanism and the origin of the wide diversity observed are still debated. We characterized six new R2 lineages in four tadpole shrimp species (Notostraca), samples deriving from a parthenogenetic population of Triops cancriformis (R2Tc_it) and from bisexual Lepidurus populations of L. lubbocki (R2Ll), L. couesii (R2LcA, R2LcB, R2LcC) and L. arcticus (R2La). All elements fit the canonical R2 structure but R2Ll which turned out to be a chimera with an additional ORF originating from another R2. Consistently with data on LINEs, R2Ll could be the result of recombination due to reverse transcriptase template jump. The analysis of 28S/R2 5' end junctions further suggests aberrant homologous recombination, as observed in RNA viruses.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dip. Biologia Evoluzionistica Sperimentale, Università di Bologna, via Selmi 3, 40126 Bologna, Italy.
| | | | | |
Collapse
|
34
|
White TB, Lambowitz AM. The retrohoming of linear group II intron RNAs in Drosophila melanogaster occurs by both DNA ligase 4-dependent and -independent mechanisms. PLoS Genet 2012; 8:e1002534. [PMID: 22359518 PMCID: PMC3280974 DOI: 10.1371/journal.pgen.1002534] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Accepted: 12/24/2011] [Indexed: 12/31/2022] Open
Abstract
Mobile group II introns are bacterial retrotransposons that are thought to have invaded early eukaryotes and evolved into introns and retroelements in higher organisms. In bacteria, group II introns typically retrohome via full reverse splicing of an excised intron lariat RNA into a DNA site, where it is reverse transcribed by the intron-encoded protein. Recently, we showed that linear group II intron RNAs, which can result from hydrolytic splicing or debranching of lariat RNAs, can retrohome in eukaryotes by performing only the first step of reverse splicing, ligating their 3' end to the downstream DNA exon. Reverse transcription then yields an intron cDNA, whose free end is linked to the upstream DNA exon by an error-prone process that yields junctions similar to those formed by non-homologous end joining (NHEJ). Here, by using Drosophila melanogaster NHEJ mutants, we show that linear intron RNA retrohoming occurs by major Lig4-dependent and minor Lig4-independent mechanisms, which appear to be related to classical and alternate NHEJ, respectively. The DNA repair polymerase θ plays a crucial role in both pathways. Surprisingly, however, mutations in Ku70, which functions in capping chromosome ends during NHEJ, have only moderate, possibly indirect effects, suggesting that both Lig4 and the alternate end-joining ligase act in some retrohoming events independently of Ku. Another potential Lig4-independent mechanism, reverse transcriptase template switching from the intron RNA to the upstream exon DNA, occurs in vitro, but gives junctions differing from the majority in vivo. Our results show that group II introns can utilize cellular NHEJ enzymes for retromobility in higher organisms, possibly exploiting mechanisms that contribute to retrotransposition and mitigate DNA damage by resident retrotransposons. Additionally, our results reveal novel activities of group II intron reverse transcriptases, with implications for retrohoming mechanisms and potential biotechnological applications.
Collapse
Affiliation(s)
- Travis B. White
- Institute for Cellular and Molecular Biology, Department of Chemistry and Biochemistry and Section of Molecular Genetics and Microbiology, University of Texas at Austin, Austin, Texas, United States of America
| | - Alan M. Lambowitz
- Institute for Cellular and Molecular Biology, Department of Chemistry and Biochemistry and Section of Molecular Genetics and Microbiology, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
35
|
Ruminski DJ, Webb CHT, Riccitelli NJ, Lupták A. Processing and translation initiation of non-long terminal repeat retrotransposons by hepatitis delta virus (HDV)-like self-cleaving ribozymes. J Biol Chem 2011; 286:41286-41295. [PMID: 21994949 DOI: 10.1074/jbc.m111.297283] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Many non-long terminal repeat (non-LTR) retrotransposons lack internal promoters and are co-transcribed with their host genes. These transcripts need to be liberated before inserting into new loci. Using structure-based bioinformatics, we show that several classes of retrotransposons in phyla-spanning arthropods, nematodes, and chordates utilize self-cleaving ribozymes of the hepatitis delta virus (HDV) family for processing their 5' termini. Ribozyme-terminated retrotransposons include rDNA-specific R2, R4, and R6, telomere-specific SART, and Baggins and RTE. The self-scission of the R2 ribozyme is strongly modulated by the insertion site sequence in the rDNA, with the most common insertion sequences promoting faster processing. The ribozymes also promote translation initiation of downstream open reading frames in vitro and in vivo. In some organisms HDV-like and hammerhead ribozymes appear to be dedicated to processing long and short interspersed elements, respectively. HDV-like ribozymes serve several distinct functions in non-LTR retrotransposition, including 5' processing, translation initiation, and potentially trans-templating.
Collapse
Affiliation(s)
- Dana J Ruminski
- Departments of Molecular Biology and Biochemistry, University of California, Irvine, California 92697
| | - Chiu-Ho T Webb
- Departments of Molecular Biology and Biochemistry, University of California, Irvine, California 92697
| | | | - Andrej Lupták
- Departments of Molecular Biology and Biochemistry, University of California, Irvine, California 92697; Department of Chemistry, University of California, Irvine, California 92697; Department of Pharmaceutical Sciences, University of California, Irvine, California 92697.
| |
Collapse
|
36
|
Abstract
Reverse transcriptases have shaped genomes in many ways. A remarkable example of this shaping is found on telomeres of the genus Drosophila, where retrotransposons have a vital role in chromosome structure. Drosophila lacks telomerase; instead, three telomere-specific retrotransposons maintain chromosome ends. Repeated transpositions to chromosome ends produce long head to tail arrays of these elements. In both form and function, these arrays are analogous to the arrays of repeats added by telomerase to chromosomes in other organisms. Distantly related Drosophila exhibit this variant mechanism of telomere maintenance, which was established before the separation of extant Drosophila species. Nevertheless, the telomere-specific elements still have the hallmarks that characterize non-long terminal repeat (non-LTR) retrotransposons; they have also acquired characteristics associated with their roles at telomeres. These telomeric retrotransposons have shaped the Drosophila genome, but they have also been shaped by the genome. Here, we discuss ways in which these three telomere-specific retrotransposons have been modified for their roles in Drosophila chromosomes.
Collapse
|
37
|
Siol O, Spaller T, Schiefner J, Winckler T. Genetically tagged TRE5-A retrotransposons reveal high amplification rates and authentic target site preference in the Dictyostelium discoideum genome. Nucleic Acids Res 2011; 39:6608-19. [PMID: 21525131 PMCID: PMC3159450 DOI: 10.1093/nar/gkr261] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2011] [Revised: 03/21/2011] [Accepted: 04/06/2011] [Indexed: 11/14/2022] Open
Abstract
Retrotransposons contribute significantly to the evolution of eukaryotic genomes. They replicate by producing DNA copies of their own RNA, which are integrated at new locations in the host cell genome. In the gene-dense genome of the social amoeba Dictyostelium discoideum, retrotransposon TRE5-A avoids insertional mutagenesis by targeting the transcription factor (TF) IIIC/IIIB complex and integrating ∼ 50 bp upstream of tRNA genes. We generated synthetic TRE5-A retrotransposons (TRE5-A(bsr)) that were tagged with a selection marker that conferred resistance to blasticidin after a complete retrotransposition cycle. We found that the TRE5-A(bsr) elements were efficiently mobilized in trans by proteins expressed from the endogenous TRE5-A population found in D. discoideum cells. ORF1 protein translated from TRE5-A(bsr) elements significantly enhanced retrotransposition. We observed that the 5' untranslated region of TRE5-A could be replaced by an unrelated promoter, whereas the 3' untranslated region of TRE5-A was essential for retrotransposition. A predicted secondary structure in the RNA of the 3' untranslated region of TRE5-A may be involved in the retrotransposition process. The TRE5-A(bsr) elements were capable of identifying authentic integration targets in vivo, including formerly unnoticed, putative binding sites for TFIIIC on the extrachromosomal DNA element that carries the ribosomal RNA genes.
Collapse
Affiliation(s)
| | | | | | - Thomas Winckler
- Department of Pharmaceutical Biology, School of Biology and Pharmacy, Institute of Pharmacy, University of Jena, Semmelweisstrasse 10, 07743 Jena, Germany
| |
Collapse
|
38
|
Abstract
Short interspersed elements (SINEs) are one of the two most prolific mobile genomic elements in most of the higher eukaryotes. Although their biology is still not thoroughly understood, unusual life cycle of these simple elements amplified as genomic parasites makes their evolution unique in many ways. In contrast to most genetic elements including other transposons, SINEs emerged de novo many times in evolution from available molecules (for example, tRNA). The involvement of reverse transcription in their amplification cycle, huge number of genomic copies and modular structure allow variation mechanisms in SINEs uncommon or rare in other genetic elements (module exchange between SINE families, dimerization, and so on.). Overall, SINE evolution includes their emergence, progressive optimization and counteraction to the cell's defense against mobile genetic elements.
Collapse
|
39
|
Thompson BK, Christensen SM. Independently derived targeting of 28S rDNA by A- and D-clade R2 retrotransposons: Plasticity of integration mechanism. Mob Genet Elements 2011; 1:29-37. [PMID: 22016843 PMCID: PMC3190273 DOI: 10.4161/mge.1.1.16485] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2011] [Revised: 05/16/2011] [Accepted: 05/16/2011] [Indexed: 12/24/2022] Open
Abstract
Restriction-like endonuclease (RLE) bearing non-LTR retrotransposons are site-specific elements that integrate into the genome through a target primed reverse transcription mechanism (TPRT). R2 elements have been used as a model system for investigating non-LTR retrotransposon integration. We previously demonstrated that R2 retrotransposons require two subunits of the element-encoded multifunctional protein to integrate-one subunit bound upstream of the insertion site and one bound downstream. R2 elements have been phylogenetically categorized into four clades: R2-A, B, C and D, that diverged from a common ancestor more than 850 million years ago. All R2 elements target the same sequence within 28S rDNA. The amino-terminal domain of R2Bm, an R2-D clade element, contains a single zinc finger and a Myb motif that are responsible for binding R2 protein downstream of the insertion site. Target site recognition is of interest as it is the first step in the integration reaction and may help elucidate evolutionary history and integration mechanism. The amino-terminal domain of R2-A clade members contains three zinc fingers and a Myb motif. We show here that R2Lp, an R2-A clade member, uses its amino-terminal DNA binding motifs to bind upstream of the insertion site. Because the R2-A and R2-D clade elements recognize 28S rDNA differently, we conclude the A- and D-clades represent independent targeting events to the 28S site. Our results also indicate a certain plasticity of insertional mechanics exists between the two clades.
Collapse
Affiliation(s)
- Blaine K Thompson
- Department of Biology; University of Texas at Arlington; Arlington, TX USA
| | | |
Collapse
|
40
|
The reverse transcriptase encoded by the non-LTR retrotransposon R2 is as error-prone as that encoded by HIV-1. J Mol Biol 2011; 407:661-72. [PMID: 21320510 DOI: 10.1016/j.jmb.2011.02.015] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2010] [Revised: 02/02/2011] [Accepted: 02/04/2011] [Indexed: 11/20/2022]
Abstract
Reverse transcriptases (RTs) encoded by a wide range of mobile retroelements have had a major impact on the structure and function of genomes. Among the most abundant elements in eukaryotes are the non long terminal repeat (LTR) retrotransposons. Here we compare the dNTP concentration requirements and error rates of the RT encoded by the non-LTR retrotransposon R2 of Bombyx mori with the well-characterized RTs of retroviruses. Surprisingly, R2 was found to have properties more similar to those of lentiviral RTs, such as human immunodeficiency virus type 1 (HIV-1), than to those of oncoretroviral RTs, such as murine leukemia virus. Like HIV-1 RT, R2 RT was able to synthesize DNA at low dNTP concentrations, suggesting that R2 is able to retrotranspose in nondividing cells. R2 RT also showed levels of misincorporation in biased dNTP pools and replication error rates in M13 lacZα forward mutation assays, similar to HIV-1 RT. Most of the R2 base substitutions in the forward mutation assay were caused by the misincorporation of dTMP. Analogous to HIV-1, the high error rate of R2 RT appears to be a result of its ability to extend mismatches once generated. We suggest that the low fidelity of R2 RT is a by-product of the flexibility of its active site/dNTP binding pocket required for the target-primed reverse transcription reaction used by R2 for retrotransposition. Finally, we discuss that in spite of the high R2 RT error rate, the long-term nucleotide substitution rate for R2 is not significantly above that associated with cellular DNA replication, based on the frequency of R2 retrotranspositions determined in natural populations.
Collapse
|
41
|
Kojima KK. Different integration site structures between L1 protein-mediated retrotransposition in cis and retrotransposition in trans. Mob DNA 2010; 1:17. [PMID: 20615209 PMCID: PMC2912911 DOI: 10.1186/1759-8753-1-17] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2010] [Accepted: 07/08/2010] [Indexed: 11/10/2022] Open
Abstract
Background Long interspersed nuclear element-1 (LINE-1 or L1) is a dominant repetitive sequence in the human genome. Besides mediating its own retrotransposition, L1 can mobilize Alu and messenger RNA (mRNA) in trans, and probably also SVA and non-coding RNA. The structures of L1 copies and trans-mobilized retrocopies are variable and can be classified into three categories: full-length; 5'-truncated; and 5'-inverted insertions. These structures may be generated by different 5' integration mechanisms. Results In this study, a method to correctly characterize insertions with short target site duplications (TSDs) is developed and extranucleotides, TSDs and microhomologies (MHs) at junctions were analysed for the three types of insertions. Only 5'-truncated L1 insertions were found to be associated with short TSDs. Both full-length and 5'-truncated retrotransposed sequences in trans, including Alu, SVA and mRNA retrocopies and also full-length and 5'-inverted L1, were not associated with short TSDs, indicating the difference of 5' attachment between retrotransposition in cis and retrotransposition in trans. Target sequence analysis suggested that short TSDs were generated in an L1 endonuclease-dependent manner. The MHs were longer for 5'-inverted L1 than for 5'-truncated L1, indicating less dependence on annealing in 5'-truncated L1 insertions. Conclusions The results suggest that insertions flanked by short TSDs occur more often coupled with the insertion of 5'-truncated L1 than with those of other types of insertions in vivo. The method used in this study can be used to characterize elements without any apparent boundary structures.
Collapse
Affiliation(s)
- Kenji K Kojima
- Department of Biological Sciences, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-Cho, Midori-Ku, Yokohama, Kanagawa 226-8501, Japan.
| |
Collapse
|
42
|
Han JS. Non-long terminal repeat (non-LTR) retrotransposons: mechanisms, recent developments, and unanswered questions. Mob DNA 2010; 1:15. [PMID: 20462415 PMCID: PMC2881922 DOI: 10.1186/1759-8753-1-15] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2010] [Accepted: 05/12/2010] [Indexed: 12/22/2022] Open
Abstract
Non-long terminal repeat (non-LTR) retrotransposons are present in most eukaryotic genomes. In some species, such as humans, these elements are the most abundant genome sequence and continue to replicate to this day, creating a source of endogenous mutations and potential genotoxic stress. This review will provide a general outline of the replicative cycle of non-LTR retrotransposons. Recent findings regarding the host regulation of non-LTR retrotransposons will be summarized. Finally, future directions of interest will be discussed.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Embryology, Carnegie Institution of Washington, Baltimore, MD, USA.
| |
Collapse
|
43
|
Rangwala SH, Richards EJ. The structure, organization and radiation of Sadhu non-long terminal repeat retroelements in Arabidopsis species. Mob DNA 2010; 1:10. [PMID: 20226007 PMCID: PMC2848041 DOI: 10.1186/1759-8753-1-10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Accepted: 03/01/2010] [Indexed: 11/10/2022] Open
Abstract
Background Sadhu elements are non-autonomous retroposons first recognized in Arabidopsis thaliana. There is a wide degree of divergence among different elements, suggesting that these sequences are ancient in origin. Here we report the results of several lines of investigation into the genomic organization and evolutionary history of this element family. Results We present a classification scheme for Sadhu elements in A. thaliana, describing derivative elements related to the full-length elements we reported previously. We characterized Sadhu5 elements in a set of A. thaliana strains in order to trace the history of radiation in this subfamily. Sequences surrounding the target sites of different Sadhu insertions are consistent with mobilization by LINE retroelements. Finally, we identified Sadhu elements grouping into distinct subfamilies in two related species, Arabidopsis arenosa and Arabidopsis lyrata. Conclusions Our analyses suggest that the Sadhu retroelement family has undergone target primed reverse transcription-driven retrotransposition during the divergence of different A. thaliana strains. In addition, Sadhu elements can be found at moderate copy number in three distinct Arabidopsis species, indicating that the evolutionary history of these sequences can be traced back at least several millions of years.
Collapse
Affiliation(s)
- Sanjida H Rangwala
- Department of Biology, Washington University in St Louis, St Louis, MO, USA.
| | | |
Collapse
|
44
|
Terai G, Yoshizawa A, Okida H, Asai K, Mituyama T. Discovery of short pseudogenes derived from messenger RNAs. Nucleic Acids Res 2009; 38:1163-71. [PMID: 19965772 PMCID: PMC2831318 DOI: 10.1093/nar/gkp1098] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
More than 40% of the human genome is generated by retrotransposition, a series of in vivo processes involving reverse transcription of RNA molecules and integration of the transcripts into the genomic sequence. The mechanism of retrotransposition, however, is not fully understood, and additional genomic elements generated by retrotransposition may remain to be discovered. Here, we report that the human genome contains many previously unidentified short pseudogenes generated by retrotransposition of mRNAs. Genomic elements generated by non-long terminal repeat retrotransposition have specific sequence signatures: a poly-A tract that is immediately downstream and a pair of duplicated sequences, called target site duplications (TSDs), at either end. Using a new computer program, TSDscan, that can accurately detect pseudogenes based on the presence of the poly-A tract and TSDs, we found 654 short (≤300 bp), previously unknown pseudogenes derived from mRNAs. Comprehensive analyses of the pseudogenes that we identified and their parent mRNAs revealed that the pseudogene length depends on the parent mRNA length: long mRNAs generate more short pseudogenes than do short mRNAs. To explain this phenomenon, we hypothesize that most long mRNAs are truncated before they are reverse transcribed. Truncated mRNAs would be rapidly degraded during reverse transcription, resulting in the generation of short pseudogenes.
Collapse
Affiliation(s)
- Goro Terai
- INTEC Systems Institute Inc., Koto-ku 136-0075, Japan.
| | | | | | | | | |
Collapse
|
45
|
Tomaska L, Nosek J, Kramara J, Griffith JD. Telomeric circles: universal players in telomere maintenance? Nat Struct Mol Biol 2009; 16:1010-5. [PMID: 19809492 PMCID: PMC4041010 DOI: 10.1038/nsmb.1660] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
To maintain linear DNA genomes, organisms have evolved numerous means of solving problems associated with DNA ends (telomeres), including telomere-associated retrotransposons, palindromes, hairpins, covalently bound proteins and the addition of arrays of simple DNA repeats. Telomeric arrays can be maintained through various mechanisms such as telomerase activity or recombination. The recombination-dependent maintenance pathways may include telomeric loops (t-loops) and telomeric circles (t-circles). The potential involvement of t-circles in telomere maintenance was first proposed for linear mitochondrial genomes. The occurrence of t-circles in a wide range of organisms, spanning yeasts, plants and animals, suggests the involvement of t-circles in many phenomena including the alternative-lengthening of telomeres (ALT) pathway and telomere rapid deletion (TRD). In this Perspective, we summarize these findings and discuss how t-circles may be related to t-loops and how t-circles may have initiated the evolution of telomeres.
Collapse
Affiliation(s)
- Lubomir Tomaska
- Department of Genetics, Comenius University in Bratislava, Faculty of Natural Sciences, Bratislava, Slovakia.
| | | | | | | |
Collapse
|
46
|
Stage DE, Eickbush TH. Origin of nascent lineages and the mechanisms used to prime second-strand DNA synthesis in the R1 and R2 retrotransposons of Drosophila. Genome Biol 2009; 10:R49. [PMID: 19416522 PMCID: PMC2718515 DOI: 10.1186/gb-2009-10-5-r49] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Revised: 03/27/2009] [Accepted: 05/05/2009] [Indexed: 01/31/2023] Open
Abstract
Comparative analysis of 12 Drosophila genomes reveals insights into the evolution and mechanism of integration of R1 and R2 retrotransposons. Background Most arthropods contain R1 and R2 retrotransposons that specifically insert into the 28S rRNA genes. Here, the sequencing reads from 12 Drosophila genomes have been used to address two questions concerning these elements. First, to what extent is the evolution of these elements subject to the concerted evolution process that is responsible for sequence homogeneity among the different copies of rRNA genes? Second, how precise are the target DNA cleavages and priming of DNA synthesis used by these elements? Results Most copies of R1 and R2 in each species were found to exhibit less than 0.2% sequence divergence. However, in many species evidence was obtained for the formation of distinct sublineages of elements, particularly in the case of R1. Analysis of the hundreds of R1 and R2 junctions with the 28S gene revealed that cleavage of the first DNA strand was precise both in location and the priming of reverse transcription. Cleavage of the second DNA strand was less precise within a species, differed between species, and gave rise to variable priming mechanisms for second strand synthesis. Conclusions These findings suggest that the high sequence identity amongst R1 and R2 copies is because all copies are relatively new. However, each active element generates its own independent lineage that can eventually populate the locus. Independent lineages occur more often with R1, possibly because these elements contain their own promoter. Finally, both R1 and R2 use imprecise, rapidly evolving mechanisms to cleave the second strand and prime second strand synthesis.
Collapse
Affiliation(s)
- Deborah E Stage
- Biology Department, University of Rochester, Rochester NY 14627-0211, USA.
| | | |
Collapse
|
47
|
Suzuki J, Yamaguchi K, Kajikawa M, Ichiyanagi K, Adachi N, Koyama H, Takeda S, Okada N. Genetic evidence that the non-homologous end-joining repair pathway is involved in LINE retrotransposition. PLoS Genet 2009; 5:e1000461. [PMID: 19390601 PMCID: PMC2666801 DOI: 10.1371/journal.pgen.1000461] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Accepted: 03/26/2009] [Indexed: 11/24/2022] Open
Abstract
Long interspersed elements (LINEs) are transposable elements that proliferate within eukaryotic genomes, having a large impact on eukaryotic genome evolution. LINEs mobilize via a process called retrotransposition. Although the role of the LINE-encoded protein(s) in retrotransposition has been extensively investigated, the participation of host-encoded factors in retrotransposition remains unclear. To address this issue, we examined retrotransposition frequencies of two structurally different LINEs—zebrafish ZfL2-2 and human L1—in knockout chicken DT40 cell lines deficient in genes involved in the non-homologous end-joining (NHEJ) repair of DNA and in human HeLa cells treated with a drug that inhibits NHEJ. Deficiencies of NHEJ proteins decreased retrotransposition frequencies of both LINEs in these cells, suggesting that NHEJ is involved in LINE retrotransposition. More precise characterization of ZfL2-2 insertions in DT40 cells permitted us to consider the possibility of dual roles for NHEJ in LINE retrotransposition, namely to ensure efficient integration of LINEs and to restrict their full-length formation. Long interspersed elements (LINEs) are transposable elements that mobilize and amplify their own copies within eukaryotic genomes. Although LINEs had been considered as “junk” DNA, recent studies have suggested that the LINE-induced alterations of host chromosomes are a major driving force for eukaryotic genome evolution. LINEs mobilize via a mechanism called retrotransposition, in which transcribed LINE RNA is reverse transcribed into DNA that is then integrated into the host chromosome. Although the role of LINE-encoded proteins in retrotransposition has been revealed, the participation of host-encoded proteins has not been well investigated. Here, using knockout chicken DT40 cell lines, we present genetic evidence that the host-encoded proteins involved in repair of DNA double-strand breaks participate in LINE retrotransposition. More precise characterization of LINE insertions in DT40 cells suggested dual roles for these host DNA repair proteins in LINE retrotransposition; one function is required for efficient integration of LINEs and the other restricts their full-length formation.
Collapse
Affiliation(s)
- Jun Suzuki
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa, Japan
| | - Katsumi Yamaguchi
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa, Japan
| | - Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa, Japan
- * E-mail: (MK); (NO)
| | - Kenji Ichiyanagi
- Division of Human Genetics, Department of Integrated Genetics, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka, Japan
| | - Noritaka Adachi
- International Graduate School of Arts and Sciences, Yokohama City University, Kanazawa-ku, Yokohama, Kanagawa, Japan
| | - Hideki Koyama
- International Graduate School of Arts and Sciences, Yokohama City University, Kanazawa-ku, Yokohama, Kanagawa, Japan
| | - Shunichi Takeda
- Radiation Genetics, Graduate School of Medicine, Kyoto University, Konoe Yoshida, Kyoto, Kyoto, Japan
| | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa, Japan
- * E-mail: (MK); (NO)
| |
Collapse
|
48
|
Abstract
Over one-third of human genome sequence is a product of non-LTR retrotransposition. The retrotransposon that currently drives this process in humans is the highly abundant LINE-1 (L1) element. Despite the ubiquitous nature of L1's in mammals, we still lack a complete mechanistic understanding of the L1 replication cycle and how it is regulated. To generate a genetically amenable model for non-LTR retrotransposition, we have reengineered the Zorro3 retrotransposon, an L1 homolog from Candida albicans, for use in the budding yeast Saccharomyces cerevisiae. We found that S. cerevisiae, which has no endogenous L1 homologs or remnants, can still support Zorro3 retrotransposition. Analysis of Zorro3 mutants and insertion structures suggest that this is authentic L1-like retrotransposition with remarkable resemblance to mammalian L1-mediated events. This suggests that S. cerevisiae has unexpectedly retained the basal host machinery required for L1 retrotransposition. This model will also serve as a powerful system to study the cell biology of L1 elements and for the genetic identification and characterization of cellular factors involved in L1 retrotransposition.
Collapse
|
49
|
Eickbush TH, Jamburuthugoda VK. The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res 2008; 134:221-34. [PMID: 18261821 DOI: 10.1016/j.virusres.2007.12.010] [Citation(s) in RCA: 171] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2007] [Revised: 12/14/2007] [Accepted: 12/14/2007] [Indexed: 11/30/2022]
Abstract
A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Four major classes of retrotransposons are described here. First, the long-terminal-repeat (LTR) retrotransposons have similar structures and mechanisms to those of the vertebrate retroviruses. Genes that may enable these retrotransposons to leave a cell have been acquired by these elements in a number of animal and plant lineages. Second, the tyrosine recombinase retrotransposons are similar to the LTR retrotransposons except that they have substituted a recombinase for the integrase and recombine into the host chromosomes. Third, the non-LTR retrotransposons use a cleaved chromosomal target site generated by an encoded endonuclease to prime reverse transcription. Finally, the Penelope-like retrotransposons are not well understood but appear to also use cleaved DNA or the ends of chromosomes as primer for reverse transcription. Described in the second part of this review are the enzymatic properties of the reverse transcriptases (RTs) encoded by retrotransposons. The RTs of the LTR retrotransposons are highly divergent in sequence but have similar enzymatic activities to those of retroviruses. The RTs of the non-LTR retrotransposons have several unique properties reflecting their adaptation to a different mechanism of retrotransposition.
Collapse
Affiliation(s)
- Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, NY 14627, USA.
| | | |
Collapse
|
50
|
Gogvadze E, Barbisan C, Lebrun MH, Buzdin A. Tripartite chimeric pseudogene from the genome of rice blast fungus Magnaporthe grisea suggests double template jumps during long interspersed nuclear element (LINE) reverse transcription. BMC Genomics 2007; 8:360. [PMID: 17922896 PMCID: PMC2104539 DOI: 10.1186/1471-2164-8-360] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2007] [Accepted: 10/08/2007] [Indexed: 01/26/2023] Open
Abstract
Background A systematic survey of loci carrying retrotransposons in the genome of the rice blast fungus Magnaporthe grisea allowed the identification of novel non-canonical retropseudogenes. These elements are chimeric retrogenes composed of DNA copies from different cellular transcripts directly fused to each other. Their components are copies of a non protein-coding highly expressed RNA of unknown function termed WEIRD and of two fungal retrotransposons: MGL and Mg-SINE. Many of these chimeras are transcribed in various M. grisea tissues and during plant infection. Chimeric retroelements with a similar structure were recently found in three mammalian genomes. All these chimeras are likely formed by RNA template switches during the reverse transcription of diverse LINE elements. Results We have shown that in M. grisea template switching occurs at specific sites within the initial template RNA which contains a characteristic consensus sequence. We also provide evidence that both single and double template switches may occur during LINE retrotransposition, resulting in the fusion of three different transcript copies. In addition to the 33 bipartite elements, one tripartite chimera corresponding to the fusion of three retrotranscripts (WEIRD, Mg-SINE, MGL-LINE) was identified in the M. grisea genome. Unlike the previously reported two human tripartite elements, this fungal retroelement is flanked by identical 14 bp-long direct repeats. The presence of these short terminal direct repeats demonstrates that the LINE enzymatic machinery was involved in the formation of this chimera and its integration in the M. grisea genome. Conclusion A survey of mammalian genomic databases also revealed two novel tripartite chimeric retroelements, suggesting that double template switches occur during reverse transcription of LINE retrotransposons in different eukaryotic organisms.
Collapse
Affiliation(s)
- Elena Gogvadze
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow 117871, Russia.
| | | | | | | |
Collapse
|