1
|
Zhang X, Van Treeck B, Horton CA, McIntyre JJR, Palm SM, Shumate JL, Collins K. Harnessing eukaryotic retroelement proteins for transgene insertion into human safe-harbor loci. Nat Biotechnol 2024:10.1038/s41587-024-02137-y. [PMID: 38379101 DOI: 10.1038/s41587-024-02137-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 01/10/2024] [Indexed: 02/22/2024]
Abstract
Current approaches for inserting autonomous transgenes into the genome, such as CRISPR-Cas9 or virus-based strategies, have limitations including low efficiency and high risk of untargeted genome mutagenesis. Here, we describe precise RNA-mediated insertion of transgenes (PRINT), an approach for site-specifically primed reverse transcription that directs transgene synthesis directly into the genome at a multicopy safe-harbor locus. PRINT uses delivery of two in vitro transcribed RNAs: messenger RNA encoding avian R2 retroelement-protein and template RNA encoding a transgene of length validated up to 4 kb. The R2 protein coordinately recognizes the target site, nicks one strand at a precise location and primes complementary DNA synthesis for stable transgene insertion. With a cultured human primary cell line, over 50% of cells can gain several 2 kb transgenes, of which more than 50% are full-length. PRINT advantages include no extragenomic DNA, limiting risk of deleterious mutagenesis and innate immune responses, and the relatively low cost, rapid production and scalability of RNA-only delivery.
Collapse
Affiliation(s)
- Xiaozhu Zhang
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Briana Van Treeck
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Connor A Horton
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Jeremy J R McIntyre
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Sarah M Palm
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Justin L Shumate
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA.
| |
Collapse
|
2
|
Abstract
R2 elements are sequence specific non-LTR retrotransposons that exclusively insert in the 28S rRNA genes of animals. R2s encode an endonuclease that cleaves the insertion site and a reverse transcriptase that uses the cleaved DNA to prime reverse transcription of the R2 transcript, a process termed target primed reverse transcription. Additional unusual properties of the reverse transcriptase as well as DNA and RNA binding domains of the R2 encoded protein have been characterized. R2 expression is through co-transcription with the 28S gene and self-cleavage by a ribozyme encoded at the R2 5' end. Studies in laboratory stocks and natural populations of Drosophila suggest that R2 expression is tied to the distribution of R2-inserted units within the rDNA locus. Most individuals have no R2 expression because only a small fraction of their rRNA genes need to be active, and a contiguous region of the locus free of R2 insertions can be selected for activation. However, if the R2-free region is not large enough to produce sufficient rRNA, flanking units - including those inserted with R2 - must be activated. Finally, R2 copies rapidly turnover within the rDNA locus, yet R2 has been vertically maintained in animal lineages for hundreds of millions of years. The key to this stability is R2's ability to remain dormant in rDNA units outside the transcribed regions for generations until the stochastic nature of the crossovers that drive the concerted evolution of the rDNA locus inevitably reshuffle the inserted and uninserted units, resulting in transcription of the R2-inserted units.
Collapse
|
3
|
Jamburuthugoda VK, Eickbush TH. Identification of RNA binding motifs in the R2 retrotransposon-encoded reverse transcriptase. Nucleic Acids Res 2014; 42:8405-15. [PMID: 24957604 PMCID: PMC4117753 DOI: 10.1093/nar/gku514] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
R2 non-LTR retrotransposons insert at a specific site in the 28S rRNA genes of many animal phyla. R2 elements encode a single polypeptide with reverse transcriptase, endonuclease and nucleic acid binding domains. Integration involves separate cleavage of the two DNA strands at the target site and utilization of the released 3' ends to prime DNA synthesis. Critical to this integration is the ability of the protein to specifically bind 3' and 5' regions of the R2 RNA. In this report, alanine mutations in two conserved motifs N-terminal to the reverse transcriptase domain were generated and shown to result in proteins that retained the ability to cleave the first strand of the DNA target, to reverse transcribe RNA from an annealed primer and to displace annealed RNA when using DNA as a template. However, the mutant proteins had greatly reduced ability to bind 3' and 5' RNA in mobility shift assays, use the DNA target to prime reverse transcription and conduct second-strand DNA cleavage. These motifs thus appear to participate in all activities of the R2 protein known to require specific RNA binding. The similarity of these R2 RNA binding motifs to those of telomerase and group II introns is discussed.
Collapse
Affiliation(s)
| | - Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| |
Collapse
|
4
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
5
|
Han JS. Non-long terminal repeat (non-LTR) retrotransposons: mechanisms, recent developments, and unanswered questions. Mob DNA 2010; 1:15. [PMID: 20462415 PMCID: PMC2881922 DOI: 10.1186/1759-8753-1-15] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2010] [Accepted: 05/12/2010] [Indexed: 12/22/2022] Open
Abstract
Non-long terminal repeat (non-LTR) retrotransposons are present in most eukaryotic genomes. In some species, such as humans, these elements are the most abundant genome sequence and continue to replicate to this day, creating a source of endogenous mutations and potential genotoxic stress. This review will provide a general outline of the replicative cycle of non-LTR retrotransposons. Recent findings regarding the host regulation of non-LTR retrotransposons will be summarized. Finally, future directions of interest will be discussed.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Embryology, Carnegie Institution of Washington, Baltimore, MD, USA.
| |
Collapse
|
6
|
Gu SQ, Cui X, Mou S, Mohr S, Yao J, Lambowitz AM. Genetic identification of potential RNA-binding regions in a group II intron-encoded reverse transcriptase. RNA (NEW YORK, N.Y.) 2010; 16:732-747. [PMID: 20179150 PMCID: PMC2844621 DOI: 10.1261/rna.2007310] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Accepted: 12/22/2009] [Indexed: 05/28/2023]
Abstract
Mobile group II introns encode a reverse transcriptase that binds the intron RNA to promote RNA splicing and intron mobility, the latter via reverse splicing of the excised intron into DNA sites, followed by reverse transcription. Previous work showed that the Lactococcus lactis Ll.LtrB intron reverse transcriptase, denoted LtrA protein, binds with high affinity to DIVa, a stem-loop structure at the beginning of the LtrA open reading frame and makes additional contacts with intron core regions that stabilize the active RNA structure for forward and reverse splicing. LtrA's binding to DIVa down-regulates its translation and is critical for initiation of reverse transcription. Here, by using high-throughput unigenic evolution analysis with a genetic assay in which LtrA binding to DIVa down-regulates translation of GFP, we identified regions at LtrA's N terminus that are required for DIVa binding. Then, by similar analysis with a reciprocal genetic assay, we confirmed that residual splicing of a mutant intron lacking DIVa does not require these N-terminal regions, but does require other reverse transcriptase (RT) and X/thumb domain regions that bind the intron core. We also show that N-terminal fragments of LtrA by themselves bind specifically to DIVa in vivo and in vitro. Our results suggest a model in which the N terminus of nascent LtrA binds DIVa of the intron RNA that encoded it and nucleates further interactions with core regions that promote RNP assembly for RNA splicing and intron mobility. Features of this model may be relevant to evolutionarily related non-long-terminal-repeat (non-LTR)-retrotransposon RTs.
Collapse
Affiliation(s)
- Shan-Qing Gu
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas 78712-0159, USA
| | | | | | | | | | | |
Collapse
|
7
|
Christensen SM, Eickbush TH. R2 target-primed reverse transcription: ordered cleavage and polymerization steps by protein subunits asymmetrically bound to the target DNA. Mol Cell Biol 2005; 25:6617-28. [PMID: 16024797 PMCID: PMC1190342 DOI: 10.1128/mcb.25.15.6617-6628.2005] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
R2 elements are non-long terminal repeat retrotransposons that specifically insert into 28S rRNA genes of many animal groups. These elements encode a single protein with reverse transcriptase and endonuclease activities as well as specific DNA and RNA binding properties. In this report, gel shift experiments were conducted to investigate the stoichiometry of the DNA, RNA, and protein components of the integration reaction. The enzymatic functions associated with each of the protein complexes were also determined, and DNase I digests were used to footprint the protein onto the target DNA. Additionally, a short polypeptide containing the N-terminal putative DNA-binding motifs was footprinted on the DNA target site. These combined findings revealed that one protein subunit binds the R2 RNA template and the DNA 10 to 40 bp upstream of the insertion site. This subunit cleaves the first DNA strand and uses that cleavage to prime reverse transcription of the R2 RNA transcript. Another protein subunit(s) uses the N-terminal DNA binding motifs to bind to the 18 bp of target DNA downstream of the insertion site and is responsible for cleavage of the second DNA strand. A complete model for the R2 integration reaction is presented, which with minor modifications is adaptable to other non-LTR retrotransposons.
Collapse
|
8
|
Kajikawa M, Ichiyanagi K, Tanaka N, Okada N. Isolation and Characterization of Active LINE and SINEs from the Eel. Mol Biol Evol 2004; 22:673-82. [PMID: 15548748 DOI: 10.1093/molbev/msi054] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Long interspersed elements (LINEs) and short interspersed elements (SINEs) are retrotransposons. These elements can mobilize by the "copy-and-paste" mechanism, in which their own RNA is reverse-transcribed into complementary DNA (cDNA). LINEs and SINEs not only are components of eukaryotic genomes but also drivers of genomic evolution. Thus, studies of the amplification mechanism of LINEs and SINEs are important for understanding eukaryotic genome evolution. Here we report the characterization of one LINE family (UnaL2) and two SINE families (UnaSINE1 and UnaSINE2) from the eel (Anguilla japonica) genome. UnaL2 is approximately 3.6 kilobases (kb) and encodes only one open reading frame (ORF). UnaL2 belongs to the stringent type--thought to be a major group of LINEs--and can mobilize in HeLa cells. We also show that UnaL2 and the two UnaSINEs have similar 3' tails, and that both UnaSINE1 and UnaSINE2 can be mobilized by UnaL2 in HeLa cells. These elements are thus useful for delineating the amplification mechanism of stringent type LINEs as well as that of SINEs.
Collapse
Affiliation(s)
- Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | |
Collapse
|
9
|
Ruschak AM, Mathews DH, Bibillo A, Spinelli SL, Childs JL, Eickbush TH, Turner DH. Secondary structure models of the 3' untranslated regions of diverse R2 RNAs. RNA (NEW YORK, N.Y.) 2004; 10:978-87. [PMID: 15146081 PMCID: PMC1370589 DOI: 10.1261/rna.5216204] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2003] [Accepted: 03/10/2004] [Indexed: 05/19/2023]
Abstract
The RNA structure of the 3' untranslated region (UTR) of the R2 retrotransposable element is recognized by the R2-encoded reverse transcriptase in a reaction called target primed reverse transcription (TPRT). To provide insight into structure-function relationships important for TPRT, we have created alignments that reveal the secondary structure for 22 Drosophila and five silkmoth 3' UTR R2 sequences. In addition, free energy minimization has been used to predict the secondary structure for the 3' UTR R2 RNA of Forficula auricularia. The predicted structures for Bombyx mori and F. auricularia are consistent with chemical modification data obtained with beta-ethoxy-alpha-ketobutyraldehyde (kethoxal), dimethyl sulfate, and 1-cyclohexyl-3-(2-morpholinoethyl)carbodiimide metho-p-toluene sulfonate. The structures appear to have common helices that are likely important for function.
Collapse
Affiliation(s)
- Amy M Ruschak
- Department of Chemistry, University of Rochester, Rochester, New York 14627-0216, USA
| | | | | | | | | | | | | |
Collapse
|
10
|
Christensen S, Eickbush TH. Footprint of the retrotransposon R2Bm protein on its target site before and after cleavage. J Mol Biol 2004; 336:1035-45. [PMID: 15037067 DOI: 10.1016/j.jmb.2003.12.077] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2003] [Revised: 12/23/2003] [Accepted: 12/24/2003] [Indexed: 10/26/2022]
Abstract
R2 elements are non-long terminal repeat (non-LTR) retrotransposons that specifically integrate into the 28 S rRNA genes of their host. These elements encode a single open reading frame with a genome-specific endonuclease and a reverse transcriptase that uses the cleaved chromosomal target site to prime reverse transcription. Cleavage of the DNA strand that is used to prime reverse transcription is an efficient process that occurs in the presence or absence of RNA. Cleavage of the second DNA strand is much less efficient and requires RNA. Reverse transcription occurs before second strand cleavage and only if the RNA bound to the protein contains the 3' untranslated region of the R2 element. Thus a complex series of protein interactions with the DNA and conformational changes in the protein are likely to occur during this retrotransposition reaction. Here, we conduct electrophoretic mobility-shift assays and DNase I footprint studies on the binding of the R2 protein to the DNA target in the presence and absence of RNA both before and after first strand cleavage. While the total expanse of the protein footprint on the DNA eventually covers five helical turns, before cleavage the footprint only extends from 17 bp to 40 bp upstream of the cleavage site. This footprint is the same in the presence and absence of RNA. We hypothesize that the active site of the endonuclease domain is analogous to type IIS restriction enzymes in that it is located on a flexible domain that is not tightly bound to the cleavage site. After first strand cleavage the protein footprint extends beyond the cleavage site. We suggest that this increased protection after cleavage is the RT domain that is positioned over the free DNA end to begin reverse transcription on the nicked DNA substrate.
Collapse
Affiliation(s)
- Shawn Christensen
- Department of Biology, University of Rochester, Hutchinson Hall 334, Rochester, NY 14627-0211, USA
| | | |
Collapse
|
11
|
Fujimoto H, Hirukawa Y, Tani H, Matsuura Y, Hashido K, Tsuchida K, Takada N, Kobayashi M, Maekawa H. Integration of the 5' end of the retrotransposon, R2Bm, can be complemented by homologous recombination. Nucleic Acids Res 2004; 32:1555-65. [PMID: 14999096 PMCID: PMC390292 DOI: 10.1093/nar/gkh304] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
R2Bm is a non-long-terminal-repeat (non-LTR) retrotransposon that was identified at a specific target site in the 28S rRNA genes of the silkworm, Bombyx mori. Although in vitro analysis has revealed that the 3' end of R2Bm is integrated into the target site by means of target-primed reverse transcription (TPRT), the mechanism of the 5' end integration is not well understood. We established a novel in vivo system to assay the insertion mechanism of R2Bm using a cultured cell line, C65, and a baculovirus, AcNPV, as host and vector, respectively. The 3' end of R2Bm integrated at the target site in the rRNA genes of C65 cells when an AcNPV containing both the full-length 3' UTR and the entire open reading frame (ORF) of R2Bm was introduced while the 5' end integration was incorrect. The 5' end of R2Bm was integrated, however, when the 28S gene sequence upstream of the R2Bm target site was added to the R2Bm sequence. Thus, in our assay, homologous sequences were likely essential for the successful integration of the entire R2Bm into the host cell genome. We also demonstrated that the failure to integrate caused by a frame-shifted ORF was rescued by co-infection with a helper virus that contained only the R2Bm ORF. This indicates that R2 retrotransposition can be complemented in trans. These findings suggest that the host's mechanism for DNA repair may be necessary for the integration of the 5' end of R2Bm and that R2Bm protein may only have the ability to integrate the 3' end of the element by TPRT.
Collapse
Affiliation(s)
- Hirofumi Fujimoto
- Division of Radiological Protection and Biology, National Institute of Infectious Diseases, Toyama 1-23-1, Shinjuku-ku, Tokyo 162-8640, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Bibillo A, Eickbush TH. End-to-end template jumping by the reverse transcriptase encoded by the R2 retrotransposon. J Biol Chem 2004; 279:14945-53. [PMID: 14752111 DOI: 10.1074/jbc.m310450200] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The reverse transcriptase encoded by the non-long terminal repeat retrotransposon R2 has been shown to be able to jump from the 5'-end of one RNA template (the donor) to the 3'-end of a second RNA template (the acceptor) in the absence of preexisting sequence identity between the two templates. These jumps between RNA templates have similarity to the end-to-end template jumps described for the RNA-directed RNA polymerases encoded by certain RNA viruses. Here we describe for the first time the mechanism by which such end-to-end template jumps can occur. Most template jumps by the R2 reverse transcriptase are brought about by the enzyme's ability to add nontemplated (overhanging) nucleotides to the cDNA when it reaches the end of the donor RNA. The enzyme then anneals these overhanging nucleotides to sequences at the 3'-end of the acceptor RNA. The annealing is most efficient if it involves the terminal nucleotide(s) of the acceptor RNA but can occur to sites at least 5 nucleotides from the 3'-end. These end-to-end jumps are similar to steps proposed to be part of the integration reaction of non-long terminal repeat retrotransposons and can explain chimeric integration products derived from multiple RNA templates.
Collapse
Affiliation(s)
- Arkadiusz Bibillo
- Department of Biology, University of Rochester, Rochester, New York 14627, USA.
| | | |
Collapse
|
13
|
Ye J, Yang Z, Hayes JJ, Eickbush TH. R2 retrotransposition on assembled nucleosomes depends on the translational position of the target site. EMBO J 2002; 21:6853-64. [PMID: 12486006 PMCID: PMC139086 DOI: 10.1093/emboj/cdf665] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
R2 retrotransposons insert into the 28S rRNA genes of insects. Integration occurs by specific cleavage of the target site and utilization of the released DNA end to prime reverse transcription of the RNA transcript. Specificity of the protein to the target site is dependent upon nucleotide sequence recognition extending from 35 bp upstream to 15 bp downstream of the cleavage site. In this report, we show that sequence recognition and cleavage by the R2 protein can occur while the target site is assembled into nucleosomes. Reconstitution of DNA fragments containing the 28S gene sequence into a set of nucleosomes with different translational frames revealed that the R2 site adopted the same rotational orientation with respect to the histone octamer. Binding and cleavage by the R2 protein were most efficient when the upstream binding site for the R2 protein was near a nucleosome end. Interaction of the R2 protein with the nucleosome disrupted the histone:DNA contacts in the 50 bp region directly bound by R2, but did not modify the remainder of the nucleosome structure.
Collapse
Affiliation(s)
| | - Zungyoon Yang
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| | - Jeffrey J. Hayes
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| | - Thomas H. Eickbush
- University of Rochester, Department of Biology, Rochester, NY 14627 and
University of Rochester Medical Center, Department of Biochemistry and Biophysics, Rochester, NY 14642, USA Corresponding author e-mail:
| |
Collapse
|
14
|
Bibillo A, Eickbush TH. High processivity of the reverse transcriptase from a non-long terminal repeat retrotransposon. J Biol Chem 2002; 277:34836-45. [PMID: 12101182 DOI: 10.1074/jbc.m204345200] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
R2 is a retrotransposable element that specifically inserts into the 28 S rRNA genes of arthropods. The element encodes a single protein with endonuclease activity that cleaves the 28 S gene target site and reverse transcriptase (RT) activity that uses the cleaved DNA to prime reverse transcription. Here we compare various properties of the R2 RT activity with those of the well characterized retroviral RT, avian myeloblastosis virus (AMV). In processivity assays using heterogeneous RNA templates, R2 RT can synthesize cDNA over twice the length of that synthesized by AMV RT and can synthesize cDNA over 4 times longer than AMV RT in assays with poly(rA) templates. The higher processivity of R2 RT compared with retroviral RTs is a result of the slower rate of dissociation of the enzyme from RNA templates. The elongation rates of the two enzymes are similar. Finally, a highly distinct property of the R2 RT, compared with retroviral enzymes, is its ability to displace RNA strands annealed to RNA templates during cDNA synthesis. We suggest that both the higher processivity and displacement properties of R2 RT compared with retroviral RT result from the greater affinity of the R2 protein for the RNA template upstream of its active site.
Collapse
Affiliation(s)
- Arkadiusz Bibillo
- Department of Biology, University of Rochester, Rochester, New York 14627-0211, USA
| | | |
Collapse
|
15
|
Chambeyron S, Bucheton A, Busseau I. Tandem UAA repeats at the 3'-end of the transcript are essential for the precise initiation of reverse transcription of the I factor in Drosophila melanogaster. J Biol Chem 2002; 277:17877-82. [PMID: 11882661 DOI: 10.1074/jbc.m200996200] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Non-long terminal repeat retrotransposons, widespread among eukaryotic genomes, transpose by reverse transcription of an RNA intermediate. Some of them, like L1 in the human, terminate at the 3'-end with a poly(dA) stretch whereas others, like the I factor in Drosophila melanogaster, have instead a short sequence repeated in tandem. This suggests different requirements for the initiation of reverse transcription. Here, we have used an RNA circularization/reverse transcription-PCR technique to analyze the 5'- and 3'-ends of the full-length transcripts produced by the I factor at the time of active retrotransposition. These transcripts are capped and polyadenylated similar to conventional messenger RNAs. We have analyzed the 3'-ends of transcripts and transposed copies produced by I elements mutated at the 3'-ends. Transcripts devoid of tandem UAA repeats, although capable of building the components of the retrotransposition machinery, are inefficiently used as retrotransposition intermediates. Such transcripts produce rare new integrated copies issued from the inaccurate initiation of reverse transcription near the 3'-end of the element. The tandem UAA repeats at the 3'-end of the transcripts of I are required for the efficient and precise initiation of reverse transcription. This strong specificity of the I factor reverse transcriptase for its own transcript has implications for the impact of I factor retrotransposition on the host genome.
Collapse
Affiliation(s)
- Séverine Chambeyron
- Institut de Génétique Humaine, CNRS, 141 Rue de la Cardonille, 34396 Montpellier Cedex 5, France
| | | | | |
Collapse
|
16
|
Bibiłło A, Eickbush TH. The reverse transcriptase of the R2 non-LTR retrotransposon: continuous synthesis of cDNA on non-continuous RNA templates. J Mol Biol 2002; 316:459-73. [PMID: 11866511 DOI: 10.1006/jmbi.2001.5369] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
R2 is a non-long terminal repeat (non-LTR) retrotransposon that inserts into the 28 S rRNA genes of arthropods. The element encodes two enzymatic activities: an endonuclease that specifically cleaves the 28 S gene target site, and a reverse transcriptase (RT) that can use the 3' end of the cleaved DNA to prime reverse transcription. R2 RT only utilizes RNA templates that contain the 3' untranslated region of the R2 element as templates in this target primed reverse transcription (TPRT) reaction. Here, detailed biochemical characterization of the R2 RT indicates that the enzyme is capable of making multiple, consecutive jumps between RNA templates. The terminal 3' nucleotide of the "acceptor" RNA and the 5' nucleotide of the "donor" RNA are frequently reverse transcribed in these jumps, indicating that the acceptor RNA does not anneal to the cDNA derived from the donor RNA template. These template jumps occur during TPRT as well as in non-specific extension reactions in which reverse transcription is primed by an oligonucleotide annealed to the RNA template. Analysis of these RT assays done in the absence of the target DNA also revealed that the R2 RT can initiate reverse transcription near the 3' end of any RNA molecule using the 3' end of a second RNA molecule as primer. Again there is no requirement for sequence complementarity between the RNA used as template and the RNA used as primer. These properties of the R2 RT differ substantially from those of retroviral RTs but have similarities to the RT of the Mauriceville retroplasmid of Neurospora crassa. We present a model which relates these unusual properties of the R2 RT to structural differences from retroviral RTs as well as correlates these properties to the likely retrotransposition mechanism of R2 and other non-LTR retrotransposons.
Collapse
Affiliation(s)
- Arkadiusz Bibiłło
- Department of Biology, University of Rochester, Rochester, NY 4627-0211, USA
| | | |
Collapse
|
17
|
Beattie TL, Zhou W, Robinson MO, Harrington L. Functional multimerization of the human telomerase reverse transcriptase. Mol Cell Biol 2001; 21:6151-60. [PMID: 11509658 PMCID: PMC87332 DOI: 10.1128/mcb.21.18.6151-6160.2001] [Citation(s) in RCA: 109] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2001] [Accepted: 07/02/2001] [Indexed: 01/17/2023] Open
Abstract
The telomerase enzyme exists as a large complex (approximately 1,000 kDa) in mammals and at minimum is composed of the telomerase RNA and the catalytic subunit telomerase reverse transcriptase (TERT). In Saccharomyces cerevisiae, telomerase appears to function as an interdependent dimer or multimer in vivo (J. Prescott and E. H. Blackburn, Genes Dev. 11:2790-2800, 1997). However, the requirements for multimerization are not known, and it remained unclear whether telomerase exists as a multimer in other organisms. We show here that human TERT (hTERT) forms a functional multimer in a rabbit reticulocyte lysate reconstitution assay and in human cell extracts. Two separate, catalytically inactive TERT proteins can complement each other in trans to reconstitute catalytic activity. This complementation requires the amino terminus of one hTERT and the reverse transcriptase and C-terminal domains of the second hTERT. The telomerase RNA must associate with only the latter hTERT for reconstitution of telomerase activity to occur. Multimerization of telomerase also facilitates the recognition and elongation of substrates in vitro and in vivo. These data suggest that the catalytic core of human telomerase may exist as a functionally cooperative dimer or multimer in vivo.
Collapse
Affiliation(s)
- T L Beattie
- Ontario Cancer Institute/Amgen Institute, Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.
| | | | | | | |
Collapse
|
18
|
Kubo Y, Okazaki S, Anzai T, Fujiwara H. Structural and phylogenetic analysis of TRAS, telomeric repeat-specific non-LTR retrotransposon families in Lepidopteran insects. Mol Biol Evol 2001; 18:848-57. [PMID: 11319268 DOI: 10.1093/oxfordjournals.molbev.a003866] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
TRAS1 is a non-LTR retrotransposon inserted specifically into the telomeric repeat (TTAGG)(n) in the silkworm, Bombyx mori. To characterize the evolutionary origin of TRAS-like elements, we identified seven TRAS families (TRAS3, TRAS4, TRAS5, TRAS6, TRASY, TRASZ, and TRASW) from B. mori and four elements from two Lepidoptera, Dictyoploca japonica (TRASDJ) and Samia cynthia ricini (TRASSC3, TRASSC4, and TRASSC9). More than 2,000 copies of various Bombyx TRAS elements accumulated within (TTAGG)(n) sequences as unusual but orderly tandem repeats. The 5' and 3' regions were highly conserved within each class of Bombyx TRAS elements without truncation. This suggests that distinct classes of TRAS have been maintained independently by retrotransposition into (TTAGG)(n). The phylogenetic tree of site-specific retroelements showed that nine TRAS families in Lepidoptera constitute a single phylogenetic group that is closely related to the R1 family that inserts specifically into arthropod 28S rDNA. The higher amino acid sequence identity from endonuclease (EN) to reverse transcriptase (RT) domains between TRAS groups (about 37%-70%) than among TRAS elements and R1Bm (about 25%-30%), may reflect the presence of some DNA structure responsible for their target specificity. Sequence comparison from EN to RT domains among non-LTR elements revealed several regions conserved only within TRAS elements. We found a highly conserved region that resembles the Myb-like DNA-binding structure, between the EN and RT domains. These regions may be involved in site-specific integration of TRAS elements into the (TTAGG)(n) telomeric repeats.
Collapse
Affiliation(s)
- Y Kubo
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | | | | | | |
Collapse
|
19
|
Anzai T, Takahashi H, Fujiwara H. Sequence-specific recognition and cleavage of telomeric repeat (TTAGG)(n) by endonuclease of non-long terminal repeat retrotransposon TRAS1. Mol Cell Biol 2001; 21:100-8. [PMID: 11113185 PMCID: PMC88784 DOI: 10.1128/mcb.21.1.100-108.2001] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The telomere of the silkworm Bombyx mori consists of (TTAGG/CCTAA)(n) repeats and harbors a large number of telomeric repeat-specific non-long terminal repeat retrotransposons, such as TRAS1 and SART1. To understand how these retrotransposons recognize and integrate into the telomeric repeat in a sequence-specific manner, we expressed the apurinic-apryrimidinic endonuclease-like endonuclease domain of TRAS1 (TRAS1 EN), which is supposed to digest the target DNA, and characterized its enzymatic properties. Purified TRAS1 EN could generate specific nicks on both strands of the telomeric repeat sequence between T and A of the (TTAGG)(n) strand (bottom strand) and between C and T of the (CCTAA)(n) strand (top strand). These sites are consistent with insertion sites expected from the genomic structure of boundary regions of TRAS1. Time course studies of nicking activities on both strands revealed that the cleavages on the bottom strand preceded those on the top strand, supporting the target-primed reverse transcription model. TRAS1 EN could cleave the telomeric repeats specifically even if it was flanked by longer tracts of nontelomeric sequence, indicating that the target site specificity of the TRAS1 element was mainly determined by its EN domain. Based on mutation analyses, TRAS1 EN recognizes less than 10 bp around the initial cleavage site (upstream 7 bp and downstream 3 bp), and the GTTAG sequence especially is essential for the cleavage reaction on the bottom strand (5'. TTAGGTT downward arrow AGG. 3'). TRAS1 EN, the first identified endonuclease digesting telomeric repeats, may be used as a genetic tool to shorten the telomere in insects and some other organisms.
Collapse
Affiliation(s)
- T Anzai
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | | | | |
Collapse
|
20
|
Furano AV. The biological properties and evolutionary dynamics of mammalian LINE-1 retrotransposons. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 2000; 64:255-94. [PMID: 10697412 DOI: 10.1016/s0079-6603(00)64007-2] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Mammalian LINE-1 (L1) elements belong to the superfamily of autonomously replicating retrotransposable elements that lack the long terminal repeated (LTR) sequences typical of retroviruses and retroviral-like retrotransposons. The non-LTR superfamily is very ancient and L1-like elements are ubiquitous in nature, having been found in plants, fungi, invertebrates, and various vertebrate classes from fish to mammals. L1 elements have been replicating and evolving in mammals for at least the past 100 million years and now constitute 20% or more of some mammalian genomes. Therefore, L1 elements presumably have had a profound, perhaps defining, effect on the evolution, structure, and function of mammalian genomes. L1 elements contain regulatory signals and encode two proteins: one is an RNA-binding protein and the second one presumably functions as an integrase-replicase, because it has both endonuclease and reverse transcriptase activities. This work reviews the structure and biological properties of L1 elements, including their regulation, replication, evolution, and interaction with their mammalian hosts. Although each of these processes is incompletely understood, what is known indicates that they represent challenging and fascinating biological phenomena, the resolution of which will be essential for fully understanding the biology of mammals.
Collapse
Affiliation(s)
- A V Furano
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
21
|
Christensen S, Pont-Kingdon G, Carroll D. Target specificity of the endonuclease from the Xenopus laevis non-long terminal repeat retrotransposon, Tx1L. Mol Cell Biol 2000; 20:1219-26. [PMID: 10648607 PMCID: PMC85248 DOI: 10.1128/mcb.20.4.1219-1226.2000] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Elements of the Tx1L family are non-long terminal repeat retrotransposons (NLRs) that are dispersed in the genome of Xenopus laevis. Essentially all genomic copies of Tx1L are found inserted at a specific site within another family of transposable elements (Tx1D). This suggests that Tx1L is a site-specific retrotransposon. Like many (but not all) other NLRs, the Xenopus element encodes an apparent endonuclease that is related in sequence to the apurinic-apyrimidinic endonucleases that participate in DNA repair. This enzyme is thought to introduce the single-strand break in target DNA that initiates transposition by the target-primed reverse transcription (TPRT) mechanism. To explore the issue of target specificity more fully, we expressed the polypeptide encoded by the endonuclease domain of open reading frame 2 from Tx1L (Tx1L EN) and characterized its cleavage capabilities. This endonuclease makes a specific nick in the bottom strand precisely at one end of the presumed Tx1L target duplication. Because this activity leaves a 5'-phosphate and 3'-hydroxyl at the nick, it has the location and chemistry required to initiate new insertion events by TPRT. Tx1L EN does not make a specific cut at a preferred target site for Tx1D elements, ruling out the alternative possibility that the composite Tx1L-Tx1D element moves as a unit under the control of functions encoded by Tx1L. Further characterization revealed that the endonuclease remains active for many hours at room temperature and that it is capable of enzymatic turnover. Scanning substitution mutagenesis located the recognition site for Tx1L EN within 10 bp surrounding the primary nick site. Implications of these features for natural transposition events are discussed.
Collapse
Affiliation(s)
- S Christensen
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, Utah 84132, USA
| | | | | |
Collapse
|
22
|
Eickbush DG, Luan DD, Eickbush TH. Integration of Bombyx mori R2 sequences into the 28S ribosomal RNA genes of Drosophila melanogaster. Mol Cell Biol 2000; 20:213-23. [PMID: 10594024 PMCID: PMC85077 DOI: 10.1128/mcb.20.1.213-223.2000] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
R2 non-long-terminal-repeat retrotransposable elements integrate into a precise location in the 28S rRNA genes of arthropods. The purified protein encoded by R2 can cleave the 28S gene target site and use the 3' hydroxyl group generated by this cleavage to prime reverse transcription of its own RNA, a process called target-primed reverse transcription. An integration system is described here in which components from the R2 element of the silkmoth, Bombyx mori, are injected into the preblastoderm embryo of Drosophila melanogaster. Silkmoth R2 sequences were readily detected in the 28S rRNA genes of the surviving adults as well as in the genes of their progeny. The 3' junctions of these insertions were similar to those seen in our in vitro assays, as well as those from endogenous R2 retrotransposition events. The 5' junctions of the insertions originally contained major deletions of both R2 and 28S gene sequences, a problem overcome by the inclusion of upstream 28S gene sequences at the 5' end of the injected RNA. The resulting 5' junctions suggested a recombination event between the cDNA and the upstream target sequences. This in vivo integration system should help determine the mechanism of R2 retrotransposition and be useful as a delivery system to integrate defined DNA sequences into the rRNA genes of organisms.
Collapse
Affiliation(s)
- D G Eickbush
- Department of Biology, University of Rochester, Rochester, New York 14627-0211, USA
| | | | | |
Collapse
|
23
|
Klobutcher LA. Characterization of in vivo developmental chromosome fragmentation intermediates in E. crassus. Mol Cell 1999; 4:695-704. [PMID: 10619017 DOI: 10.1016/s1097-2765(00)80380-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Ligation-mediated PCR was used to characterize intermediates in the fragmentation/de novo telomere addition process that occurs during sexual reproduction in the ciliate E. crassus. Fragmentation generates ends with 6-base, 3' overhangs that have 5'-phosphate and 3'-hydroxyl groups. These intermediates are detected only during the period of chromosome fragmentation. Fragmentation always occurs at a precise distance from a conserved sequence, the E-Cbs, indicating that it is a key cis-acting element in the process. The results also serve to identify the natural substrate for de novo telomere addition and indicate that telomerase recognizes, and compensates for, partial telomeric repeats at the ends of fragmentation intermediates. Similarities of the Euplotes fragmentation/telomere addition process to the movement of some non-long terminal repeat retrotransposons are discussed.
Collapse
MESH Headings
- Animals
- Base Sequence
- Cell Nucleus/enzymology
- Cell Nucleus/genetics
- Chromosome Breakage/genetics
- Chromosomes/chemistry
- Chromosomes/genetics
- Chromosomes/metabolism
- Conserved Sequence/genetics
- DNA Ligases/metabolism
- DNA, Protozoan/chemistry
- DNA, Protozoan/genetics
- DNA, Protozoan/metabolism
- DNA, Single-Stranded/chemistry
- DNA, Single-Stranded/genetics
- DNA, Single-Stranded/metabolism
- Euplotes/cytology
- Euplotes/enzymology
- Euplotes/genetics
- Euplotes/growth & development
- Evolution, Molecular
- Micronucleus, Germline/enzymology
- Micronucleus, Germline/genetics
- Models, Genetic
- Polymerase Chain Reaction/methods
- Regulatory Sequences, Nucleic Acid/genetics
- Reproduction
- Retroelements/genetics
- Telomerase/metabolism
- Telomere/chemistry
- Telomere/genetics
- Telomere/metabolism
Collapse
Affiliation(s)
- L A Klobutcher
- Department of Biochemistry, University of Connecticut Health Center, Farmington 06032, USA.
| |
Collapse
|
24
|
Yang J, Malik HS, Eickbush TH. Identification of the endonuclease domain encoded by R2 and other site-specific, non-long terminal repeat retrotransposable elements. Proc Natl Acad Sci U S A 1999; 96:7847-52. [PMID: 10393910 PMCID: PMC22150 DOI: 10.1073/pnas.96.14.7847] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The non-long terminal repeat (LTR) retrotransposon, R2, encodes a sequence-specific endonuclease responsible for its insertion at a unique site in the 28S rRNA genes of arthropods. Although most non-LTR retrotransposons encode an apurinic-like endonuclease upstream of a common reverse transcriptase domain, R2 and many other site-specific non-LTR elements do not (CRE1 and 2, SLACS, CZAR, Dong, R4). Sequence comparison of these site-specific elements has revealed that the region downstream of their reverse transcriptase domain is conserved and shares sequence features with various prokaryotic restriction endonucleases. In particular, these non-LTR elements have a Lys/Arg-Pro-Asp-X12-14aa-Asp/Glu motif known to lie near the scissile phosphodiester bonds in the protein-DNA complexes of restriction enzymes. Site-directed mutagenesis of the R2 protein was used to provide evidence that this motif is also part of the active site of the endonuclease encoded by this element. Mutations of this motif eliminate both DNA-cleavage activities of the R2 protein: first-strand cleavage in which the exposed 3' end is used to prime reverse transcription of the RNA template and second-strand cleavage, which occurs after reverse transcription. The general organization of the R2 protein appears similar to the type IIS restriction enzyme, FokI, in which specific DNA binding is controlled by a separate domain located amino terminal to the cleavage domain. Previous phylogenetic analysis of their reverse transcriptase domains has indicated that the non-LTR elements identified here as containing restriction-like endonucleases are the oldest lineages of non-LTR elements, suggesting a scenario for the evolution of non-LTR elements.
Collapse
Affiliation(s)
- J Yang
- Department of Biology, University of Rochester, Rochester, NY 14627-0211, USA
| | | | | |
Collapse
|
25
|
Abstract
A mobile bacterial group II intron can integrate into DNA by the reverse splicing into a target site of its RNA transcript, which then acts as a template for DNA synthesis by an encoded reverse transcriptase. Mobility does not require homologous recombination, which has important practical and evolutionary implications.
Collapse
Affiliation(s)
- T H Eickbush
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| |
Collapse
|