1
|
Abstract
Although most of non-long terminal repeat (non-LTR) retrotransposons are incorporated in the host genome almost randomly, some non-LTR retrotransposons are incorporated into specific sequences within a target site. On the basis of structural and phylogenetic features, non-LTR retrotransposons are classified into two large groups, restriction enzyme-like endonuclease (RLE)-encoding elements and apurinic/apyrimidinic endonuclease (APE)-encoding elements. All clades of RLE-encoding non-LTR retrotransposons include site-specific elements. However, only two of more than 20 APE-encoding clades, Tx1 and R1, contain site-specific non-LTR elements. Site-specific non-LTR retrotransposons usually target within multi-copy RNA genes, such as rRNA gene (rDNA) clusters, or repetitive genomic sequences, such as telomeric repeats; this behavior may be a symbiotic strategy to reduce the damage to the host genome. Site- and sequence-specificity are variable even among closely related non-LTR elements and appeared to have changed during evolution. In the APE-encoding elements, the primary determinant of the sequence- specific integration is APE itself, which nicks one strand of the target DNA during the initiation of target primed reverse transcription (TPRT). However, other factors, such as interaction between mRNA and the target DNA, and access to the target region in the nuclei also affect the sequence-specificity. In contrast, in the RLE-encoding elements, DNA-binding motifs appear to affect their sequence-specificity, rather than the RLE domain itself. Highly specific integration properties of these site-specific non-LTR elements make them ideal alternative tools for sequence-specific gene delivery, particularly for therapeutic purposes in human diseases.
Collapse
|
2
|
Abstract
The piggyBac transposon was originally isolated from the cabbage looper moth, Trichoplusia ni, in the 1980s. Despite its early discovery and dissimilarity to the other DNA transposon families, the piggyBac transposon was not recognized as a member of a large transposon superfamily for a long time. Initially, the piggyBac transposon was thought to be a rare transposon. This view, however, has now been completely revised as a number of fully sequenced genomes have revealed the presence of piggyBac-like repetitive elements. The isolation of active copies of the piggyBac-like elements from several distinct species further supported this revision. This includes the first isolation of an active mammalian DNA transposon identified in the bat genome. To date, the piggyBac transposon has been deeply characterized and it represents a number of unique characteristics. In general, all members of the piggyBac superfamily use TTAA as their integration target sites. In addition, the piggyBac transposon shows precise excision, i.e., restoring the sequence to its preintegration state, and can transpose in a variety of organisms such as yeasts, malaria parasites, insects, mammals, and even in plants. Biochemical analysis of the chemical steps of transposition revealed that piggyBac does not require DNA synthesis during the actual transposition event. The broad host range has attracted researchers from many different fields, and the piggyBac transposon is currently the most widely used transposon system for genetic manipulations.
Collapse
|
3
|
Non-LTR retrotransposons encode noncanonical RRM domains in their first open reading frame. Proc Natl Acad Sci U S A 2009; 106:731-6. [PMID: 19139409 DOI: 10.1073/pnas.0809964106] [Citation(s) in RCA: 107] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Non-LTR retrotransposons (NLRs) are a unique class of mobile genetic elements that have significant impact on the evolution of eukaryotic genomes. However, the molecular details and functions of their encoded proteins, in particular of the accessory ORF1p proteins, are poorly understood. Here, we identify noncanonical RNA-recognition-motifs (RRMs) in several phylogenetically unrelated NLR ORF1p proteins. This provides an explanation for their RNA-binding properties and clearly shows that they are not related to the retroviral nucleocapsid protein Gag, despite the frequent presence of CCHC zinc knuckles. In particular, we characterize the ORF1p protein of the human long interspersed nuclear element 1 (LINE-1 or L1). We show that L1ORF1p is a multidomain protein, consisting of a coiled coil (cc), RRM, and C-terminal domain (CTD). Most importantly, we solved the crystal structure of the RRM domain, which is characterized by extended loops stabilized by unique salt bridges. Furthermore, we demonstrate that L1ORF1p trimerizes via its N-terminal cc domain, and we suggest that this property is functionally important for all homologues. The formation of distinct complexes with single-stranded nucleic acids requires the presence of the RRM and CTD domains on the same polypeptide chain as well as their close cooperation. Finally, the phylogenetic analysis of mammalian L1ORF1p shows an ancient origin of the RRM domain and supports a modular evolution of NLRs.
Collapse
|
4
|
Abstract
Over one-third of human genome sequence is a product of non-LTR retrotransposition. The retrotransposon that currently drives this process in humans is the highly abundant LINE-1 (L1) element. Despite the ubiquitous nature of L1's in mammals, we still lack a complete mechanistic understanding of the L1 replication cycle and how it is regulated. To generate a genetically amenable model for non-LTR retrotransposition, we have reengineered the Zorro3 retrotransposon, an L1 homolog from Candida albicans, for use in the budding yeast Saccharomyces cerevisiae. We found that S. cerevisiae, which has no endogenous L1 homologs or remnants, can still support Zorro3 retrotransposition. Analysis of Zorro3 mutants and insertion structures suggest that this is authentic L1-like retrotransposition with remarkable resemblance to mammalian L1-mediated events. This suggests that S. cerevisiae has unexpectedly retained the basal host machinery required for L1 retrotransposition. This model will also serve as a powerful system to study the cell biology of L1 elements and for the genetic identification and characterization of cellular factors involved in L1 retrotransposition.
Collapse
|
5
|
Repanas K, Zingler N, Layer LE, Schumann GG, Perrakis A, Weichenrieder O. Determinants for DNA target structure selectivity of the human LINE-1 retrotransposon endonuclease. Nucleic Acids Res 2007; 35:4914-26. [PMID: 17626046 PMCID: PMC1950540 DOI: 10.1093/nar/gkm516] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
The human LINE-1 endonuclease (L1-EN) is the targeting endonuclease encoded by the human LINE-1 (L1) retrotransposon. L1-EN guides the genomic integration of new L1 and Alu elements that presently account for ∼28% of the human genome. L1-EN bears considerable technological interest, because its target selectivity may ultimately be engineered to allow the site-specific integration of DNA into defined genomic locations. Based on the crystal structure, we generated L1-EN mutants to analyze and manipulate DNA target site recognition. Crystal structures and their dynamic and functional analysis show entire loop grafts to be feasible, resulting in altered specificity, while individual point mutations do not change the nicking pattern of L1-EN. Structural parameters of the DNA target seem more important for recognition than the nucleotide sequence, and nicking profiles on DNA oligonucleotides in vitro are less well defined than the respective integration site consensus in vivo. This suggests that additional factors other than the DNA nicking specificity of L1-EN contribute to the targeted integration of non-LTR retrotransposons.
Collapse
Affiliation(s)
- Kostas Repanas
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
| | - Nora Zingler
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
| | - Liliana E. Layer
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
| | - Gerald G. Schumann
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
| | - Anastassis Perrakis
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
| | - Oliver Weichenrieder
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands and Paul-Ehrlich-Institut, Section PR2/Retroelements, 63225 Langen, Germany
- *To whom correspondence should be addressed.+4970716011358+4970716011353
| |
Collapse
|
6
|
Anzai T, Takahashi H, Fujiwara H. Sequence-specific recognition and cleavage of telomeric repeat (TTAGG)(n) by endonuclease of non-long terminal repeat retrotransposon TRAS1. Mol Cell Biol 2001; 21:100-8. [PMID: 11113185 PMCID: PMC88784 DOI: 10.1128/mcb.21.1.100-108.2001] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The telomere of the silkworm Bombyx mori consists of (TTAGG/CCTAA)(n) repeats and harbors a large number of telomeric repeat-specific non-long terminal repeat retrotransposons, such as TRAS1 and SART1. To understand how these retrotransposons recognize and integrate into the telomeric repeat in a sequence-specific manner, we expressed the apurinic-apryrimidinic endonuclease-like endonuclease domain of TRAS1 (TRAS1 EN), which is supposed to digest the target DNA, and characterized its enzymatic properties. Purified TRAS1 EN could generate specific nicks on both strands of the telomeric repeat sequence between T and A of the (TTAGG)(n) strand (bottom strand) and between C and T of the (CCTAA)(n) strand (top strand). These sites are consistent with insertion sites expected from the genomic structure of boundary regions of TRAS1. Time course studies of nicking activities on both strands revealed that the cleavages on the bottom strand preceded those on the top strand, supporting the target-primed reverse transcription model. TRAS1 EN could cleave the telomeric repeats specifically even if it was flanked by longer tracts of nontelomeric sequence, indicating that the target site specificity of the TRAS1 element was mainly determined by its EN domain. Based on mutation analyses, TRAS1 EN recognizes less than 10 bp around the initial cleavage site (upstream 7 bp and downstream 3 bp), and the GTTAG sequence especially is essential for the cleavage reaction on the bottom strand (5'. TTAGGTT downward arrow AGG. 3'). TRAS1 EN, the first identified endonuclease digesting telomeric repeats, may be used as a genetic tool to shorten the telomere in insects and some other organisms.
Collapse
Affiliation(s)
- T Anzai
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | | | | |
Collapse
|
7
|
Christensen S, Pont-Kingdon G, Carroll D. Target specificity of the endonuclease from the Xenopus laevis non-long terminal repeat retrotransposon, Tx1L. Mol Cell Biol 2000; 20:1219-26. [PMID: 10648607 PMCID: PMC85248 DOI: 10.1128/mcb.20.4.1219-1226.2000] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Elements of the Tx1L family are non-long terminal repeat retrotransposons (NLRs) that are dispersed in the genome of Xenopus laevis. Essentially all genomic copies of Tx1L are found inserted at a specific site within another family of transposable elements (Tx1D). This suggests that Tx1L is a site-specific retrotransposon. Like many (but not all) other NLRs, the Xenopus element encodes an apparent endonuclease that is related in sequence to the apurinic-apyrimidinic endonucleases that participate in DNA repair. This enzyme is thought to introduce the single-strand break in target DNA that initiates transposition by the target-primed reverse transcription (TPRT) mechanism. To explore the issue of target specificity more fully, we expressed the polypeptide encoded by the endonuclease domain of open reading frame 2 from Tx1L (Tx1L EN) and characterized its cleavage capabilities. This endonuclease makes a specific nick in the bottom strand precisely at one end of the presumed Tx1L target duplication. Because this activity leaves a 5'-phosphate and 3'-hydroxyl at the nick, it has the location and chemistry required to initiate new insertion events by TPRT. Tx1L EN does not make a specific cut at a preferred target site for Tx1D elements, ruling out the alternative possibility that the composite Tx1L-Tx1D element moves as a unit under the control of functions encoded by Tx1L. Further characterization revealed that the endonuclease remains active for many hours at room temperature and that it is capable of enzymatic turnover. Scanning substitution mutagenesis located the recognition site for Tx1L EN within 10 bp surrounding the primary nick site. Implications of these features for natural transposition events are discussed.
Collapse
Affiliation(s)
- S Christensen
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, Utah 84132, USA
| | | | | |
Collapse
|
8
|
Yang J, Malik HS, Eickbush TH. Identification of the endonuclease domain encoded by R2 and other site-specific, non-long terminal repeat retrotransposable elements. Proc Natl Acad Sci U S A 1999; 96:7847-52. [PMID: 10393910 PMCID: PMC22150 DOI: 10.1073/pnas.96.14.7847] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The non-long terminal repeat (LTR) retrotransposon, R2, encodes a sequence-specific endonuclease responsible for its insertion at a unique site in the 28S rRNA genes of arthropods. Although most non-LTR retrotransposons encode an apurinic-like endonuclease upstream of a common reverse transcriptase domain, R2 and many other site-specific non-LTR elements do not (CRE1 and 2, SLACS, CZAR, Dong, R4). Sequence comparison of these site-specific elements has revealed that the region downstream of their reverse transcriptase domain is conserved and shares sequence features with various prokaryotic restriction endonucleases. In particular, these non-LTR elements have a Lys/Arg-Pro-Asp-X12-14aa-Asp/Glu motif known to lie near the scissile phosphodiester bonds in the protein-DNA complexes of restriction enzymes. Site-directed mutagenesis of the R2 protein was used to provide evidence that this motif is also part of the active site of the endonuclease encoded by this element. Mutations of this motif eliminate both DNA-cleavage activities of the R2 protein: first-strand cleavage in which the exposed 3' end is used to prime reverse transcription of the RNA template and second-strand cleavage, which occurs after reverse transcription. The general organization of the R2 protein appears similar to the type IIS restriction enzyme, FokI, in which specific DNA binding is controlled by a separate domain located amino terminal to the cleavage domain. Previous phylogenetic analysis of their reverse transcriptase domains has indicated that the non-LTR elements identified here as containing restriction-like endonucleases are the oldest lineages of non-LTR elements, suggesting a scenario for the evolution of non-LTR elements.
Collapse
Affiliation(s)
- J Yang
- Department of Biology, University of Rochester, Rochester, NY 14627-0211, USA
| | | | | |
Collapse
|
9
|
Feng Q, Schumann G, Boeke JD. Retrotransposon R1Bm endonuclease cleaves the target sequence. Proc Natl Acad Sci U S A 1998; 95:2083-8. [PMID: 9482842 PMCID: PMC19257 DOI: 10.1073/pnas.95.5.2083] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The R1Bm element, found in the silkworm Bombyx mori, is a member of a group of widely distributed retrotransposons that lack long terminal repeats. Some of these elements are highly sequence-specific and others, like the human L1 sequence, are less so. The majority of R1Bm elements are associated with ribosomal DNA (rDNA). R1Bm inserts into 28S rDNA at a specific sequence; after insertion it is flanked by a specific 14-bp target site duplication of the 28S rDNA. The basis for this sequence specificity is unknown. We show that R1Bm encodes an enzyme related to the endonuclease found in the human L1 retrotransposon and also to the apurinic/apyrimidinic endonucleases. We expressed and purified the enzyme from bacteria and showed that it cleaves in vitro precisely at the positions in rDNA corresponding to the boundaries of the 14-bp target site duplication. We conclude that the function of the retrotransposon endonucleases is to define and cleave target site DNA.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Base Sequence
- Bombyx/genetics
- Cloning, Organism
- DNA, Ribosomal/chemistry
- DNA, Ribosomal/metabolism
- Endonucleases/chemistry
- Endonucleases/metabolism
- Humans
- Models, Molecular
- Molecular Sequence Data
- Mutagenesis, Site-Directed
- Nucleic Acid Conformation
- Polymerase Chain Reaction
- RNA, Ribosomal, 28S/biosynthesis
- RNA, Ribosomal, 28S/genetics
- Recombinant Proteins/chemistry
- Recombinant Proteins/metabolism
- Repetitive Sequences, Nucleic Acid
- Retroelements/physiology
- Substrate Specificity
Collapse
Affiliation(s)
- Q Feng
- Department of Molecular Biology and Genetics, The Johns Hopkins University School of Medicine, 725 North Wolfe Street, Baltimore MD 21205, USA
| | | | | |
Collapse
|
10
|
Pont-Kingdon G, Chi E, Christensen S, Carroll D. Ribonucleoprotein formation by the ORF1 protein of the non-LTR retrotransposon Tx1L in Xenopus oocytes. Nucleic Acids Res 1997; 25:3088-94. [PMID: 9224609 PMCID: PMC146839 DOI: 10.1093/nar/25.15.3088] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The Tx1L elements constitute a family of site-specific non-LTR retrotransposons found in the genome of the frog Xenopus laevis . The elements have two open reading frames (ORFs) with homology to proteins of retroviruses and other retroelements. This study demonstrates an expected activity of one of the element-encoded proteins. The RNA binding properties of ORF1p, the product of the first ORF of Tx1L, were examined after expression from RNA injected into Xenopus oocytes. Using sucrose gradient sedimentation and non-denaturing gel electrophoresis, we show that ORF1p associates with RNA in cytoplasmic ribonucleoprotein (RNP) particles. Discrete RNPs are formed with well-defined mobilities. The ORF1p RNPs are distinct from endogenous RNPs that contain stored oocyte mRNAs and two specific endogenous mRNAs do not become associated with ORF1p. ORF1p appears to be capable of associating with its own mRNA and with other injected RNAs, independent of specific recognition sequences. Although nuclear localization of ORF1p was anticipated, based both on the supposed mechanism of transposition and on the presence of a potential nuclear localization signal, no significant fraction of the protein was found in the oocyte nucleus. Nonetheless, the RNA binding capability of ORF1p is consistent with the proposed model for transposition of non-LTR retrotransposons.
Collapse
Affiliation(s)
- G Pont-Kingdon
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, UT 84132, USA.
| | | | | | | |
Collapse
|
11
|
Higashiyama T, Noutoshi Y, Fujie M, Yamada T. Zepp, a LINE-like retrotransposon accumulated in the Chlorella telomeric region. EMBO J 1997; 16:3715-23. [PMID: 9218812 PMCID: PMC1169995 DOI: 10.1093/emboj/16.12.3715] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Six copies of insertion elements accumulate in the subtelomeric region immediately proximal to the telomeric repeats on Chlorella chromosome I. The elements, designated Zepps, bear the characteristic features of non-viral (LINE-like) retrotransposons, including a poly(A) tail, 5'-truncations, a retroviral reverse transcriptase-like ORF and flanking target duplications. Detailed sequence analysis of the Chlorella subtelomeric region revealed a novel mechanism of Zepp transposition; successive insertions of each Zepp element into another Zepp as a target, leaving a tandem array of their 3'-regions with poly(A) tracts facing toward the centromere. Only the most distal Zepp copy was inverted to connect its poly(A) tail with the telomeric repeats. A similar Zepp cluster but without the telomeric repeats was also found at the terminus of another Chlorella chromosome. These structures contrast with that proposed for the addition of HeT-A and TART elements to Drosophila telomeres. Expression of Zepp elements is induced by heat shock treatment. Possible roles of the subtelomeric retrotransposons in formation and maintenance of telomeres are discussed.
Collapse
Affiliation(s)
- T Higashiyama
- Faculty of Engineering, Hiroshima University, Higashi-Hiroshima, Japan
| | | | | | | |
Collapse
|
12
|
Takahashi H, Okazaki S, Fujiwara H. A new family of site-specific retrotransposons, SART1, is inserted into telomeric repeats of the silkworm, Bombyx mori. Nucleic Acids Res 1997; 25:1578-84. [PMID: 9092665 PMCID: PMC146635 DOI: 10.1093/nar/25.8.1578] [Citation(s) in RCA: 77] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The telomeres of the silkworm, Bombyx mori, consist of pentanucleotide repeats (TTAGG)n . We previously characterized the non-LTR element TRAS1, which terminates with oligo (A) in a head to tail orientation at the exact position (between A and C) of the (CCTAA) n repeats. Here we characterized another family of telomere-specific non-LTR retrotransposon named SART1. The SART1 family was inserted at another site of the (TTAGG) n in a reverse orientation from that of TRAS1. The complete unit of SART1, 6.7 kb in length with a poly (A) stretch, contains two open reading frames encoding putative gag and pol products, overlapping by 54 bp in the -1 reading frame. Most of the 600 SART1 copies in the silkworm haploid genome are completely conserved in structure without 5'truncation. All SART1 sequences analyzed were inserted at the same position (between T and A) within the (TTAGG) n repeats. Fluorescence in situ hybridization showed that many of the SART1 copies were localized in the chromosomal ends. A phylogenetic tree showed that the SART1, TRAS1 and two other site-specific elements, R1 and RT, which insert into 28S ribosomal RNA genes in insects, belong to the same group. Based on the orientation for the chromosomal insertion and structural similarities, these elements could be further classified into two subgroups, R1/TRAS1 and RT/SART1, suggesting that the target specificity of the two telomere-associated elements was changed independently.
Collapse
Affiliation(s)
- H Takahashi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo, Bunkyo-ku, Tokyo 113, Japan
| | | | | |
Collapse
|
13
|
Luan DD, Eickbush TH. Downstream 28S gene sequences on the RNA template affect the choice of primer and the accuracy of initiation by the R2 reverse transcriptase. Mol Cell Biol 1996; 16:4726-34. [PMID: 8756630 PMCID: PMC231473 DOI: 10.1128/mcb.16.9.4726] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
R2 non-long terminal repeat retrotransposable elements insert at a unique site in the 28S rRNA genes of insects. The protein encoded by the single open reading frame of R2 is capable of conducting the initial steps of its integration in vitro. The protein nicks the noncoding strand of the 28S target DNA (the strand which serves as a template for RNA synthesis) and uses the 3' hydroxyl group exposed by this nick to prime reverse transcription of the R2 RNA template. This target-primed reverse transcription (TPRT) reaction requires that the RNA template contains the 250-nucleotide 3' untranslated region of the R2 element. If this RNA template ends at the precise 3' end of the R2 element, then extra nucleotides, which we refer to as nontemplated nucleotides, are added to the target before cDNA synthesis. The presence of downstream 28S gene sequences on the RNA template reduces the total efficiency but eliminates these nontemplated additions, resulting in nearly 90% of all TPRT products reproducing the 3' junctions seen in vivo. Templates with 5 to 10 nucleotides of the 28S sequence are used most efficiently in this in vitro TPRT reaction. The requirement for downstream 28S rRNA sequences probably explains why the R2 elements of most insects differ from the majority of non-long terminal repeat retrotransposons in that they do not contain an A-rich repeat at their 3' junction with the target DNA. The presence of downstream sequences on these in vitro R2 templates also revealed that the R2 reverse transcriptase can prime cDNA synthesis by using the 3' end of another RNA molecule. This RNA-primed cDNA synthesis is not based on sequence complementarity between the RNA primer and the R2 template. The ability to use the 3' end of a noncomplementary RNA molecule has also been seen with the reverse transcriptase of the mitochondrial Mauriceville plasmid of Neurospora crassa.
Collapse
Affiliation(s)
- D D Luan
- Department of Biology, University of Rochester, New York 14627, USA
| | | |
Collapse
|
14
|
Burke WD, Müller F, Eickbush TH. R4, a non-LTR retrotransposon specific to the large subunit rRNA genes of nematodes. Nucleic Acids Res 1995; 23:4628-34. [PMID: 8524653 PMCID: PMC307436 DOI: 10.1093/nar/23.22.4628] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
A 4.7 kb sequence-specific insertion in the 26S ribosomal RNA gene of Ascaris lumbricoides, named R4, is shown to be a non-long terminal repeat (non-LTR) retrotransposable element. The R4 element inserts at a site in the large subunit rRNA gene which is midway between two other sequence-specific non-LTR retrotransposable elements, R1 and R2, found in most insect species. Based on the structure of its open reading frame and the sequence of its reverse transcriptase domain, R4 elements do not appear to be a family of R1 or R2 elements that have changed their insertion site. R4 is most similar in structure and in sequence to the element Dong, which is not specialized for insertion into rRNA units. Thus R4 represents a separate non-LTR retrotransposable element that has become specialized for insertion in the rRNA genes of its host. Using oligonucleotide primers directed to a conserved region of the reverse transcriptase encoding domain, insertions in the R4 site were also amplified from Parascaris equorum and Haemonchus contortus. Why several non-LTR retrotransposable elements have become specialized for insertion into a short (87 bp) region of the large subunit rRNA gene is discussed.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Ascaridoidea/genetics
- Ascaris lumbricoides/genetics
- Base Sequence
- Binding Sites
- Cloning, Molecular
- DNA Primers
- DNA, Helminth/genetics
- DNA, Ribosomal/genetics
- DNA-Binding Proteins/genetics
- DNA-Binding Proteins/metabolism
- Haemonchus/genetics
- Molecular Sequence Data
- Mutagenesis, Insertional
- Nematoda/genetics
- Phylogeny
- Polymerase Chain Reaction
- RNA, Helminth/biosynthesis
- RNA, Helminth/genetics
- RNA, Ribosomal/biosynthesis
- RNA, Ribosomal/genetics
- Repetitive Sequences, Nucleic Acid
- Retroelements
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Species Specificity
Collapse
Affiliation(s)
- W D Burke
- Department of Biology, University of Rochester, NY 14627, USA
| | | | | |
Collapse
|
15
|
Stanhope MJ, Tagle DA, Shivji MS, Hattori M, Sakaki Y, Slightom JL, Goodman M. Multiple L1 progenitors in prosimian primates: phylogenetic evidence from ORF1 sequences. J Mol Evol 1993; 37:179-89. [PMID: 8411207 DOI: 10.1007/bf02407354] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
One of the uncertainties regarding the evolution of L1 elements is whether there are numerous progenitor genes. We present phylogenetic evidence from ORF1 sequences of slow loris (Nycticebus coucang) and galago (Galago crassicaudatus) that there were at least two distinct progenitors, active at the same time, in the ancestor of this family of prosimian primates. A maximum parsimony analysis that included representative L1s from human, rabbit, and rodents, along with the prosimian sequences, revealed that one of the galago L1s (Gc11) grouped very strongly with the slow loris sequences. The remaining galago elements formed their own unique and strongly supported clade. An analysis of replacement and silent site changes for each link of the most parsimonious tree indicated that during the descent of the Gc11 sequence approximately two times more synonymous than nonsynonymous substitutions had occurred, implying that the Gc11 founder was functional for some time after the split of galago and slow loris. Strong purifying selection was also evident on the galago branch of the tree. These data indicate that there were two distinct and contemporaneous L1 progenitors in the lorisoid ancestor, evolving under purifying selection, that were retained as functional L1s in the galago lineage (and presumably also in the slow loris). The prosimian ORF1 sequences could be further subdivided into subfamilies. ORF1 sequences from both the galago and slow loris have a premature termination codon near the 3' end, not shared by the other mammalian sequences, that shortens the open reading frame by 288 bp. An analysis of synonymous and nonsynonymous substitutions for the 5' and 3' portions, that included intra- and inter-subfamily comparisons, as well as comparisons among the other mammalian sequences, suggested that this premature stop codon is a prosimian acquisition that has rendered the 3' portion of ORF1 in these primates noncoding.
Collapse
Affiliation(s)
- M J Stanhope
- Department of Anatomy and Cell Biology, Wayne State University, Detroit, MI 48201
| | | | | | | | | | | | | |
Collapse
|
16
|
Greene JM, Otani H, Good PJ, Dawid IB. A novel family of retrotransposon-like elements in Xenopus laevis with a transcript inducible by two growth factors. Nucleic Acids Res 1993; 21:2375-81. [PMID: 8389440 PMCID: PMC309535 DOI: 10.1093/nar/21.10.2375] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
A cDNA clone named 1A11 was isolated in a screen for genes that are activated by both mesoderm inducing factors FGF and activin in animal explants of Xenopus laevis embryos. In undisturbed embryos, 1A11 is expressed during the gastrula stage in the entire marginal zone where mesoderm originates, and later in the somites, the tailbud, and at much lower levels in lateral mesoderm. The 1A11 sequence of 4.5 Kb has a 220 bp repeat at its ends, indicative of a retrotransposon-like structure. A long open reading frame encodes a predicted protein with only short homologies to the gag and protease regions of retroviruses and retrotransposons. Multiple copies of 1A11-related sequences were found in the Xenopus genome, constituting solo LTRs (long terminal repeats) of 1267 bp, and unique region copies (i.e., sequences internal to the repeats in the cDNA). Inverted repeats of 5 bp and apparent target site duplications of 5 bp surround the sequenced solo LTR. Thus, 1A11 is a new retrotransposon-like element in Xenopus laevis.
Collapse
Affiliation(s)
- J M Greene
- Laboratory of Molecular Genetics, National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892
| | | | | | | |
Collapse
|
17
|
Severynse DM, Hutchison CA, Edgell MH. Identification of transcriptional regulatory activity within the 5' A-type monomer sequence of the mouse LINE-1 retroposon. Mamm Genome 1991; 2:41-50. [PMID: 1311970 DOI: 10.1007/bf00570439] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
LINE-1 (L1) is a retroposon found in all mammals. In the mouse, approximately 10% of L1 elements are full-length and can be grouped into two classes, A or F, based upon the type of monomer sequence repeated at the 5' end. In order to test for promoter activity in the 5' end of the A-type mouse L1 element, we cloned several different A-monomers into a promoterless chloramphenicol acetyltransferase (CAT) vector. The A-monomer constructs varied in their ability to regulate transcription of the CAT gene, exhibiting CAT activity 16-37% of that detected with the Rous sarcoma virus promoter and enhancer. A series of A-monomer deletions were tested for their ability to regulate CAT expression and gel retardation experiments were performed to identify regions of the A-monomer that may be involved in L1 transcriptional regulation. A-monomer sequences are usually found repeated 2-5 times at the 5' end of a full-length mouse L1. In the absence of long terminal repeats or an internal promoter, the tandem array of A-monomers may provide a mechanism for A-type L1 elements to generate transcripts containing transcriptional regulatory sequences.
Collapse
Affiliation(s)
- D M Severynse
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27599
| | | | | |
Collapse
|
18
|
Abstract
A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category.
Collapse
|
19
|
Abstract
A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category.
Collapse
Affiliation(s)
- N J Besansky
- Department of Biology, Yale University, New Haven, Connecticut 06511
| |
Collapse
|