1
|
Papathanos PA, Windbichler N. Redkmer: An Assembly-Free Pipeline for the Identification of Abundant and Specific X-Chromosome Target Sequences for X-Shredding by CRISPR Endonucleases. CRISPR J 2018; 1:88-98. [PMID: 30627701 PMCID: PMC6319322 DOI: 10.1089/crispr.2017.0012] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
CRISPR-based synthetic sex ratio distorters, which operate by shredding the X-chromosome during male meiosis, are promising tools for the area-wide control of harmful insect pest or disease vector species. X-shredders have been proposed as tools to suppress insect populations by biasing the sex ratio of the wild population toward males, thus reducing its natural reproductive potential. However, to build synthetic X-shredders based on CRISPR, the selection of gRNA targets, in the form of high-copy sequence repeats on the X chromosome of a given species, is difficult, since such repeats are not accurately resolved in genome assemblies and cannot be assigned to chromosomes with confidence. We have therefore developed the redkmer computational pipeline, designed to identify short and highly abundant sequence elements occurring uniquely on the X chromosome. Redkmer was designed to use as input minimally processed whole genome sequence data from males and females. We tested redkmer with short- and long-read whole genome sequence data of Anopheles gambiae, the major vector of human malaria, in which the X-shredding paradigm was originally developed. Redkmer established long reads as chromosomal proxies with excellent correlation to the genome assembly and used them to rank X-candidate kmers for their level of X-specificity and abundance. Among these, a high-confidence set of 25-mers was identified, many belonging to previously known X-chromosome repeats of Anopheles gambiae, including the ribosomal gene array and the selfish elements harbored within it. Data from a control strain, in which these repeats are shared with the Y chromosome, confirmed the elimination of these kmers during filtering. Finally, we show that redkmer output can be linked directly to gRNA selection and off-target prediction. In addition, the output of redkmer, including the prediction of chromosomal origin of single-molecule long reads and chromosome specific kmers, could also be used for the characterization of other biologically relevant sex chromosome sequences, a task that is frequently hampered by the repetitiveness of sex chromosome sequence content.
Collapse
Affiliation(s)
- Philippos Aris Papathanos
- Department of Experimental Medicine, Section of Genomics and Genetics, University of Perugia, Perugia, Italy
| | - Nikolai Windbichler
- Department of Life Sciences, Imperial College London, Sir Alexander Fleming Building, South Kensington Campus, London, United Kingdom
| |
Collapse
|
2
|
Abstract
Although most of non-long terminal repeat (non-LTR) retrotransposons are incorporated in the host genome almost randomly, some non-LTR retrotransposons are incorporated into specific sequences within a target site. On the basis of structural and phylogenetic features, non-LTR retrotransposons are classified into two large groups, restriction enzyme-like endonuclease (RLE)-encoding elements and apurinic/apyrimidinic endonuclease (APE)-encoding elements. All clades of RLE-encoding non-LTR retrotransposons include site-specific elements. However, only two of more than 20 APE-encoding clades, Tx1 and R1, contain site-specific non-LTR elements. Site-specific non-LTR retrotransposons usually target within multi-copy RNA genes, such as rRNA gene (rDNA) clusters, or repetitive genomic sequences, such as telomeric repeats; this behavior may be a symbiotic strategy to reduce the damage to the host genome. Site- and sequence-specificity are variable even among closely related non-LTR elements and appeared to have changed during evolution. In the APE-encoding elements, the primary determinant of the sequence- specific integration is APE itself, which nicks one strand of the target DNA during the initiation of target primed reverse transcription (TPRT). However, other factors, such as interaction between mRNA and the target DNA, and access to the target region in the nuclei also affect the sequence-specificity. In contrast, in the RLE-encoding elements, DNA-binding motifs appear to affect their sequence-specificity, rather than the RLE domain itself. Highly specific integration properties of these site-specific non-LTR elements make them ideal alternative tools for sequence-specific gene delivery, particularly for therapeutic purposes in human diseases.
Collapse
|
3
|
Dritsou V, Deligianni E, Dialynas E, Allen J, Poulakakis N, Louis C, Lawson D, Topalis P. Non-coding RNA gene families in the genomes of anopheline mosquitoes. BMC Genomics 2014; 15:1038. [PMID: 25432596 PMCID: PMC4300560 DOI: 10.1186/1471-2164-15-1038] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Accepted: 11/19/2014] [Indexed: 12/12/2022] Open
Abstract
Background Only a small fraction of the mosquito species of the genus Anopheles are able to transmit malaria, one of the biggest killer diseases of poverty, which is mostly prevalent in the tropics. This diversity has genetic, yet unknown, causes. In a further attempt to contribute to the elucidation of these variances, the international “Anopheles Genomes Cluster Consortium” project (a.k.a. “16 Anopheles genomes project”) was established, aiming at a comprehensive genomic analysis of several anopheline species, most of which are malaria vectors. In the frame of the international consortium carrying out this project our team studied the genes encoding families of non-coding RNAs (ncRNAs), concentrating on four classes: microRNA (miRNA), ribosomal RNA (rRNA), small nuclear RNA (snRNA), and in particular small nucleolar RNA (snoRNA) and, finally, transfer RNA (tRNA). Results Our analysis was carried out using, exclusively, computational approaches, and evaluating both the primary NGS reads as well as the respective genome assemblies produced by the consortium and stored in VectorBase; moreover, the results of RNAseq surveys in cases in which these were available and meaningful were also accessed in order to obtain supplementary data, as were “pre-genomic era” sequence data stored in nucleic acid databases. The investigation included the identification and analysis, in most species studied, of ncRNA genes belonging to several families, as well as the analysis of the evolutionary relations of some of those genes in cross-comparisons to other members of the genus Anopheles. Conclusions Our study led to the identification of members of these gene families in the majority of twenty different anopheline taxa. A set of tools for the study of the evolution and molecular biology of important disease vectors has, thus, been obtained. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1038) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology, FORTH, Heraklion, Greece.
| |
Collapse
|
4
|
Windbichler N, Papathanos PA, Catteruccia F, Ranson H, Burt A, Crisanti A. Homing endonuclease mediated gene targeting in Anopheles gambiae cells and embryos. Nucleic Acids Res 2007; 35:5922-33. [PMID: 17726053 PMCID: PMC2034484 DOI: 10.1093/nar/gkm632] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2007] [Revised: 07/31/2007] [Accepted: 07/31/2007] [Indexed: 01/18/2023] Open
Abstract
Homing endonuclease genes (HEGs) are 'selfish' genetic elements that combine the capability to selectively disrupt specific gene sequences with the ability to rapidly spread from a few individuals to an entire population through homologous recombination repair events. Because of these properties, HEGs are regarded as promising candidates to transfer genetic modifications from engineered laboratory mosquitoes to wild-type populations including Anopheles gambiae the vector of human malaria. Here we show that I-SceI and I-PpoI homing endonucleases cleave their recognition sites with high efficiency in A. gambiae cells and embryos and we demonstrate HEG-induced homologous and non-homologous repair events in a variety of functional assays. We also propose a gene drive system for mosquitoes that is based on our finding that I-PpoI cuts genomic rDNA located on the X chromosome in A. gambiae, which could be used to selectively incapacitate X-carrying spermatozoa thereby imposing a severe male-biased sex ratio.
Collapse
Affiliation(s)
- Nikolai Windbichler
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| | - Philippos Aris Papathanos
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| | - Flaminia Catteruccia
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| | - Hilary Ranson
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| | - Austin Burt
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| | - Andrea Crisanti
- Division of Cell and Molecular Biology, Imperial College London, Imperial College Road, London SW7 2AZ and Division of Biology and NERC Centre for Population Biology, Imperial College London, Silwood Park, Ascot, Berks SL5 7PY, UK
| |
Collapse
|
5
|
Novikova O, Śliwińska E, Fet V, Settele J, Blinov A, Woyciechowski M. CR1 clade of non-LTR retrotransposons from Maculinea butterflies (Lepidoptera: Lycaenidae): evidence for recent horizontal transmission. BMC Evol Biol 2007; 7:93. [PMID: 17588269 PMCID: PMC1925062 DOI: 10.1186/1471-2148-7-93] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 06/25/2007] [Indexed: 01/28/2023] Open
Abstract
Background Non-long terminal repeat (non-LTR) retrotransposons are mobile genetic elements that propagate themselves by reverse transcription of an RNA intermediate. Non-LTR retrotransposons are known to evolve mainly via vertical transmission and random loss. Horizontal transmission is believed to be a very rare event in non-LTR retrotransposons. Our knowledge of distribution and diversity of insect non-LTR retrotransposons is limited to a few species – mainly model organisms such as dipteran genera Drosophila, Anopheles, and Aedes. However, diversity of non-LTR retroelements in arthropods seems to be much richer. The present study extends the analysis of non-LTR retroelements to CR1 clade from four butterfly species of genus Maculinea (Lepidoptera: Lycaenidae). The lycaenid genus Maculinea, the object of interest for evolutionary biologists and also a model group for European biodiversity studies, possesses a unique, specialized myrmecophilous lifestyle at larval stage. Their caterpillars, after three weeks of phytophagous life on specific food plants drop to the ground where they are adopted to the ant nest by Myrmica foraging workers. Results We found that the genome of Maculinea butterflies contains multiple CR1 lineages of non-LTR retrotransposons, including those from MacCR1A, MacCR1B and T1Q families. A comparative analysis of RT nucleotide sequences demonstrated an extremely high similarity among elements both in interspecific and intraspecific comparisons. CR1A-like elements were found only in family Lycaenidae. In contrast, MacCR1B lineage clones were extremely similar to CR1B non-LTR retrotransposons from Bombycidae moths: silkworm Bombyx mori and Oberthueria caeca. Conclusion The degree of coding sequence similarity of the studied elements, their discontinuous distribution, and results of divergence-versus-age analysis make it highly unlikely that these sequences diverged at the same time as their host taxa. The only reasonable alternative explanation is horizontal transfer. In addition, phylogenetic markers for population analysis of Maculinea could be developed based on the described non-LTR retrotransposons.
Collapse
Affiliation(s)
- Olga Novikova
- Laboratory of Molecular Evolution, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Ewa Śliwińska
- UJAG – Jagiellonian University, Institute of Environmental Sciences, Krakow, Poland
| | - Victor Fet
- Marshall University, Huntington, West Virginia, USA
| | - Josef Settele
- Department of Community Ecology, UFZ – Centre for Environmental Research Leipzig-Halle, Halle (Saale), Germany
| | - Alexander Blinov
- Laboratory of Molecular Evolution, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Michal Woyciechowski
- UJAG – Jagiellonian University, Institute of Environmental Sciences, Krakow, Poland
| |
Collapse
|
6
|
Bringaud F, Biteau N, Melville SE, Hez S, El-Sayed NM, Leech V, Berriman M, Hall N, Donelson JE, Baltz T. A new, expressed multigene family containing a hot spot for insertion of retroelements is associated with polymorphic subtelomeric regions of Trypanosoma brucei. EUKARYOTIC CELL 2002; 1:137-51. [PMID: 12455980 PMCID: PMC118050 DOI: 10.1128/ec.1.1.137-151.2002] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We describe a novel gene family that forms clusters in subtelomeric regions of Trypanosoma brucei chromosomes and partially accounts for the observed clustering of retrotransposons. The ingi and ribosomal inserted mobile element (RIME) non-LTR retrotransposons share 250 bp at both extremities and are the most abundant putatively mobile elements, with about 500 copies per haploid genome. From cDNA clones and subsequently in the T. brucei genomic DNA databases, we identified 52 homologous gene and pseudogene sequences, 16 of which contain a RIME and/or ingi retrotransposon inserted at exactly the same relative position. Here these genes are called the RHS family, for retrotransposon hot spot. Comparison of the protein sequences encoded by RHS genes (21 copies) and pseudogenes (24 copies) revealed a conserved central region containing an ATP/GTP-binding motif and the RIME/ingi insertion site. The RHS proteins share between 13 and 96% identity, and six subfamilies, RHS1 to RHS6, can be defined on the basis of their divergent C-terminal domains. Immunofluorescence and Western blot analyses using RHS subfamily-specific immune sera show that RHS proteins are constitutively expressed and occur mainly in the nucleus. Analysis of Genome Survey Sequence databases indicated that the Trypanosoma brucei diploid genome contains about 280 RHS (pseudo)genes. Among the 52 identified RHS (pseudo)genes, 48 copies are in three RHS clusters located in subtelomeric regions of chromosomes Ia and II and adjacent to the active bloodstream form expression site in T. brucei strain TREU927/4 GUTat10.1. RHS genes comprise the remaining sequence of the size-polymorphic "repetitive region" described for T. brucei chromosome I, and a homologous gene family is present in the Trypanosoma cruzi genome.
Collapse
Affiliation(s)
- Frederic Bringaud
- Laboratoire de Parasitologie Moléculaire, Université Victor Segalen Bordeaux II, UMR-5016 CNRS, 33076 Bordeaux, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Anzai T, Takahashi H, Fujiwara H. Sequence-specific recognition and cleavage of telomeric repeat (TTAGG)(n) by endonuclease of non-long terminal repeat retrotransposon TRAS1. Mol Cell Biol 2001; 21:100-8. [PMID: 11113185 PMCID: PMC88784 DOI: 10.1128/mcb.21.1.100-108.2001] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The telomere of the silkworm Bombyx mori consists of (TTAGG/CCTAA)(n) repeats and harbors a large number of telomeric repeat-specific non-long terminal repeat retrotransposons, such as TRAS1 and SART1. To understand how these retrotransposons recognize and integrate into the telomeric repeat in a sequence-specific manner, we expressed the apurinic-apryrimidinic endonuclease-like endonuclease domain of TRAS1 (TRAS1 EN), which is supposed to digest the target DNA, and characterized its enzymatic properties. Purified TRAS1 EN could generate specific nicks on both strands of the telomeric repeat sequence between T and A of the (TTAGG)(n) strand (bottom strand) and between C and T of the (CCTAA)(n) strand (top strand). These sites are consistent with insertion sites expected from the genomic structure of boundary regions of TRAS1. Time course studies of nicking activities on both strands revealed that the cleavages on the bottom strand preceded those on the top strand, supporting the target-primed reverse transcription model. TRAS1 EN could cleave the telomeric repeats specifically even if it was flanked by longer tracts of nontelomeric sequence, indicating that the target site specificity of the TRAS1 element was mainly determined by its EN domain. Based on mutation analyses, TRAS1 EN recognizes less than 10 bp around the initial cleavage site (upstream 7 bp and downstream 3 bp), and the GTTAG sequence especially is essential for the cleavage reaction on the bottom strand (5'. TTAGGTT downward arrow AGG. 3'). TRAS1 EN, the first identified endonuclease digesting telomeric repeats, may be used as a genetic tool to shorten the telomere in insects and some other organisms.
Collapse
Affiliation(s)
- T Anzai
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bunkyo-ku, Tokyo 113-0033, Japan
| | | | | |
Collapse
|
8
|
Mathiopoulos KD, della Torre A, Predazzi V, Petrarca V, Coluzzi M. Cloning of inversion breakpoints in the Anopheles gambiae complex traces a transposable element at the inversion junction. Proc Natl Acad Sci U S A 1998; 95:12444-9. [PMID: 9770505 PMCID: PMC22850 DOI: 10.1073/pnas.95.21.12444] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Anopheles arabiensis, one of the two most potent malaria vectors of the gambiae complex, is characterized by the presence of chromosomal paracentric inversions. Elucidation of the nature and the dynamics of these inversions is of paramount importance for the understanding of the population genetics and evolutionary biology of this mosquito and of the impact on malaria epidemiology. We report here the cloning of the breakpoints of the naturally occurring polymorphic inversion 2Rd' of A. arabiensis. A cDNA clone that cytologically mapped on the proximal breakpoint was the starting material for the isolation of a cosmid clone that spanned the breakpoint. Analysis of the surrounding sequences demonstrated that adjacent to the distal breakpoint lies a repetitive element that exhibits distinct distribution in different A. arabiensis strains. Sequencing analysis of that area revealed elements characteristic of transposable element terminal repeats. We called this presumed transposable element Odysseus. The presence of Odysseus at the junction of the naturally occuring inversion 2Rd' suggests that the inversion may be the result of the transposable element's activity. Characteristics of Odysseus' terminal region as well as its cytological distribution in different strains may indicate a relatively recent activity of Odysseus.
Collapse
Affiliation(s)
- K D Mathiopoulos
- Istituto di Parassitologia, Fondazione Pasteur-Cenci Bolognetti, Università di Roma "La Sapienza," Rome 00185, Italy.
| | | | | | | | | |
Collapse
|
9
|
Takahashi H, Okazaki S, Fujiwara H. A new family of site-specific retrotransposons, SART1, is inserted into telomeric repeats of the silkworm, Bombyx mori. Nucleic Acids Res 1997; 25:1578-84. [PMID: 9092665 PMCID: PMC146635 DOI: 10.1093/nar/25.8.1578] [Citation(s) in RCA: 77] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The telomeres of the silkworm, Bombyx mori, consist of pentanucleotide repeats (TTAGG)n . We previously characterized the non-LTR element TRAS1, which terminates with oligo (A) in a head to tail orientation at the exact position (between A and C) of the (CCTAA) n repeats. Here we characterized another family of telomere-specific non-LTR retrotransposon named SART1. The SART1 family was inserted at another site of the (TTAGG) n in a reverse orientation from that of TRAS1. The complete unit of SART1, 6.7 kb in length with a poly (A) stretch, contains two open reading frames encoding putative gag and pol products, overlapping by 54 bp in the -1 reading frame. Most of the 600 SART1 copies in the silkworm haploid genome are completely conserved in structure without 5'truncation. All SART1 sequences analyzed were inserted at the same position (between T and A) within the (TTAGG) n repeats. Fluorescence in situ hybridization showed that many of the SART1 copies were localized in the chromosomal ends. A phylogenetic tree showed that the SART1, TRAS1 and two other site-specific elements, R1 and RT, which insert into 28S ribosomal RNA genes in insects, belong to the same group. Based on the orientation for the chromosomal insertion and structural similarities, these elements could be further classified into two subgroups, R1/TRAS1 and RT/SART1, suggesting that the target specificity of the two telomere-associated elements was changed independently.
Collapse
Affiliation(s)
- H Takahashi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo, Bunkyo-ku, Tokyo 113, Japan
| | | | | |
Collapse
|
10
|
Burke WD, Müller F, Eickbush TH. R4, a non-LTR retrotransposon specific to the large subunit rRNA genes of nematodes. Nucleic Acids Res 1995; 23:4628-34. [PMID: 8524653 PMCID: PMC307436 DOI: 10.1093/nar/23.22.4628] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
A 4.7 kb sequence-specific insertion in the 26S ribosomal RNA gene of Ascaris lumbricoides, named R4, is shown to be a non-long terminal repeat (non-LTR) retrotransposable element. The R4 element inserts at a site in the large subunit rRNA gene which is midway between two other sequence-specific non-LTR retrotransposable elements, R1 and R2, found in most insect species. Based on the structure of its open reading frame and the sequence of its reverse transcriptase domain, R4 elements do not appear to be a family of R1 or R2 elements that have changed their insertion site. R4 is most similar in structure and in sequence to the element Dong, which is not specialized for insertion into rRNA units. Thus R4 represents a separate non-LTR retrotransposable element that has become specialized for insertion in the rRNA genes of its host. Using oligonucleotide primers directed to a conserved region of the reverse transcriptase encoding domain, insertions in the R4 site were also amplified from Parascaris equorum and Haemonchus contortus. Why several non-LTR retrotransposable elements have become specialized for insertion into a short (87 bp) region of the large subunit rRNA gene is discussed.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Ascaridoidea/genetics
- Ascaris lumbricoides/genetics
- Base Sequence
- Binding Sites
- Cloning, Molecular
- DNA Primers
- DNA, Helminth/genetics
- DNA, Ribosomal/genetics
- DNA-Binding Proteins/genetics
- DNA-Binding Proteins/metabolism
- Haemonchus/genetics
- Molecular Sequence Data
- Mutagenesis, Insertional
- Nematoda/genetics
- Phylogeny
- Polymerase Chain Reaction
- RNA, Helminth/biosynthesis
- RNA, Helminth/genetics
- RNA, Ribosomal/biosynthesis
- RNA, Ribosomal/genetics
- Repetitive Sequences, Nucleic Acid
- Retroelements
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Species Specificity
Collapse
Affiliation(s)
- W D Burke
- Department of Biology, University of Rochester, NY 14627, USA
| | | | | |
Collapse
|
11
|
Teng SC, Wang SX, Gabriel A. A new non-LTR retrotransposon provides evidence for multiple distinct site-specific elements in Crithidia fasciculata miniexon arrays. Nucleic Acids Res 1995; 23:2929-36. [PMID: 7659515 PMCID: PMC307132 DOI: 10.1093/nar/23.15.2929] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
We have identified a new member of the family of trypanosome site-specific retrotransposons, using a degenerate oligonucleotide PCR strategy. The 9595 bp element, termed Crithidia retrotransposable element 2 (CRE2), was cloned and found to be inserted in the tandemly arrayed miniexon genes of Crithidia fasciculata. The element is flanked by 29 bp target site duplications but lacks the 3' poly dA tract characteristic of most other non-long terminal repeat retrotransposons. The amino terminal region of the single 2518-codon open reading frame contains a putative metal-binding motif and a proline-rich region similar to gag-like domains of other retrotransposons. The carboxy terminal region of this open reading frame shares sequence homology with the reverse transcriptase and putative endonuclease regions of three previously described trypanosomatid site-specific retrotransposons. All four of these retrotransposons are specifically inserted between nucleotides 11 and 12 of the highly conserved 39mer sequence of the miniexon gene. Most copies of CRE2 and the previously characterized CRE1 are located on different sized chromosomes. Additional CRE-related sequences were identified by screening Crithidia libraries. These results suggest that a particular sequence in the C. fasciculata miniexon repeat is the target for multiple distinct site-specific retrotransposon insertions.
Collapse
Affiliation(s)
- S C Teng
- Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ 08855, USA
| | | | | |
Collapse
|