1
|
Papathanos PA, Windbichler N. Redkmer: An Assembly-Free Pipeline for the Identification of Abundant and Specific X-Chromosome Target Sequences for X-Shredding by CRISPR Endonucleases. CRISPR J 2018; 1:88-98. [PMID: 30627701 PMCID: PMC6319322 DOI: 10.1089/crispr.2017.0012] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
CRISPR-based synthetic sex ratio distorters, which operate by shredding the X-chromosome during male meiosis, are promising tools for the area-wide control of harmful insect pest or disease vector species. X-shredders have been proposed as tools to suppress insect populations by biasing the sex ratio of the wild population toward males, thus reducing its natural reproductive potential. However, to build synthetic X-shredders based on CRISPR, the selection of gRNA targets, in the form of high-copy sequence repeats on the X chromosome of a given species, is difficult, since such repeats are not accurately resolved in genome assemblies and cannot be assigned to chromosomes with confidence. We have therefore developed the redkmer computational pipeline, designed to identify short and highly abundant sequence elements occurring uniquely on the X chromosome. Redkmer was designed to use as input minimally processed whole genome sequence data from males and females. We tested redkmer with short- and long-read whole genome sequence data of Anopheles gambiae, the major vector of human malaria, in which the X-shredding paradigm was originally developed. Redkmer established long reads as chromosomal proxies with excellent correlation to the genome assembly and used them to rank X-candidate kmers for their level of X-specificity and abundance. Among these, a high-confidence set of 25-mers was identified, many belonging to previously known X-chromosome repeats of Anopheles gambiae, including the ribosomal gene array and the selfish elements harbored within it. Data from a control strain, in which these repeats are shared with the Y chromosome, confirmed the elimination of these kmers during filtering. Finally, we show that redkmer output can be linked directly to gRNA selection and off-target prediction. In addition, the output of redkmer, including the prediction of chromosomal origin of single-molecule long reads and chromosome specific kmers, could also be used for the characterization of other biologically relevant sex chromosome sequences, a task that is frequently hampered by the repetitiveness of sex chromosome sequence content.
Collapse
Affiliation(s)
- Philippos Aris Papathanos
- Department of Experimental Medicine, Section of Genomics and Genetics, University of Perugia, Perugia, Italy
| | - Nikolai Windbichler
- Department of Life Sciences, Imperial College London, Sir Alexander Fleming Building, South Kensington Campus, London, United Kingdom
| |
Collapse
|
2
|
Dritsou V, Deligianni E, Dialynas E, Allen J, Poulakakis N, Louis C, Lawson D, Topalis P. Non-coding RNA gene families in the genomes of anopheline mosquitoes. BMC Genomics 2014; 15:1038. [PMID: 25432596 PMCID: PMC4300560 DOI: 10.1186/1471-2164-15-1038] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Accepted: 11/19/2014] [Indexed: 12/12/2022] Open
Abstract
Background Only a small fraction of the mosquito species of the genus Anopheles are able to transmit malaria, one of the biggest killer diseases of poverty, which is mostly prevalent in the tropics. This diversity has genetic, yet unknown, causes. In a further attempt to contribute to the elucidation of these variances, the international “Anopheles Genomes Cluster Consortium” project (a.k.a. “16 Anopheles genomes project”) was established, aiming at a comprehensive genomic analysis of several anopheline species, most of which are malaria vectors. In the frame of the international consortium carrying out this project our team studied the genes encoding families of non-coding RNAs (ncRNAs), concentrating on four classes: microRNA (miRNA), ribosomal RNA (rRNA), small nuclear RNA (snRNA), and in particular small nucleolar RNA (snoRNA) and, finally, transfer RNA (tRNA). Results Our analysis was carried out using, exclusively, computational approaches, and evaluating both the primary NGS reads as well as the respective genome assemblies produced by the consortium and stored in VectorBase; moreover, the results of RNAseq surveys in cases in which these were available and meaningful were also accessed in order to obtain supplementary data, as were “pre-genomic era” sequence data stored in nucleic acid databases. The investigation included the identification and analysis, in most species studied, of ncRNA genes belonging to several families, as well as the analysis of the evolutionary relations of some of those genes in cross-comparisons to other members of the genus Anopheles. Conclusions Our study led to the identification of members of these gene families in the majority of twenty different anopheline taxa. A set of tools for the study of the evolution and molecular biology of important disease vectors has, thus, been obtained. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1038) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology, FORTH, Heraklion, Greece.
| |
Collapse
|
3
|
Kojima KK, Kuma KI, Toh H, Fujiwara H. Identification of rDNA-specific non-LTR retrotransposons in Cnidaria. Mol Biol Evol 2006; 23:1984-93. [PMID: 16870681 DOI: 10.1093/molbev/msl067] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Ribosomal RNA genes are abundant repetitive sequences in most eukaryotes. Ribosomal DNA (rDNA) contains many insertions derived from mobile elements including non-long terminal repeat (non-LTR) retrotransposons. R2 is the well-characterized 28S rDNA-specific non-LTR retrotransposon family that is distributed over at least 4 bilaterian phyla. R2 is a large family sharing the same insertion specificity and classified into 4 clades (R2-A, -B, -C, and -D) based on the N-terminal domain structure and the phylogeny. There is no observation of horizontal transfer of R2; therefore, the origin of R2 dates back to before the split between protostomes and deuterostomes. Here, we in silico identified 1 R2 element from the sea anemone Nematostella vectensis and 2 R2-like retrotransposons from the hydrozoan Hydra magnipapillata. R2 from N. vectensis was inserted into the 28S rDNA like other R2, but the R2-like elements from H. magnipapillata were inserted into the specific sequence in the highly conserved region of the 18S rDNA. We designated the Hydra R2-like elements R8. R8 is inserted at 37 bp upstream from R7, another 18S rDNA-specific retrotransposon family. There is no obvious sequence similarity between targets of R2 and R8, probably because they recognize long DNA sequences. Domain structure and phylogeny indicate that R2 from N. vectensis is the member of the R2-D clade, and R8 from H. magnipapillata belongs to the R2-A clade despite its different sequence specificity. These results suggest that R2 had been generated before the split between cnidarians and bilaterians and that R8 is a retrotransposon family that changed its target from the 28S rDNA to the 18S rDNA.
Collapse
Affiliation(s)
- Kenji K Kojima
- Institute for Chemical Research, Kyoto University, Uji, Japan.
| | | | | | | |
Collapse
|
4
|
Zingler N, Weichenrieder O, Schumann GG. APE-type non-LTR retrotransposons: determinants involved in target site recognition. Cytogenet Genome Res 2005; 110:250-68. [PMID: 16093679 DOI: 10.1159/000084959] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2003] [Accepted: 02/05/2004] [Indexed: 10/25/2022] Open
Abstract
Non-long terminal repeat (Non-LTR) retrotransposons represent a diverse and widely distributed group of transposable elements and an almost ubiquitous component of eukaryotic genomes that has a major impact on evolution. Their copy number can range from a few to several million and they often make up a significant fraction of the genomes. The members of the dominating subtype of non-LTR retrotransposons code for an endonuclease with homology to apurinic/apyrimidinic endonucleases (APE), and are thus termed APE-type non-LTR retrotransposons. In the last decade both the number of identified non-LTR retrotransposons and our knowledge of biology and evolution of APE-type non-LTR retrotransposons has increased tremendously.
Collapse
Affiliation(s)
- N Zingler
- Paul-Ehrlich-Institut, Langen, Germany
| | | | | |
Collapse
|
5
|
Boulesteix M, Biémont C. Transposable elements in mosquitoes. Cytogenet Genome Res 2005; 110:500-9. [PMID: 16093703 DOI: 10.1159/000084983] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2003] [Accepted: 01/27/2004] [Indexed: 11/19/2022] Open
Abstract
We describe the current state of knowledge about transposable elements (TEs) in different mosquito species. DNA-based elements (class II elements), non-LTR retrotransposons (class I elements), and MITEs (Miniature Inverted Repeat Transposable Elements) are found in the three genera, Anopheles, Aedes and Culex, whereas LTR retrotransposons (class I elements) are found only in Anopheles and Aedes. Mosquitoes were the first insects in which MITEs were reported; they have several LTR retrotransposons belonging to the Pao family, which is distinct from the Gypsy-Ty3 and Copia-Ty1 families. The number of TE copies shows huge variations between classes of TEs within a given species (from 1 to 1000), in sharp contrast to Drosophila, which shows only relatively minor differences in copy number between elements (from 1 to 100). The genomes of these insects therefore display major differences in the amount of TEs and therefore in their structure and global composition. We emphasize the need for more population genetic data about the activity of TEs, their distribution over chromosomes and their frequencies in natural populations of mosquitoes, to further the current attempts to develop a transgenic mosquito unable to transmit malaria that is intended to replace the natural populations.
Collapse
Affiliation(s)
- M Boulesteix
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, Villeurbanne, France
| | | |
Collapse
|
6
|
Chen Y, Schneeberger RG, Cullis CA. A site-specific insertion sequence in flax genotrophs induced by environment. THE NEW PHYTOLOGIST 2005; 167:171-80. [PMID: 15948840 DOI: 10.1111/j.1469-8137.2005.01398.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
A single-copy 5.7 kilobase (kb) DNA fragment, termed Linum Insertion Sequence 1 (LIS-1), has been identified and characterized. This is one of the DNA changes associated with the environmentally induced heritable changes resulting in stable lines termed genotrophs in flax (Linum usitatissimum). The insertion sequence and its insertion site have been cloned from genomic libraries and sequenced. PCR products across the insertion and surrounding regions have also been cloned and sequenced. The 5.7 kb DNA fragment is inserted into a 3.7 kb EcoRI fragment in the plastic line (Pl) with the generation of a 3 base pair duplication at the insertion site, as well as additional sequence changes. The identical insertion was also found in other genotrophs and flax varieties. The intact element was not present in Pl but appeared to be generated by a reproducible series of complex rearrangements or insertion events. LIS-1 is the result of a targeted, highly specific, complex insertion event that occurs during the formation of some of the genotrophs, and occurs naturally in many flax and linseed varieties.
Collapse
Affiliation(s)
- Yiming Chen
- Department of Biology, Case Western Reserve University, Cleveland, Ohio 44106-7080, USA
| | | | | |
Collapse
|
7
|
Tu Z, Coates C. Mosquito transposable elements. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2004; 34:631-644. [PMID: 15242704 DOI: 10.1016/j.ibmb.2004.03.016] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2004] [Accepted: 03/18/2004] [Indexed: 05/24/2023]
Abstract
The completion of the genome assembly for the African malaria mosquito, Anopheles gambiae, and continuing genomic efforts for the yellow fever mosquito, Aedes aegypti, have allowed the use of bioinformatics tools to identify and characterize a diverse array of transposable elements (TEs) in these and other mosquito genomes. An overview of the types and number of both RNA-mediated and DNA-mediated TEs that are found in mosquito genomes is presented. A number of novel and interesting TEs from these species are discussed in more detail. These findings have significant implications for our understanding of mosquito genome evolution and for future modifications of natural mosquito populations through the use of TE-mediated genetic transformation.
Collapse
Affiliation(s)
- Zhijian Tu
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061, USA.
| | | |
Collapse
|
8
|
Sharma R, Bagchi A, Bhattacharya A, Bhattacharya S. Characterization of a retrotransposon-like element from Entamoeba histolytica. Mol Biochem Parasitol 2001; 116:45-53. [PMID: 11463465 DOI: 10.1016/s0166-6851(01)00300-0] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The protozoan parasite Entamoeba histolytica is the causative agent of amoebiasis. The genome organization of this organism is not well understood. We had earlier reported the presence of a multicopy sequence, HMc, in E. histolytica. Subsequent analysis showed that HMc is a member of a retrotransposon family that we have named the E. histolytica retrotransposon-like element (EhRLE). Four other members of this family have been characterized. The EhRLE family is distributed across all chromosomes of the parasite. There are 140 copies, which show minor sequence variation with respect to one another (2--4% from the consensus sequence). From a sequence analysis of five members of the EhRLE family, the complete EhRLE unit is estimated to be 4086 bp in length. It has a 27-mer inverted repeat at its ends. A pairwise comparison with sequences in the database showed a highly significant match of a part of EhRLE with reverse transcriptases (RT), especially those encoded by non-long terminal repeat retrotransposons. There are stop codons in all the five EhRLEs, but a continuous open reading frame of 464 amino acids could be reconstructed by comparing the sequences of several EhRLEs. The reconstructed sequence showed a much better identity with RT as compared with any of the original EhRLE sequences. The non-pathogenic species, Entamoeba dispar, also contains this element, with 85% sequence identity with EhRLE. The data suggest that EhRLE may be a retrotransposon, but many of its members are probably nonfunctional due to the accumulation of mutations.
Collapse
Affiliation(s)
- R Sharma
- School of Life Sciences, Jawaharlal Nehru University, New Delhi-110067, India
| | | | | | | |
Collapse
|
9
|
Kubo Y, Okazaki S, Anzai T, Fujiwara H. Structural and phylogenetic analysis of TRAS, telomeric repeat-specific non-LTR retrotransposon families in Lepidopteran insects. Mol Biol Evol 2001; 18:848-57. [PMID: 11319268 DOI: 10.1093/oxfordjournals.molbev.a003866] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
TRAS1 is a non-LTR retrotransposon inserted specifically into the telomeric repeat (TTAGG)(n) in the silkworm, Bombyx mori. To characterize the evolutionary origin of TRAS-like elements, we identified seven TRAS families (TRAS3, TRAS4, TRAS5, TRAS6, TRASY, TRASZ, and TRASW) from B. mori and four elements from two Lepidoptera, Dictyoploca japonica (TRASDJ) and Samia cynthia ricini (TRASSC3, TRASSC4, and TRASSC9). More than 2,000 copies of various Bombyx TRAS elements accumulated within (TTAGG)(n) sequences as unusual but orderly tandem repeats. The 5' and 3' regions were highly conserved within each class of Bombyx TRAS elements without truncation. This suggests that distinct classes of TRAS have been maintained independently by retrotransposition into (TTAGG)(n). The phylogenetic tree of site-specific retroelements showed that nine TRAS families in Lepidoptera constitute a single phylogenetic group that is closely related to the R1 family that inserts specifically into arthropod 28S rDNA. The higher amino acid sequence identity from endonuclease (EN) to reverse transcriptase (RT) domains between TRAS groups (about 37%-70%) than among TRAS elements and R1Bm (about 25%-30%), may reflect the presence of some DNA structure responsible for their target specificity. Sequence comparison from EN to RT domains among non-LTR elements revealed several regions conserved only within TRAS elements. We found a highly conserved region that resembles the Myb-like DNA-binding structure, between the EN and RT domains. These regions may be involved in site-specific integration of TRAS elements into the (TTAGG)(n) telomeric repeats.
Collapse
Affiliation(s)
- Y Kubo
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | | | | | | |
Collapse
|
10
|
Busseau I, Berezikov E, Bucheton A. Identification of Waldo-A and Waldo-B, two closely related non-LTR retrotransposons in Drosophila. Mol Biol Evol 2001; 18:196-205. [PMID: 11158378 DOI: 10.1093/oxfordjournals.molbev.a003793] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We have identified two novel, closely related subfamilies of non-long-terminal-repeat (non-LTR) retrotransposons in Drosophila melanogaster, the Waldo-A and Waldo-B subfamilies, that are in the same lineage as site-specific LTR retrotransposons of the R1 clade. Both contain potentially active copies with two large open reading frames, having coding capacities for a nucleoprotein as well as endonuclease and reverse transcriptase activities. Many copies are truncated at the 5' end, and most are surrounded by target site duplications of variable lengths. Elements of both subfamilies have a nonrandom distribution in the genome, often being inserted within or very close to (CA)(n) arrays. At the DNA level, the longest elements of Waldo-A and Waldo-B are 69% identical on their entire length, except for the 5' untranslated regions, which have a mosaic organization, suggesting that one arose from the other following new promoter acquisition. This event occurred before the speciation of the D. melanogaster subgroup of species, since both Waldo-A and Waldo-B coexist in other species of this subgroup.
Collapse
Affiliation(s)
- I Busseau
- Institut de Génétique Humaine, Centre National de la Recherche Scientifique, 141 rue de la Cardonille, 34396 Montpellier cedex 05, France.
| | | | | |
Collapse
|
11
|
Severson DW, Brown SE, Knudson DL. Genetic and physical mapping in mosquitoes: molecular approaches. ANNUAL REVIEW OF ENTOMOLOGY 2001; 46:183-219. [PMID: 11112168 DOI: 10.1146/annurev.ento.46.1.183] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The genetic background of individual mosquito species and populations within those species influences the transmission of mosquito-borne pathogens to humans. Technical advances in contemporary genomics are contributing significantly to the detailed genetic analysis of this mosquito-pathogen interaction as well as all other aspects of mosquito biology, ecology, and evolution. A variety of DNA-based marker types are being used to develop genetic maps for a number of mosquito species. Complex phenotypic traits such as vector competence are being dissected into their discrete genetic components, with the intention of eventually using this information to develop new methods to prevent disease transmission. Both genetic- and physical-mapping techniques are being used to define and compare genome architecture among and within mosquito species. The integration of genetic- and physical-map information is providing a sound framework for map-based positional cloning of target genes of interest. This review focuses on advances in genome-based analysis and their specific applications to mosquitoes.
Collapse
Affiliation(s)
- D W Severson
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana 46556, USA.
| | | | | |
Collapse
|
12
|
Yu Z, Wright SI, Bureau TE. Mutator-like elements in Arabidopsis thaliana. Structure, diversity and evolution. Genetics 2000; 156:2019-31. [PMID: 11102392 PMCID: PMC1461377 DOI: 10.1093/genetics/156.4.2019] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
While genome-wide surveys of abundance and diversity of mobile elements have been conducted for some class I transposable element families, little is known about the nature of class II transposable elements on this scale. In this report, we present the results from analysis of the sequence and structural diversity of Mutator-like elements (MULEs) in the genome of Arabidopsis thaliana (Columbia). Sequence similarity searches and subsequent characterization suggest that MULEs exhibit extreme structure, sequence, and size heterogeneity. Multiple alignments at the nucleotide and amino acid levels reveal conserved, potentially transposition-related sequence motifs. While many MULEs share common structural features to Mu elements in maize, some groups lack characteristic long terminal inverted repeats. High sequence similarity and phylogenetic analyses based on nucleotide sequence alignments indicate that many of these elements with diverse structural features may remain transpositionally competent and that multiple MULE lineages may have been evolving independently over long time scales. Finally, there is evidence that MULEs are capable of the acquisition of host DNA segments, which may have implications for adaptive evolution, both at the element and host levels.
Collapse
Affiliation(s)
- Z Yu
- Department of Biology, McGill University, Montreal, Quebec, H3A 1B1 Canada
| | | | | |
Collapse
|
13
|
Cook JM, Martin J, Lewin A, Sinden RE, Tristem M. Systematic screening of Anopheles mosquito genomes yields evidence for a major clade of Pao-like retrotransposons. INSECT MOLECULAR BIOLOGY 2000; 9:109-117. [PMID: 10672078 DOI: 10.1046/j.1365-2583.2000.00167.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
We developed a degenerate PCR procedure to simultaneously amplify products from divergent retrotransposon families within the genomes of Anopheles mosquitoes. The procedure required cloning of multiple PCR products, but more than half of the clones subsequently sequenced were of retrotransposon origin. These included Copia-like and LINE retrotransposons, as well as the first Gypsy-like retrotransposons reported from mosquitoes. Furthermore, some Anopheles retrotransposon sequences showed similarity to the divergent Pao element from the silkmoth Bombyx mori. Phylogenetic analyses provided consistently strong bootstrap support (> 95%) for a major clade of Pao-like retrotransposons, which includes five mosquito sequences and the recently discovered Drosophila retrotransposons BEL and ninja. This appears to represent a new family of Pao-like LTR-retrotransposons distinct from the Copia and Gypsy families.
Collapse
Affiliation(s)
- J M Cook
- Department of Biology, Imperial College at Silwood Park, Ascot, UK.
| | | | | | | | | |
Collapse
|
14
|
Takahashi H, Fujiwara H. Transcription analysis of the telomeric repeat-specific retrotransposons TRAS1 and SART1 of the silkworm Bombyx mori. Nucleic Acids Res 1999; 27:2015-21. [PMID: 10198435 PMCID: PMC148415 DOI: 10.1093/nar/27.9.2015] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The telomeres of the silkworm Bombyx mori consist of (TTAGG)n repeats and harbor a large number of sequence-specific non-LTR retrotransposons such as TRAS1 and SART1. In order to ascertain if TRAS1 and SART1 are transcribed in vivo and if there is a novel transcription mechanism peculiar to the sequence-specific retrotransposons, we studied their transcription. We detected transcripts of TRAS1 and SART1 by northern hybridization in many tissues and the BmN4 cell line of the silkworm. 5'-Rapid amplification of cDNA ends analysis showed that transcription of both elements was initiated precisely from their own 5'-ends and that most of their genomic copies contained these initiation sites. TRAS1 contained an internal promoter and positively regulating elements in the +1/+581 nucleotides in its 2432 bp 5'-untranslated region (UTR). We could not, however, detect any promoter activity in the SART1 5'-UTR. This difference may be related to the fact that only TRAS1 contained an initiator-like element at its 5'-end. Placing 1-52 units of the telomeric repeat (TTAGG)n upstream of TRAS1 reduced transcription 5-fold. The evidence suggests that most of the TRAS1 genomic copies within the telomeric repeats are weakly transcribed in vivo.
Collapse
Affiliation(s)
- H Takahashi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku,Tokyo 113-0033, Japan
| | | |
Collapse
|
15
|
Abstract
Aedes aegypti metaphase chromosome landmarks have been developed so that each chromosome of the haploid genome can be unambiguously identified and oriented by fluorescence in situ hybridization (FISH) and digital imaging microscopy. The FISH tags were derived from three cosmids that contain repetitive Ae. aegypti sequences and their unique FISH tagging characteristics are demonstrated. The sequence of the three chromosomal tags revealed that the chromosome 1 tag is an 18S fragment from the ribosomal cistron, and the other two chromosomal tags are repeats found in Ae. aegypti with no apparent similarity to known sequences. A single plasmid that contains the three chromosomes tag sequences has been constructed to simplify future FISH physical mapping.
Collapse
Affiliation(s)
- S E Brown
- Department of Bioagricultural Sciences and Pest Management, College of Agricultural Sciences, Colorado State University, Fort Collins 80523, USA
| | | |
Collapse
|
16
|
Takahashi H, Okazaki S, Fujiwara H. A new family of site-specific retrotransposons, SART1, is inserted into telomeric repeats of the silkworm, Bombyx mori. Nucleic Acids Res 1997; 25:1578-84. [PMID: 9092665 PMCID: PMC146635 DOI: 10.1093/nar/25.8.1578] [Citation(s) in RCA: 77] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The telomeres of the silkworm, Bombyx mori, consist of pentanucleotide repeats (TTAGG)n . We previously characterized the non-LTR element TRAS1, which terminates with oligo (A) in a head to tail orientation at the exact position (between A and C) of the (CCTAA) n repeats. Here we characterized another family of telomere-specific non-LTR retrotransposon named SART1. The SART1 family was inserted at another site of the (TTAGG) n in a reverse orientation from that of TRAS1. The complete unit of SART1, 6.7 kb in length with a poly (A) stretch, contains two open reading frames encoding putative gag and pol products, overlapping by 54 bp in the -1 reading frame. Most of the 600 SART1 copies in the silkworm haploid genome are completely conserved in structure without 5'truncation. All SART1 sequences analyzed were inserted at the same position (between T and A) within the (TTAGG) n repeats. Fluorescence in situ hybridization showed that many of the SART1 copies were localized in the chromosomal ends. A phylogenetic tree showed that the SART1, TRAS1 and two other site-specific elements, R1 and RT, which insert into 28S ribosomal RNA genes in insects, belong to the same group. Based on the orientation for the chromosomal insertion and structural similarities, these elements could be further classified into two subgroups, R1/TRAS1 and RT/SART1, suggesting that the target specificity of the two telomere-associated elements was changed independently.
Collapse
Affiliation(s)
- H Takahashi
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Hongo, Bunkyo-ku, Tokyo 113, Japan
| | | | | |
Collapse
|
17
|
Persengiev SP, Kilpatrick DL. Characterization of a cDNA containing trinucleotide repeat sequences that is highly enriched in spermatogenic cells. Mol Reprod Dev 1997; 46:476-81. [PMID: 9094094 DOI: 10.1002/(sici)1098-2795(199704)46:4<476::aid-mrd5>3.0.co;2-l] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Trinucleotide repeat sequences have become of great interest due to their association with specific genetic disorders. Here we report the identification of a cDNA containing opa trinucleotide repeats from mouse testis, termed t-OPA. The opa repeat is contained within the longest open reading frame within the cDNA. Northern analysis demonstrated that four distinct t-OPA transcripts (1.6, 2.5, 3.6, 4.0 kilobases) are preferentially expressed in mouse and rat testis, with low expression in the pituitary, brain, and adrenal gland. Further, t-OPA RNAs were highly abundant in both pachytene spermatocytes and round spermatids and decreased in cytoplasts. Polysome profile analysis indicated that t-OPA mRNAs are translated in mouse testis with efficiencies similar to other transcripts expressed in late meiotic/early post-meiotic spermatogenic cells. These findings thus suggest a role for cell-specific mRNAs containing opa repeats during mouse spermatogenesis.
Collapse
Affiliation(s)
- S P Persengiev
- Neurobiology Group, Worcester Foundation for Biomedical Research, Shrewsbury, MA 01545, USA
| | | |
Collapse
|
18
|
Blinov AG, Sobanov YV, Scherbik SV, Aimanova KG. The Chironomus (Camptochironomus) tentans genome contains two non-LTR retrotransposons. Genome 1997; 40:143-50. [PMID: 9061921 DOI: 10.1139/g97-021] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A cDNA library from salivary gland cells of Chironomus tentans was screened with a probe containing the NLRCth1 non-LTR (long terminal repeat) retrotransposon from Chironomus thummi. Several positive clones were obtained and one of them, p62, was characterized by in situ hybridization and sequencing. The sequencing analysis showed that this clone contained a 4607 bp nucleotide sequence of a new transposable element that hybridized in situ to more than 100 sites over all four C. tentans chromosomes. The detailed analysis of this sequence revealed the presence of the 3'-end of open reading frame 1 (ORF1), a complete ORF2, and a 1.3-kb 3'-end untranslated region (UTR). The new element has been designated NLRCt2 (non-LTR retrotransposon 2 from C. tentans). A comparison of the nucleotide sequences of NLRCth1 and NLRCt2 showed 30% similarity in the region of ORF1 and 70% similarity in the region of ORF2. Based on the results of Southern blot analysis, two transposable elements have been found in the C. tentans genome, one of which is identical to NLRCth1 from C. thummi. This may be explained by horizontal transmission. The second element, NLRCt2, has been found in two different forms in the C. tentans genome. These can be distinguished by the presence of the 1.3-kb 3'-end UTR in one of the forms. Since the cDNA clone investigated was isolated from a tissue-specific cDNA library, the data showed that NRLCt2 is expressed in somatic cells.
Collapse
Affiliation(s)
- A G Blinov
- Institute of Cytology and Genetics, Siberian Department of Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | | | |
Collapse
|
19
|
Abstract
Transposable elements are discrete mobile DNA segments that can insert into non-homologous target sites. Diverse patterns of target site selectivity are observed: Some elements display considerable target site selectivity and others display little obvious selectivity, although none appears to be truly "random." A variety of mechanisms for target site selection are used: Some elements use direct interactions between the recombinase and target DNA whereas other elements depend upon interactions with accessory proteins that communicate both with the target DNA and the recombinase. The study of target site selectivity is useful in probing recombination mechanisms, in studying genome structure and function, and also in providing tools for genome manipulation.
Collapse
Affiliation(s)
- N L Craig
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| |
Collapse
|
20
|
Besansky NJ, Mukabayire O, Bedell JA, Lusz H. Pegasus, a small terminal inverted repeat transposable element found in the white gene of Anopheles gambiae. Genetica 1996; 98:119-29. [PMID: 8976060 DOI: 10.1007/bf00121360] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Pegasus, a novel transposable element, was discovered as a length polymorphism in the white gene of Anopheles gambiae. Sequence analysis revealed that this 535 bp element was flanked by 8 bp target site duplications and 8 bp perfect terminal inverted repeats similar to those found in many members of the Tc1 family. Its small size and lack of long open reading frames preclude protein coding capacity. Southern analysis and in situ hybridization to polytene chromosomes demonstrated that Pegasus occurs in approximately 30 copies in the genomes of An. gambiae and its sibling species and is homogenous in structure but polymorphic in chromosomal location. Characterization of five additional elements by sequencing revealed nucleotide identities of 95% to 99%. Of 30 Pegasus-containing phage clones examined by PCR, only one contained an element exceeding 535 bp in length, due to the insertion of another transposable element-like sequence. Thus, the majority, if not all, extant Pegasus elements may be defective copies of a complete element whose contemporary existence in An. gambiae is uncertain. No Pegasus-hybridizing sequences were detected in nine other anophelines and three culicines examined, suggesting a very limited taxonomic distribution.
Collapse
Affiliation(s)
- N J Besansky
- Division of Parasitic Diseases, Centers for Disease Control and Prevention, Chamblee, GA 30341, USA
| | | | | | | |
Collapse
|
21
|
Mukabayire O, Besansky NJ. Distribution of T1, Q, Pegasus and mariner transposable elements on the polytene chromosomes of PEST, a standard strain of Anopheles gambiae. Chromosoma 1996; 104:585-95. [PMID: 8662251 DOI: 10.1007/bf00352298] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The chromosomal locations of four families of transposable elements, T1, Q, Pegasus and mariner, have been determined by in situ hybridization to polytene chromosomes of ovarian nurse cells of the mosquito Anopheles gambiae. As part of this effort, we have developed a vigorous pink-eyed laboratory strain of A. gambiae (PEST), rendered homozygous standard for chromosomal inversions on all autosomes. Ten different individuals of this strain were studied with each transposable element probe. The average number of hybridization sites per genome was 83.9 for T1, 63.4 for Q, 31.5 for Pegasus and 64.7 for mariner, excluding pericentric and centromeric regions. However, some degree of polymorphism was observed within each family such that, considering all ten individuals, 94 different sites were detected for T1, 82 sites for Q, 45 sites for Pegasus and 71 sites for mariner. The mean occupancy per site varied from 0.70 (Pegasus) to 0.91 (mariner), which, while significantly higher than that seen for transposable elements in natural populations of Drosophila melanogaster, is comparable to that seen in established laboratory stocks. In addition, these element families were not randomly distributed. All but Pegasus were concentrated in centromeric heterochromatin and centromere-proximal euchromatin, most showed a deficit of hybridization sites in the distal section of chromosomes, and a significant proportion of sites were coincident between families. These results provide the first detailed examination of the cytogenetic location of transposable elements in a nondrosophilid insect, and, through comparison with the behavior of transposable elements in Drosophila, may provide insight into the interaction between elements and host. The mapped elements are also expected to serve as landmarks useful in integrating the developing physical map of the PEST strain with the chromosomal banding pattern.
Collapse
Affiliation(s)
- O Mukabayire
- Division of Parasitic Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA
| | | |
Collapse
|
22
|
Abstract
Most of the transposons so far characterized from mosquito genomes are retroelements which seem to be distributed worldwide. The Juan transposons constitute a family of non-LTR retroelements, or LINE-retroposons, which are dispersed in the genomes of several mosquito species. Three different Juan subfamilies have been characterized, each being amplified in the genomes of many strains, if not all, of a given mosquito species. These subfamilies have been designated respectively Juan-C in Culex pipiens, Juan-Ct in Culex tarsalis and Juan-A in Aedes aegypti. A large number of the Juan retroposons which are amplified in the mosquito genomes are apparently full-length copies and potentially encode the enzymes necessary for their transposition, a nucleic acid binding protein and a reverse transcriptase. However, these complete Juan copies seem to be most frequently transcriptionally silent in insects reared under laboratory conditions. A few of them are transcribed in C. pipiens cells grown in vitro, but from an external promoter, the Juan-C specific RNA being fused to an upstream RNA sequence. Therefore, the transcription of Juan retroposons seems to depend on external promoters which are most frequently inactive. The occurrence and distribution of Juan retroposon subfamilies among mosquito species do not reflect the phylogeny of these species. Furthermore, complete Juan-C and Juan-A copies which are reiterated in strains collected from regions covering different continents are nearly identical. Juan-C copies belonging to geographically different C. pipiens strains display low levels of divergence between their nucleotide sequences and many of the mutations which have occurred among these copies do not alter their coding potential. These results indicate that the Juan retroposons occur as homogeneous subfamilies distributed worldwide and that selective constraints against amino acid change have been acting recently on these elements, despite the fact that they are now highly repeated through mosquito genomes. Therefore, Juan transposons have most probably been recently amplified in mosquito genomes. Each subfamily may have been amplified from one master element present in a unique population which has since spread worldwide. Alternatively, this amplification may have arisen in many mosquito populations, but from highly conserved master elements submitted to selection pressures. Horizontal transfers between species may also have contributed to the spread of these transposons.
Collapse
Affiliation(s)
- N Bensaadi-Merchermek
- Laboratoire d'Ecologie Moléculaire, Université de Pau et des Pays de l'Adour, France
| | | | | |
Collapse
|
23
|
Kumar V, Collins FH. A technique for nucleic acid in situ hybridization to polytene chromosomes of mosquitoes in the Anopheles gambiae complex. INSECT MOLECULAR BIOLOGY 1994; 3:41-47. [PMID: 8069415 DOI: 10.1111/j.1365-2583.1994.tb00149.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
A sensitive, simple, and reproducible in situ hybridization technique for the detection and precise localization of specific nucleic acid sequences on chromosomes of members of the Anopheles gambiae complex is described. Modifications of the in situ hybridization technique are described that allow simultaneous hybridization of several probes with the chromosomes on a single slide and the multiple use of a single chromosome preparation for several different probes hybridized successively on the same slide. Examples are shown that illustrate the utility of the technique for localization of both single copy and repeated sequences in both polytenized euchromatin and centromeric heterochromatin.
Collapse
Affiliation(s)
- V Kumar
- Malaria Branch, Centers for Disease Control and Prevention, Atlanta, GA 30341-3724
| | | |
Collapse
|
24
|
Besansky NJ, Bedell JA, Mukabayire O. Q: a new retrotransposon from the mosquito Anopheles gambiae. INSECT MOLECULAR BIOLOGY 1994; 3:49-56. [PMID: 8069416 DOI: 10.1111/j.1365-2583.1994.tb00150.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
A new family of retrotransposons (RTPs) without long terminal repeats (LTRs), designated Q, has been isolated from the malaria vector Anopheles gambiae. The nucleotide sequence of a complete element Q-22, was determined and analysed. Approximately 4.5 kb long, Q-22 contains two long overlapping open reading frames (ORFs) that potentially encode proteins with nucleic acid binding and reverse transcriptase domains similar to those of non-LTR RTPs previously described. The 3' end is characterized by variable numbers of the triplet repeat TAA, immediately following a polyadenylation signal. In situ hybridization of nurse cell polytene chromosomes revealed about twenty labelled sites distributed over all arms and diffuse hybridization to the chromocentre. Cross-hybridizing sequences with the same internal structure occur in all members of the A. gambiae complex. Genomic Southerns of wild A. gambiae specimens probed with Q suggest that Q is or recently was capable of retrotransposition.
Collapse
Affiliation(s)
- N J Besansky
- Malaria Branch, Centers for Disease Control, Atlanta, GA 30341
| | | | | |
Collapse
|
25
|
Besansky NJ. Codon usage patterns in chromosomal and retrotransposon genes of the mosquito Anopheles gambiae. INSECT MOLECULAR BIOLOGY 1993; 1:171-178. [PMID: 8269095 DOI: 10.1111/j.1365-2583.1993.tb00089.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Codon usage was compiled for fourteen chromosomal genes and four retrotransposons from the mosquito Anopheles gambiae. Variation exists among chromosomal genes in the degree of bias. The genes showing the highest bias are probably most highly expressed. In these genes, the base composition at the third codon position is much richer in G + C than is the overall coding sequence. Thus, codon usage is biased toward G- or C-ending codons. Codon usage in each retrotransposon is quite different, not only from chromosomal genes but also from the other retrotransposons. Codon usage comparisons among homologous genes from An. gambiae and two other Dipterans, the yellow fever mosquito Aedes aegypti and the fruitfly Drosophila melanogaster, show that while there are similarities, particularly between An. gambiae and D. melanogaster in the preference for G- and C-ending codons, each species has evolved a distinct pattern of codon usage.
Collapse
Affiliation(s)
- N J Besansky
- Malaria Branch, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30333
| |
Collapse
|
26
|
Besansky NJ, Finnerty V, Collins FH. Molecular Perspectives on the Genetics of Mosquitoes. ADVANCES IN GENETICS 1992; 30:123-84. [PMID: 1360745 DOI: 10.1016/s0065-2660(08)60320-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- N J Besansky
- Malaria Branch, Centers for Disease Control, Atlanta, Georgia 30333
| | | | | |
Collapse
|