1
|
Srivastav SP, Feschotte C, Clark AG. Rapid evolution of piRNA clusters in the Drosophila melanogaster ovary. Genome Res 2024; 34:711-724. [PMID: 38749655 PMCID: PMC11216404 DOI: 10.1101/gr.278062.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 05/07/2024] [Indexed: 05/28/2024]
Abstract
The piRNA pathway is a highly conserved mechanism to repress transposable element (TE) activity in the animal germline via a specialized class of small RNAs called piwi-interacting RNAs (piRNAs). piRNAs are produced from discrete genomic regions called piRNA clusters (piCs). Although the molecular processes by which piCs function are relatively well understood in Drosophila melanogaster, much less is known about the origin and evolution of piCs in this or any other species. To investigate piC origin and evolution, we use a population genomic approach to compare piC activity and sequence composition across eight geographically distant strains of D. melanogaster with high-quality long-read genome assemblies. We perform annotations of ovary piCs and genome-wide TE content in each strain. Our analysis uncovers extensive variation in piC activity across strains and signatures of rapid birth and death of piCs. Most TEs inferred to be recently active show an enrichment of insertions into old and large piCs, consistent with the previously proposed "trap" model of piC evolution. In contrast, a small subset of active LTR families is enriched for the formation of new piCs, suggesting that these TEs have higher proclivity to form piCs. Thus, our findings uncover processes leading to the origin of piCs. We propose that piC evolution begins with the emergence of piRNAs from individual insertions of a few select TE families prone to seed new piCs that subsequently expand by accretion of insertions from most other TE families during evolution to form larger "trap" clusters. Our study shows that TEs themselves are the major force driving the rapid evolution of piCs.
Collapse
Affiliation(s)
- Satyam P Srivastav
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
2
|
Srivastav S, Feschotte C, Clark AG. Rapid evolution of piRNA clusters in the Drosophila melanogaster ovary. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.08.539910. [PMID: 37214865 PMCID: PMC10197564 DOI: 10.1101/2023.05.08.539910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Animal genomes are parasitized by a horde of transposable elements (TEs) whose mutagenic activity can have catastrophic consequences. The piRNA pathway is a conserved mechanism to repress TE activity in the germline via a specialized class of small RNAs associated with effector Piwi proteins called piwi-associated RNAs (piRNAs). piRNAs are produced from discrete genomic regions called piRNA clusters (piCs). While piCs are generally enriched for TE sequences and the molecular processes by which they are transcribed and regulated are relatively well understood in Drosophila melanogaster, much less is known about the origin and evolution of piCs in this or any other species. To investigate piC evolution, we use a population genomics approach to compare piC activity and sequence composition across 8 geographically distant strains of D. melanogaster with high quality long-read genome assemblies. We perform extensive annotations of ovary piCs and TE content in each strain and test predictions of two proposed models of piC evolution. The 'de novo' model posits that individual TE insertions can spontaneously attain the status of a small piC to generate piRNAs silencing the entire TE family. The 'trap' model envisions large and evolutionary stable genomic clusters where TEs tend to accumulate and serves as a long-term "memory" of ancient TE invasions and produce a great variety of piRNAs protecting against related TEs entering the genome. It remains unclear which model best describes the evolution of piCs. Our analysis uncovers extensive variation in piC activity across strains and signatures of rapid birth and death of piCs in natural populations. Most TE families inferred to be recently or currently active show an enrichment of strain-specific insertions into large piCs, consistent with the trap model. By contrast, only a small subset of active LTR retrotransposon families is enriched for the formation of strain-specific piCs, suggesting that these families have an inherent proclivity to form de novo piCs. Thus, our findings support aspects of both 'de novo' and 'trap' models of piC evolution. We propose that these two models represent two extreme stages along an evolutionary continuum, which begins with the emergence of piCs de novo from a few specific LTR retrotransposon insertions that subsequently expand by accretion of other TE insertions during evolution to form larger 'trap' clusters. Our study shows that piCs are evolutionarily labile and that TEs themselves are the major force driving the formation and evolution of piCs.
Collapse
Affiliation(s)
- Satyam Srivastav
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| | - Andrew G. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| |
Collapse
|
3
|
Maxwell PH, Belote JM, Levis RW. Identification of multiple transcription initiation, polyadenylation, and splice sites in the Drosophila melanogaster TART family of telomeric retrotransposons. Nucleic Acids Res 2006; 34:5498-507. [PMID: 17020919 PMCID: PMC1636488 DOI: 10.1093/nar/gkl709] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The Drosophila non-long terminal repeat (non-LTR) retrotransposons TART and HeT-A specifically retrotranspose to chromosome ends to maintain Drosophila telomeric DNA. Relatively little is known, though, about the regulation of their expression and their retrotransposition to telomeres. We have used rapid amplification of cDNA ends (RACE) to identify multiple transcription initiation and polyadenylation sites for sense and antisense transcripts of three subfamilies of TART elements in Drosophila melanogaster. These results are consistent with the production of an array of TART transcripts. In contrast to other Drosophila non-LTR elements, a major initiation site for sense transcripts was mapped near the 3′ end of the TART 5′-untranslated region (5′-UTR), rather than at the start of the 5′-UTR. A sequence overlapping this sense start site contains a good match to an initiator consensus for the transcription start sites of Drosophila LTR retrotransposons. Interestingly, analysis of 5′ RACE products for antisense transcripts and the GenBank EST database revealed that TART antisense transcripts contain multiple introns. Our results highlight differences between transcription of TART and of other Drosophila non-LTR elements and they provide a foundation for testing the relationship between exceptional aspects of TART transcription and TART's specialized role at telomeres.
Collapse
Affiliation(s)
- Patrick H Maxwell
- Department of Biology, Syracuse University, 130 College Place, Syracuse, NY 13244, USA.
| | | | | |
Collapse
|
4
|
Kutach AK, Kadonaga JT. The downstream promoter element DPE appears to be as widely used as the TATA box in Drosophila core promoters. Mol Cell Biol 2000; 20:4754-64. [PMID: 10848601 PMCID: PMC85905 DOI: 10.1128/mcb.20.13.4754-4764.2000] [Citation(s) in RCA: 242] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The downstream promoter element (DPE) functions cooperatively with the initiator (Inr) for the binding of TFIID in the transcription of core promoters in the absence of a TATA box. We examined the properties of sequences that can function as a DPE as well as the range of promoters that use the DPE as a core promoter element. By using an in vitro transcription assay, we identified 17 new DPE-dependent promoters and found that all possessed identical spacing between the Inr and DPE. Moreover, mutational analysis indicated that the insertion or deletion of a single nucleotide between the Inr and DPE causes a reduction in transcriptional activity and TFIID binding. To explore the range of sequences that can function as a DPE, we constructed and analyzed randomized promoter libraries. These experiments yielded the DPE functional range set, which represents sequences that contribute to or are compatible with DPE function. We then analyzed the DPE functional range set in conjunction with a Drosophila core promoter database that we compiled from 205 promoters with accurately mapped start sites. Somewhat surprisingly, the DPE sequence motif is as common as the TATA box in Drosophila promoters. There is, in addition, a striking adherence of Inr sequences to the Inr consensus in DPE-containing promoters relative to DPE-less promoters. Furthermore, statistical and biochemical analyses indicated that a G nucleotide between the Inr and DPE contributes to transcription from DPE-containing promoters. Thus, these data reveal that the DPE exhibits a strict spacing requirement yet some sequence flexibility and appears to be as widely used as the TATA box in Drosophila.
Collapse
Affiliation(s)
- A K Kutach
- Department of Biology and Center for Molecular Genetics, University of California, San Diego, La Jolla 92093-0347, USA
| | | |
Collapse
|
5
|
Marsano RM, Moschetti R, Caggese C, Lanave C, Barsanti P, Caizzi R. The complete Tirant transposable element in Drosophila melanogaster shows a structural relationship with retrovirus-like retrotransposons. Gene 2000; 247:87-95. [PMID: 10773447 DOI: 10.1016/s0378-1119(00)00115-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
We have determined the structure and organization of Tirant, a retrotransposon of Drosophila melanogaster reported in literature to be responsible for four independent mutations. Tirant is a long terminal repeat (LTR) retrotransposon 8527bp long. It possesses three open reading frames (ORF) encoding Gag, Pol and Env proteins with a strong similarity with ZAM, a recently identified member of the gypsy class of retrovirus-like mobile elements. Molecular analysis of the Tirant genomic copies present in four D. melanogaster strains revealed that most of them are defective, non-autonomous elements that differ in the position and extension of the conserved internal portion. Defective elements lacking the Gag ORF but retaining the Env ORF are abundant in heterochromatin. Four discrete Tirant transcripts are observed during embryogenesis in the strain Oregon-R, the smaller of which, 1.8kb in size, originates from the splicing of a primary transcript and leads to a subgenomic RNA coding for the Env product.
Collapse
Affiliation(s)
- R M Marsano
- Istituto di Genetica, Università di Bari, Via Amendola 165/A, 70126, Bari, Italy
| | | | | | | | | | | |
Collapse
|
6
|
Ponomarenko NA, Bannikov VM, Anashchenko VA, Tchurikov NA. Burdock, a novel retrotransposon in Drosophila melanogaster, integrates into the coding region of the cut locus. FEBS Lett 1997; 413:7-10. [PMID: 9287107 DOI: 10.1016/s0014-5793(97)00844-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The burdock element is known to be the 2.6-kb insertion into the same region of the cut locus in 12 independently obtained ct-lethal mutants. Here we have determined the complete sequences of this insertion and of the hot spot region. It was found that the burdock is a short retrotransposon with long terminal repeats and a single open reading frame (ORF). The polypeptide encoded by the burdock ORF contains two adjacent regions homologous to the gag and pol polyproteins of the gypsy mobile element. The burdock insertion interrupts the short ORF of the cut locus. The target site sequence of the burdock insertions is similar to the Drosophila topoisomerase II cleavage site.
Collapse
Affiliation(s)
- N A Ponomarenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Science, Moscow
| | | | | | | |
Collapse
|
7
|
Burke TW, Kadonaga JT. Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters. Genes Dev 1996; 10:711-24. [PMID: 8598298 DOI: 10.1101/gad.10.6.711] [Citation(s) in RCA: 290] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
We describe the identification and characterization of a conserved downstream basal promoter element that is present in a subset of Drosophila TATA-box-deficient (TATA-less) promoters by using purified, epitope-tagged TFIID complex (eTFIID) from embryos of transgenic Drosophila. DNase I footprinting of the binding of eTFIID to TATA-less promoters revealed that the factor protected a region that extended from the initiation site sequence (about +1) to approximately 35 nucleotides downstream of the RNA start site. In contrast, there was no apparent upstream DNase I protection or hypersensitivity induced by eTFIID in the -25 to -30 region at which TATA motifs are typically located. Further studies revealed a conserved sequence motif, (A/G)G(A/T)CGTG, termed the downstream promoter element (DPE), which is located approximately 30 nucleotides downstream of the RNA start site of many TATA-less promoters. DNase I footprinting and in vitro transcription experiments revealed that a DPE in its normal downstream location is necessary for transcription of DPE-containing TATA-less promoters and can compensate for the disruption of an upstream TATA box of a TATA-containing promoter. Moreover, a systematic mutational analysis of DNA sequences that encompass the DPE confirmed the importance of the consensus DPE sequence motif for basal transcription and further supports the postulate that the DPE is a distinct, downstream basal promoter element. These results suggest that the DPE acts in conjunction with the initiation site sequence to provide a binding site for TFIID in the absence of a TATA box to mediate transcription of TATA-less promoters.
Collapse
Affiliation(s)
- T W Burke
- Department of Biology and Center for Molecular Genetics, University of California at San Diego, La Jolla, 92093-0347, USA
| | | |
Collapse
|
8
|
Mozer BA, Benzer S. Ingrowth by photoreceptor axons induces transcription of a retrotransposon in the developing Drosophila brain. Development 1994; 120:1049-58. [PMID: 8026319 DOI: 10.1242/dev.120.5.1049] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The development of the lamina, the first optic ganglion of the fly visual system, depends on inductive cues from the innervating photoreceptor axons. lacZ expression from a P-element insertion, A72, occurs in the anlage of the lamina coincident with axon ingrowth from the eye imaginal disc. In eyeless mutants lacking photoreceptor axons, lacZ expression did not occur. The P-element was found to have inserted within the 3′ long terminal repeat (LTR) of a ‘17.6′ type retrotransposon. The expression pattern of 17.6 transcripts in the brain in wild-type and eyeless mutants paralleled the expression of the lacZ reporter. Analysis of 17.6 cis-regulatory sequences indicates that the lamina-specific expression is due to the combined action of an enhancer element in the LTR and a repressor element within the internal body of the retrotransposon. The regulation of the 17.6 retrotransposon provides a model for the study of innervation-dependent gene expression in postsynaptic cells during neurogenesis.
Collapse
Affiliation(s)
- B A Mozer
- Division of Biology, California Institute of Technology, Pasadena 91125
| | | |
Collapse
|
9
|
Abstract
Studies of transcriptional control sequences responsible for regulated and basal-level RNA synthesis from promoters of Drosophila melanogaster retrotransposons reveal novel aspects of gene regulation and lead to identification of trans-acting factors that can be involved in RNA polymerase II transcription not only of retrotransposons, but of many other cellular genes. Comparisons between promoters of retrotransposons and some other Drosophila genes demonstrate that there is a greater variety in basal promoter structure than previously thought and that many promoters may contain essential sequences downstream from the RNA start site.
Collapse
Affiliation(s)
- I R Arkhipova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow
| | | |
Collapse
|
10
|
Garfinkel DJ, Hedge AM, Youngren SD, Copeland TD. Proteolytic processing of pol-TYB proteins from the yeast retrotransposon Ty1. J Virol 1991; 65:4573-81. [PMID: 1714514 PMCID: PMC248911 DOI: 10.1128/jvi.65.9.4573-4581.1991] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Using antibodies directed against the TYB1 protein of the transpositionally competent retrotransposon Ty1-H3, we have identified three mature proteins of 23, 60, and 90 kDa and processing intermediates of 140 and 160 kDa that are derived from the 190-kDa TYA1-TYB1 polyprotein. Mature proteins and variable amounts of the precursors cofractionate with Ty viruslike particles. The map locations and precursor-product relationships of mature TYB1 polypeptides suggest that p23 is Ty1 protease, p90 is integrase, and p60 contains reverse transcriptase and RNase H. Immunoprecipitation and immunoblot analyses of Ty1 proteins show that p190 is cleaved to form p160. The p160 intermediate is cleaved to form p23 and p140, and p140 is cleaved to form p90 and p60. Processing of TYB1 proteins is dependent on Ty1 protease. Immunoblot analysis of TYB proteins from different Ty1 isolates reveal that correct processing of TYB1 proteins is a characteristic of functional Ty1 elements, whereas aberrant processing is a common defect found in transposition-incompetent elements.
Collapse
Affiliation(s)
- D J Garfinkel
- ABL-Basic Research Program, NCI-Frederick Cancer Research and Development Center, Frederick, Maryland 21702-1201
| | | | | | | |
Collapse
|
11
|
McCurrach KJ, Rothnie HM, Hardman N, Glover LA. Identification of a second retrotransposon-related element in the genome of Physarum polycephalum. Curr Genet 1990; 17:403-8. [PMID: 2162742 DOI: 10.1007/bf00334518] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The repetitive fraction of the genome of the eukaryotic slime mould Physarum polycephalum is dominated by Tp1, a family of retrotransposon-like sequences. Tp1 elements are arranged in scrambled clusters probably arising from integration of the element into copies of its own sequence. The present report describes a second sequence family, Tp2, which has been identified within cloned DNA segments of scrambled Tp1 sequences. Like Tp1, the Tp2 element is structurally related to retrotransposons, having long terminal direct repeats and being flanked by an apparent target site duplication, but its relatively short length (1.68 kb) indicates that it is probably incapable of encoding all the functions necessary for its own mobilisation. Analysis of the coding potential of the Tp2 element supports this view, although a striking homology to a nucleic acid binding domain common to many retrotransposons was identified. As with Tp1, putative regulatory signals can be identified in the LTRs of Tp2. Identical arrangements of Tp2 with respect to Tp1 in more than one independently derived clone indicate that non-functional copies of Tp2 may be mobilised as part of a Tp1 transcriptional unit.
Collapse
Affiliation(s)
- K J McCurrach
- Department of Biochemistry, University of Aberdeen, Marischal College, UK
| | | | | | | |
Collapse
|
12
|
Jakubczak JL, Xiong Y, Eickbush TH. Type I (R1) and type II (R2) ribosomal DNA insertions of Drosophila melanogaster are retrotransposable elements closely related to those of Bombyx mori. J Mol Biol 1990; 212:37-52. [PMID: 1690812 DOI: 10.1016/0022-2836(90)90303-4] [Citation(s) in RCA: 117] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Approximately 50% of the ribosomal DNA (rDNA) units of Drosophila melanogaster are inactivated by two different 28 S RNA ribosomal gene insertions (type I and type II). We present here the nucleotide sequence of complete type I and type II elements. Conceptual translation of these sequences revealed open reading frames (ORFs) encoding amino acid residues conserved in all retrotransposable elements. Full-length type I elements are 5.35 x 10(3) base-pairs in length and contain two overlapping ORFs. The smaller ORF (471 amino acid residues) has similarity to gag genes, while the larger ORF (1021 residues) has similarity to pol genes. Full-length type II elements are 3.6 x 10(3) base-pairs and contain one large ORF (1056 residues) that appears to represent a gag-pol fusion. Type I and type II elements are similar in structure, in the proteins they encode, and in insertion specificity to the R1Bm and R2Bm retrotransposable elements of Bombyx mori. We suggest that the D. melanogaster elements be called R1Dm and R2Dm, to reflect their structure as retrotransposons. Comparison of the R1 and R2 elements from these two widely different species revealed regions of the ORF that are likely to play an important role in the propagation of the elements. Four distinct regions of sequence conservation separated by regions of little or no sequence similarity were detected for both the R1 and R2 elements: (1) cysteine motifs of the gag gene, with three such motifs for R1 and one motif for R2; (2) a reverse transcriptase domain; (3) an integrase domain located carboxyl terminal to the reverse transcriptase region; and (4) a small region amino terminal to the reverse transcriptase domain, whose function is not known. The level of identity of the amino acid residues for these segments is 28 to 34% between the R1 elements, and 34 to 39% for the R2 elements. Finally, it may be predicted that the mechanism of unequal crossover might eventually eliminate R1 and R2 from the rDNA locus. The long history of selection at the protein level exhibited by these elements indicates that it is their active transposition that maintains them in the locus. The high level of sequence homogeneity between copies of each element within the same species is consistent with the high turnover rate expected to result from these processes.
Collapse
Affiliation(s)
- J L Jakubczak
- Department of Biology, University of Rochester, NY 14627
| | | | | |
Collapse
|
13
|
Affiliation(s)
- G Echalier
- URA 7 CNRS, Laboratoire de Zoologie, Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
14
|
Tanda S, Shrimpton AE, Chueh LL, Itayama H, Matsubayashi H, Saigo K, Tobari YN, Langley CH. Retrovirus-like features and site specific insertions of a transposable element, tom, in Drosophila ananassae. MOLECULAR & GENERAL GENETICS : MGG 1988; 214:405-11. [PMID: 2851093 DOI: 10.1007/bf00330473] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The tom element, putatively associated with optic morphology (Om) mutations in Drosophila ananassae, was identified as a retrovirus-like transposable element. The tom element was found to terminate with 475 (or 474) base pair direct repeats which are identical in sequence to each other. Southern blot and heteroduplex analyses showed the tom element to have high homology to 297 and 17.6, two retrotransposons found in D. melanogaster. As in the cases of 297 and 17.6, tom includes nucleotide sequences coding for a presumptive protease and reverse transcriptase, similar in amino acid sequence to those of the Moloney murine leukaemia virus. At the tom insertion site of the sn9g locus, a host DNA sequence (T)ATAT was found to be duplicated on each side of the tom insertion and all other tom elements examined were also flanked by (T)ATAT. In each of six cases, the 5' flanking host sequence was TATAT. These results indicate that the target sequence of the tom element may be TATAT and that the entire region or a part of this sequence was duplicated on insertion of the tom element.
Collapse
Affiliation(s)
- S Tanda
- Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709
| | | | | | | | | | | | | | | |
Collapse
|