1
|
Gerdes P, Chan D, Lundberg M, Sanchez-Luque FJ, Bodea GO, Ewing AD, Faulkner GJ, Richardson SR. Locus-resolution analysis of L1 regulation and retrotransposition potential in mouse embryonic development. Genome Res 2023; 33:1465-1481. [PMID: 37798118 PMCID: PMC10620060 DOI: 10.1101/gr.278003.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/21/2023] [Indexed: 10/07/2023]
Abstract
Mice harbor ∼2800 intact copies of the retrotransposon Long Interspersed Element 1 (L1). The in vivo retrotransposition capacity of an L1 copy is defined by both its sequence integrity and epigenetic status, including DNA methylation of the monomeric units constituting young mouse L1 promoters. Locus-specific L1 methylation dynamics during development may therefore elucidate and explain spatiotemporal niches of endogenous retrotransposition but remain unresolved. Here, we interrogate the retrotransposition efficiency and epigenetic fate of source (donor) L1s, identified as mobile in vivo. We show that promoter monomer loss consistently attenuates the relative retrotransposition potential of their offspring (daughter) L1 insertions. We also observe that most donor/daughter L1 pairs are efficiently methylated upon differentiation in vivo and in vitro. We use Oxford Nanopore Technologies (ONT) long-read sequencing to resolve L1 methylation genome-wide and at individual L1 loci, revealing a distinctive "smile" pattern in methylation levels across the L1 promoter region. Using Pacific Biosciences (PacBio) SMRT sequencing of L1 5' RACE products, we then examine DNA methylation dynamics at the mouse L1 promoter in parallel with transcription start site (TSS) distribution at locus-specific resolution. Together, our results offer a novel perspective on the interplay between epigenetic repression, L1 evolution, and genome stability.
Collapse
Affiliation(s)
- Patricia Gerdes
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Dorothy Chan
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Mischa Lundberg
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland 4102, Australia
- Translational Bioinformatics, Commonwealth Scientific and Industrial Research Organisation, Sydney, New South Wales 2113, Australia
| | - Francisco J Sanchez-Luque
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- GENYO. Centre for Genomics and Oncological Research (Pfizer-University of Granada-Andalusian Regional Government), PTS Granada, 18016, Spain
- MRC Human Genetics Unit, Institute of Genetics and Cancer (IGC), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Gabriela O Bodea
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Adam D Ewing
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Sandra R Richardson
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
| |
Collapse
|
2
|
Del Re B, Giorgi G. Long INterspersed element-1 mobility as a sensor of environmental stresses. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS 2020; 61:465-493. [PMID: 32144842 DOI: 10.1002/em.22366] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 03/03/2020] [Accepted: 03/04/2020] [Indexed: 06/10/2023]
Abstract
Long INterspersed element (LINE-1, L1) retrotransposons are the most abundant transposable elements in the human genome, constituting approximately 17%. They move by a "copy-paste" mechanism, involving reverse transcription of an RNA intermediate and insertion of its cDNA copy at a new site in the genome. L1 retrotransposition (L1-RTP) can cause insertional mutations, alter gene expression, transduce exons, and induce epigenetic dysregulation. L1-RTP is generally repressed; however, a number of observations collected over about 15 years revealed that it can occur in response to environmental stresses. Moreover, emerging evidence indicates that L1-RTP can play a role in the onset of several neurological and oncological diseases in humans. In recent years, great attention has been paid to the exposome paradigm, which proposes that health effects of an environmental factor should be evaluated considering both cumulative environmental exposures and the endogenous processes resulting from the biological response. L1-RTP could be an endogenous process considered for this application. Here, we summarize the current understanding of environmental factors that can affect the retrotransposition of human L1 elements. Evidence indicates that L1-RTP alteration is triggered by numerous and various environmental stressors, such as chemical agents (heavy metals, carcinogens, oxidants, and drugs), physical agents (ionizing and non-ionizing radiations), and experiential factors (voluntary exercise, social isolation, maternal care, and environmental light/dark cycles). These data come from in vitro studies on cell lines and in vivo studies on transgenic animals: future investigations should be focused on physiologically relevant models to gain a better understanding of this topic.
Collapse
Affiliation(s)
- Brunella Del Re
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum University of Bologna, Bologna, Italy
| | - Gianfranco Giorgi
- Department of Biological, Geological and Environmental Sciences, Alma Mater Studiorum University of Bologna, Bologna, Italy
| |
Collapse
|
3
|
Blumenstiel JP. Birth, School, Work, Death, and Resurrection: The Life Stages and Dynamics of Transposable Element Proliferation. Genes (Basel) 2019; 10:genes10050336. [PMID: 31058854 PMCID: PMC6562965 DOI: 10.3390/genes10050336] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Revised: 04/18/2019] [Accepted: 04/23/2019] [Indexed: 12/18/2022] Open
Abstract
Transposable elements (TEs) can be maintained in sexually reproducing species even if they are harmful. However, the evolutionary strategies that TEs employ during proliferation can modulate their impact. In this review, I outline the different life stages of a TE lineage, from birth to proliferation to extinction. Through their interactions with the host, TEs can exploit diverse strategies that range from long-term coexistence to recurrent movement across species boundaries by horizontal transfer. TEs can also engage in a poorly understood phenomenon of TE resurrection, where TE lineages can apparently go extinct, only to proliferate again. By determining how this is possible, we may obtain new insights into the evolutionary dynamics of TEs and how they shape the genomes of their hosts.
Collapse
Affiliation(s)
- Justin P Blumenstiel
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66049, USA.
| |
Collapse
|
4
|
Sookdeo A, Hepp CM, Boissinot S. Contrasted patterns of evolution of the LINE-1 retrotransposon in perissodactyls: the history of a LINE-1 extinction. Mob DNA 2018; 9:12. [PMID: 29610583 PMCID: PMC5872511 DOI: 10.1186/s13100-018-0117-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 03/22/2018] [Indexed: 12/30/2022] Open
Abstract
Background LINE-1 (L1) is the dominant autonomously replicating non-LTR retrotransposon in mammals. Although our knowledge of L1 evolution across the tree of life has considerably improved in recent years, what we know of L1 evolution in mammals is biased and comes mostly from studies in primates (mostly human) and rodents (mostly mouse). It is unclear if patterns of evolution that are shared between those two groups apply to other mammalian orders. Here we performed a detailed study on the evolution of L1 in perissodactyls by making use of the complete genome of the domestic horse and of the white rhinoceros. This mammalian order offers an excellent model to study the extinction of L1 since the rhinoceros is one of the few mammalian species to have lost active L1. Results We found that multiple L1 lineages, carrying different 5’UTRs, have been simultaneously active during the evolution of perissodactyls. We also found that L1 has continuously amplified and diversified in horse. In rhinoceros, L1 was very prolific early on. Two successful families were simultaneously active until ~20my ago but became extinct suddenly at exactly the same time. Conclusions The general pattern of L1 evolution in perissodactyls is very similar to what was previously described in mouse and human, suggesting some commonalities in the way mammalian genomes interact with L1. We confirmed the extinction of L1 in rhinoceros and we discuss several possible mechanisms. Electronic supplementary material The online version of this article (10.1186/s13100-018-0117-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Akash Sookdeo
- 1Department of Biology, New York University, New York, NY USA
| | - Crystal M Hepp
- 2School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ USA
| | - Stéphane Boissinot
- 3New York University Abu Dhabi, Saadiyat Island, Abu Dhabi, United Arab Emirates
| |
Collapse
|
5
|
Sookdeo A, Hepp CM, McClure MA, Boissinot S. Revisiting the evolution of mouse LINE-1 in the genomic era. Mob DNA 2013; 4:3. [PMID: 23286374 PMCID: PMC3600994 DOI: 10.1186/1759-8753-4-3] [Citation(s) in RCA: 115] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Accepted: 10/25/2012] [Indexed: 11/10/2022] Open
Abstract
Background LINE-1 (L1) is the dominant category of transposable elements in placental mammals. L1 has significantly affected the size and structure of all mammalian genomes and understanding the nature of the interactions between L1 and its mammalian host remains a question of crucial importance in comparative genomics. For this reason, much attention has been dedicated to the evolution of L1. Among the most studied elements is the mouse L1 which has been the subject of a number of studies in the 1980s and 1990s. These seminal studies, performed in the pre-genomic era when only a limited number of L1 sequences were available, have significantly improved our understanding of L1 evolution. Yet, no comprehensive study on the evolution of L1 in mouse has been performed since the completion of this genome sequence. Results Using the Genome Parsing Suite we performed the first evolutionary analysis of mouse L1 over the entire length of the element. This analysis indicates that the mouse L1 has recruited novel 5’UTR sequences more frequently than previously thought and that the simultaneous activity of non-homologous promoters seems to be one of the conditions for the co-existence of multiple L1 families or lineages. In addition the exchange of genetic information between L1 families is not limited to the 5’UTR as evidence of inter-family recombination was observed in ORF1, ORF2, and the 3’UTR. In contrast to the human L1, there was little evidence of rapid amino-acid replacement in the coiled-coil of ORF1, although this region is structurally unstable. We propose that the structural instability of the coiled-coil domain might be adaptive and that structural changes in this region are selectively equivalent to the rapid evolution at the amino-acid level reported in the human lineage. Conclusions The pattern of evolution of L1 in mouse shows some similarity with human suggesting that the nature of the interactions between L1 and its host might be similar in these two species. Yet, some notable differences, particularly in the evolution of ORF1, suggest that the molecular mechanisms involved in host-L1 interactions might be different in these two species.
Collapse
Affiliation(s)
- Akash Sookdeo
- Department of Biology, Queens College, the City University of New York, 65-30 Kissena Boulevard, Flushing, NY 11367-1597, USA.
| | | | | | | |
Collapse
|
6
|
Khan H, Smit A, Boissinot S. Molecular evolution and tempo of amplification of human LINE-1 retrotransposons since the origin of primates. Genome Res 2005; 16:78-87. [PMID: 16344559 PMCID: PMC1356131 DOI: 10.1101/gr.4001406] [Citation(s) in RCA: 253] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
We investigated the evolution of the families of LINE-1 (L1) retrotransposons that have amplified in the human lineage since the origin of primates. We identified two phases in the evolution of L1. From approximately 70 million years ago (Mya) until approximately 40 Mya, three distinct L1 lineages were simultaneously active in the genome of ancestral primates. In contrast, during the last 40 million years (Myr), i.e., during the evolution of anthropoid primates, a single lineage of families has evolved and amplified. We found that novel (i.e., unrelated) regulatory regions (5'UTR) have been frequently recruited during the evolution of L1, whereas the two open-reading frames (ORF1 and ORF2) have remained relatively conserved. We found that L1 families coexisted and formed independently evolving L1 lineages only when they had different 5'UTRs. We propose that L1 families with different 5'UTR can coexist because they don't rely on the same host-encoded factors for their transcription and therefore do not compete with each other. The most prolific L1 families (families L1PA8 to L1PA3) amplified between 40 and 12 Mya. This period of high activity corresponds to an episode of adaptive evolution in a segment of ORF1. The correlation between the high activity of L1 families and adaptive evolution could result from the coevolution of L1 and a host-encoded repressor of L1 activity.
Collapse
Affiliation(s)
- Hameed Khan
- Department of Biology, Queens College, the City University of New York, Flushing, New York 11367, USA
| | | | | |
Collapse
|
7
|
Abstract
L1 retrotransposons comprise 17% of the human genome. Although most L1s are inactive, some elements remain capable of retrotransposition. L1 elements have a long evolutionary history dating to the beginnings of eukaryotic existence. Although many aspects of their retrotransposition mechanism remain poorly understood, they likely integrate into genomic DNA by a process called target primed reverse transcription. L1s have shaped mammalian genomes through a number of mechanisms. First, they have greatly expanded the genome both by their own retrotransposition and by providing the machinery necessary for the retrotransposition of other mobile elements, such as Alus. Second, they have shuffled non-L1 sequence throughout the genome by a process termed transduction. Third, they have affected gene expression by a number of mechanisms. For instance, they occasionally insert into genes and cause disease both in humans and in mice. L1 elements have proven useful as phylogenetic markers and may find other practical applications in gene discovery following insertional mutagenesis in mice and in the delivery of therapeutic genes.
Collapse
Affiliation(s)
- E M Ostertag
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
8
|
Chen HH, Liu TYC, Huang CJ, Choo KB. Generation of two homologous and intronless zinc-finger protein genes, zfp352 and zfp353, with different expression patterns by retrotransposition. Genomics 2002; 79:18-23. [PMID: 11827453 DOI: 10.1006/geno.2001.6664] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have previously reported a mouse zinc-finger protein gene, Zfp352 (formerly 2czf48), that is expressed in early mouse embryos. Here, we report the genomic structure of Zfp352 and its lung-specific homolog, Zfp353. The two genes map on different chromosomes at 4C6 and 8B3.1. Both genes are intronless, except for the presence of a single 4.6-kb intron in the 5' untranslated region of Zfp352. The genes use different RNA start sites located 1.2 kb apart within the 5' homologous region. LINE1 sequences are structurally associated with the genes and form an integral part of Zfp353 transcripts, suggesting previous retrotransposition events. We propose a model of evolution of the genes. The main feature of the model is the presence of a fortuitous upstream promoter and an intron in the first retrotransposition site, creating a pre-Zfp352 gene with a 5' untranslated region intron. A second retrotransposition event copying from the pre-Zfp352 retroposon and removing the fortuitous intron resulted in the intronless Zfp353 at a different chromosomal location and with a different mode of expression. The model may be applicable to other genes with a similar structure with a single intron in the 5' untranslated region. The exact role of LINE1 in the retrotransposition events remains to be elucidated.
Collapse
Affiliation(s)
- Huang-Hui Chen
- Recombinant DNA Laboratory, Department of Medical Research and Education, Veterans General Hospital-Taipei, Shih Pai, Taipei, Taiwan 11217
| | | | | | | |
Collapse
|
9
|
DeBerardinis RJ, Kazazian HH. Analysis of the promoter from an expanding mouse retrotransposon subfamily. Genomics 1999; 56:317-23. [PMID: 10087199 DOI: 10.1006/geno.1998.5729] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The mouse genome contains several subfamilies of the retrotransposon L1. One subfamily, TF, contains 4000-5000 full-length members and is expanding due to retrotransposition of a large number of active elements. Here we studied the TF 5' untranslated region (UTR), which contains promoter activity required for subfamily expression. Using reporter assays, we show that promoter activity is derived from TF-specific monomer sequences and is proportional to the number of monomers in the 5' UTR. These data suggest that nearly all full-length TF elements in the mouse genome are currently competent for expression. We aligned the sequences of 53 monomers to generate a consensus TF monomer and determined that most TF elements are truncated near a potential binding site for a transcription initiation factor. We also determined that much of the sequence variation among TF monomers results from transition mutations at CpG dinucleotides, suggesting that genomic TF 5' UTRs are methylated at CpGs.
Collapse
Affiliation(s)
- R J DeBerardinis
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | | |
Collapse
|
10
|
DeBerardinis RJ, Goodier JL, Ostertag EM, Kazazian HH. Rapid amplification of a retrotransposon subfamily is evolving the mouse genome. Nat Genet 1998; 20:288-90. [PMID: 9806550 DOI: 10.1038/3104] [Citation(s) in RCA: 121] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Retrotransposition affects genome structure by increasing repetition and producing insertional mutations. Dispersion of the retrotransposon L1 throughout mammalian genomes suggests that L1 activity might be an important evolutionary force. Here we report that L1 retrotransposition contributes to rapid genome evolution in the mouse, because a number of L1 sequences from the T(F) subfamily are retrotransposition competent. We show that the T(F) subfamily is large, young and expanding, containing approximately 4,800 full-length members in strain 129. Eleven randomly isolated, full-length T(F) elements averaged 99.8% sequence identity to each other, and seven of these retrotransposed in cultured cells. Thus, we estimate that the mouse genome contains approximately 3,000 active T(F) elements, 75 times the estimated number of active human L1s. Moreover, as T(F) elements are polymorphic among closely related mice, they have retrotransposed recently, implying rapid amplification of the subfamily to yield genomes with different patterns of interspersed repetition. Our data show that mice and humans differ considerably in the number of active L1s, and probably differ in the contribution of retrotransposition to ongoing sequence evolution.
Collapse
Affiliation(s)
- R J DeBerardinis
- Department of Genetics, University of Pennsylvania, Philadelphia 19104, USA
| | | | | | | |
Collapse
|
11
|
Saxton JA, Martin SL. Recombination between subtypes creates a mosaic lineage of LINE-1 that is expressed and actively retrotransposing in the mouse genome. J Mol Biol 1998; 280:611-22. [PMID: 9677292 DOI: 10.1006/jmbi.1998.1899] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
LINE-1, or L1, elements are retrotransposons that have amplified to high-copy number during the evolution of mammals. L1 appears to amplify in waves, spawning large numbers of progeny such that elements with distinct sequence features dominate the dispersal process in a given window of time. This process generates discrete subfamilies of L1 within mammalian genomes, with the oldest being remnants, or fossils, of earlier waves of amplification. In mice, at least three distinct subfamilies of L1 were distinguished by their unique 5' ends, A, F and V. These subfamilies amplified at distinct times in the evolution of mice, with A being the youngest and V the oldest; both V and F subfamilies were believed extinct. Recent data established that a variant of the F family, TF, is actively retrotransposing. We demonstrate here that members of the TF subfamily are abundantly expressed in mouse cells and encode the major protein constituent of L1 ribonucleoprotein particles. Although members of the TF subfamily are not as numerous in the genomes of laboratory mice as are members of the older A and F subfamilies, they appear to have been activated some time ago during mouse evolution, in the common ancestor of Mus spretus and Mus domesticus. Phylogenetic analysis demonstrates that this modern, active form of TF-type L1 has a composite evolutionary history, showing evidence of multiple recombinations between distinct L1 variants, including members of the A and F subfamilies.
Collapse
Affiliation(s)
- J A Saxton
- Department of Cellular and Structural Biology, B111, University of Colorado School of Medicine, 4200 E. Ninth Ave., Denver, CO 80262, USA
| | | |
Collapse
|
12
|
Woodcock DM, Linsenmeyer ME, Warren WD. DNA methylation in mouse A-repeats in DNA methyltransferase-knockout ES cells and in normal cells determined by bisulfite genomic sequencing. Gene 1998; 206:63-7. [PMID: 9461416 DOI: 10.1016/s0378-1119(97)00566-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Mouse ES cells with a null mutation of the known DNA methyltransferase retain some residual DNA methylation and can methylate foreign sequences de novo. We have used bisulfite genomic sequencing to examine the sequence specificity and distributions of methylation of a hypermethylated CG island sequence, mouse A-repeats. There were 13 CG dinucleotides in the region examined, 12 of which were methylated to variable extents in all DNAs. We found that: (1) there is considerable residual DNA methylation in ES cells lacking the known DNA methyltransferase (29% of normal methylation in the complete knockout ES DNA); (2) this other activity methylates at exactly the same CG sites as the major methyltransferase; and (3) differences in the distribution of methylated sites between A-repeats in these DNAs are consistent with this other activity methylating in a random de novo fashion. Also, the lack of any methylation in non-CG sites argues that, in other studies where non-CG methylation sites have been found by bisulfite sequencing, detection of such sites of non-CG methylation is not an inherent artifact in this methodology.
Collapse
Affiliation(s)
- D M Woodcock
- Peter MacCallum Cancer Institute, Melbourne, Victoria, Australia.
| | | | | |
Collapse
|
13
|
Cabot EL, Angeletti B, Usdin K, Furano AV. Rapid evolution of a young L1 (LINE-1) clade in recently speciated Rattus taxa. J Mol Evol 1997; 45:412-23. [PMID: 9321420 DOI: 10.1007/pl00006246] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
L1 elements are retrotransposons that have been replicating and evolving in mammalian genomes since before the mammalian radiation. Rattus norvegicus shares the young L1mlvi2 clade only with its sister taxon, Rattus cf moluccarius. Here we compared the L1mlvi2 clade in these recently diverged species and found that it evolved rapidly into closely related but distinct clades: the L1mlvi2-rm clade (or subfamily), characterized here from R. cf moluccarius, and the L1mlvi2-rn clade, originally described in R. norvegicus. In addition to other differences, these clades are distinguished by a cluster of amino acid replacement substitutions in ORF I. Both rat species contain the L1mlvi2-rm clade, but the L1mlvi2-rn clade is restricted to R. norvegicus. Therefore, the L1mlvi2-rm clade arose prior to the divergence of R. norvegicus and R. cf moluccarius, and the L1mlvi2-rn clade amplified after their divergence. The total number of L1mlvi2-rm elements in R. cf moluccarius is about the same as the sum of the L1mlvi2-rm and L1mlvi2-rn elements in R. norvegicus. The possibility that L1 amplification is in some way limited so that the two clades compete for replicative supremacy as well as the implications of the other distinguishing characteristic of the L1mlvi2-rn and L1mlvi2-rm clades are discussed.
Collapse
Affiliation(s)
- E L Cabot
- Section on Genomic Structure and Function, NIDDK, NIH, Bethesda, MD 20892-0830, USA.
| | | | | | | |
Collapse
|
14
|
Hayward BE, Zavanelli M, Furano AV. Recombination creates novel L1 (LINE-1) elements in Rattus norvegicus. Genetics 1997; 146:641-54. [PMID: 9178013 PMCID: PMC1208004 DOI: 10.1093/genetics/146.2.641] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Mammalian L1 (long interspersed repeated DNA. LINE-1) retrotransposons consist of a 5' untranslated region (UTR) with regulatory properties, two protein encoding regions (ORF I, ORF II, which encodes a reverse transcriptase) and a 3' UTR. L1 elements have been evolving in mammals for > 100 million years and this process continues to generate novel L1 subfamilies in modern species. Here we characterized the youngest known subfamily in Rattus norvegicus, L1mlvi2, and unexpectedly found that this element has a dual ancestry. While its 3' UTR shares the same lineage as its nearest chronologically antecedent subfamilies, L13 and L14, its ORF I sequence does not. The L1mlvi2 ORF I was derived from an ancestral ORF I sequence that was the evolutionary precursor of the L13 and L14 ORF I. We suggest that an ancestral ORF I sequence was recruited into the modern L1mlvi2 subfamily by recombination that possibly could have resulted from template strand switching by the reverse transcriptase during L1 replication. This mechanism could also account for some of the structural features of rodent L1 5' UTR and ORF I sequences including one of the more dramatic features of L1 evolution in mammals, namely the repeated acquisition of novel 5' UTRs.
Collapse
Affiliation(s)
- B E Hayward
- Laboratory of Molecular and Cellular Biology, National Institute of Diabetes, Maryland 20892-0830, USA
| | | | | |
Collapse
|
15
|
Pardue ML, Danilevskaya ON, Lowenhaupt K, Wong J, Erby K. The gag coding region of the Drosophila telomeric retrotransposon, HeT-A, has an internal frame shift and a length polymorphic region. J Mol Evol 1996; 43:572-83. [PMID: 8995054 DOI: 10.1007/bf02202105] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A major component of Drosophila telomeres is the retrotransposon HeT-A, which is clearly related to other retrotransposons and retroviruses. This retrotransposon is distinguished by its exclusively telomeric location, and by the fact that, unlike other retrotransposons, it does not encode its own reverse transcriptase. HeT-A coding sequences diverge significantly, even between elements within the same genome. Such rapid divergence has been noted previously in studies of gag genes from other retroelements. Sequence comparisons indicate that the entire HeT-A coding region codes for gag protein, with regions of similarity to other insect retrotransposon gag proteins found throughout the open reading frame (ORF). Similarity is most striking in the zinc knuckle region, a region characteristic of gag genes of most replication-competent retroelements. We identify a subgroup of insect non-LTR retrotransposons with three zinc knuckles of the form: (1) CX2CX4HX4C, (2) CX2CX3HX4C, (3) CX2CX3HX6C. The first and third knuckles are invariant, but the second shows some differences between members of this subgroup. This subgroup includes HeT-A and a second Drosophila telomeric retrotransposon, TART. Unlike other gag regions, HeT-A requires a -1 frameshift for complete translation. Such frameshifts are common between the gag and pol sequences of retroviruses but have not before been seen within a gag sequence. The frameshift allows HeT-A to encode two polypeptides; this mechanism may substitute for the post-translational cleavage that creates multiple gag polypeptides in retroviruses. D. melanogaster HeT-A coding sequences have a polymorphic region with insertions/deletions of 1-31 codons and many nucleotide changes. None of these changes interrupt the open reading frame, arguing that only elements with translatable ORFs can be incorporated into the chromosomes. Perhaps HeT-A translation products act in cis to target the RNA to chromosome ends.
Collapse
Affiliation(s)
- M L Pardue
- Department of Biology, Massachusetts Institute of Technology, Cambridge 02139, USA
| | | | | | | | | |
Collapse
|
16
|
Biessmann H, Kasravi B, Bui T, Fujiwara G, Champion LE, Mason JM. Comparison of two active HeT-A retroposons of Drosophila melanogaster. Chromosoma 1994; 103:90-8. [PMID: 8055715 DOI: 10.1007/bf00352317] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
HeT-A elements are Drosophila melanogaster LINE-like retroposons that transpose to broken chromosome ends by attaching themselves with an oligo(A) tail. Since this family of elements is believed to be involved in the vital function of telomere elongation in Drosophila, it is important to understand their transposition mechanism and the molecular aspects of activity. By comparison of several elements we have defined here the unit length of HeT-A elements to be approximately 6 kb. Also, we studied an active HeT-A element that had transposed very recently to the end of a terminally deleted X chromosome. The 12 kb of newly transposed DNA consisted of a tandem array of three different HeT-A elements joined by oligo(A) tails to each other and to the chromosome end broken in the yellow gene. Such an array may have transposed as a single unit or resulted from rapid successive transpositions of individual HeT-A elements. By sequence comparison with another recently transposed HeT-A element, conserved domains in the single open reading frame (ORF), encoding a gag-like polypeptide, of these elements were defined. We conclude that for transposition an intact ORF is required in cis, while the reverse transcriptase is not encoded on the HeT-A element but is provided in trans. This would make HeT-A elements dependent on an external reverse transcriptase for transposition and establish control of the genome over the activity of HeT-A elements. This distinguishes the Drosophila HeT-A element, which has been implicated in Drosophila telomere elongation, from the other, 'selfish' LINE-like elements.
Collapse
Affiliation(s)
- H Biessmann
- Developmental Biology Center, University of California, Irvine 92717
| | | | | | | | | | | |
Collapse
|
17
|
Adey NB, Tollefsbol TO, Sparks AB, Edgell MH, Hutchison CA. Molecular resurrection of an extinct ancestral promoter for mouse L1. Proc Natl Acad Sci U S A 1994; 91:1569-73. [PMID: 8108446 PMCID: PMC43201 DOI: 10.1073/pnas.91.4.1569] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The F-type subfamily of LINE-1 or L1 retroposons [for long interspersed (repetitive) element 1] was dispersed in the mouse genome several million years ago. This subfamily appears to be both transcriptionally and transpositionally inactive today and therefore may be considered evolutionarily extinct. We hypothesized that these F-type L1s are inactive because of the accumulation of mutations. To test this idea we used phylogenetic analysis to deduce the sequence of a transpositionally active ancestral F-type promoter, resurrected it by chemical synthesis, and showed that it has promoter activity. In contrast, F-type sequences isolated from the modern genome are inactive. This approach, in which the automated DNA synthesizer is used as a "time machine," should have broad application in testing models derived from evolutionary studies.
Collapse
Affiliation(s)
- N B Adey
- Department of Microbiology and Immunology, University of North Carolina at Chapel Hill 27599
| | | | | | | | | |
Collapse
|
18
|
Schichman SA, Severynse DM, Edgell MH, Hutchison CA. Strand-specific LINE-1 transcription in mouse F9 cells originates from the youngest phylogenetic subgroup of LINE-1 elements. J Mol Biol 1992; 224:559-74. [PMID: 1314898 DOI: 10.1016/0022-2836(92)90544-t] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
LINE-1 (L1) is a mammalian family of highly repeated DNA sequences that are members of a class of transposable elements whose movement involves an RNA intermediate. Both structural and evolutionary data indicate that the L1 family consists of a small number of active transposable elements interspersed with a large number of L1 pseudogenes. In the mouse, the longest, characterized L1 sequences span about 7000 base-pairs and contain two long open reading frames. Two subfamilies of mouse L1 elements, A and F, have been defined on the basis of the type of putative transcriptional regulatory sequence found at the 5' end. In order to identify a transcribed subset of L1 elements in mouse F9 teratocarcinoma cells, we have examined the strand-specificity of L1 transcription by Northern analysis and compared the open reading frame-1 sequences of ten A-type cDNAs with fifteen genomic A-type L1 elements. Transcripts containing A-type sequence are far more abundant than those containing F-type sequence. Although the majority of L1 RNA in F9 cells appears to be transcribed non-specifically from both strands, our results provide evidence for a subpopulation of variable length, strand-specific transcripts arising from A-type transcriptional regulatory sequences. F9 cell cDNA sequences, which share greater than 99.5% sequence identity with one another, represent a homogeneous subset of the genomic L1 population. Examination of genomic mouse L1 sequences reveals three types of length polymorphism in a defined segment of the first open reading frame. Phylogenetic analysis shows a correlation between the type of length polymorphism in the first open reading frame and the relative age of an individual A-type genomic L1 element. Comparison of the cDNA and genomic sequences indicates that the youngest subgroup of A-type L1 elements is preferentially transcribed in F9 cells. This subgroup may be currently dominating the L1 dispersal process in mice.
Collapse
Affiliation(s)
- S A Schichman
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27599
| | | | | | | |
Collapse
|
19
|
Adey NB, Schichman SA, Hutchison CA, Edgell MH. Composite of A and F-type 5' terminal sequences defines a subfamily of mouse LINE-1 elements. J Mol Biol 1991; 221:367-73. [PMID: 1920423 DOI: 10.1016/0022-2836(91)80057-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The 5' terminus of full-length L1 elements contains transcriptional control sequences. In mouse L1 (L1Md) elements, these sequences exist as an array of tandem direct repeats. Two types of repeat units, termed A-monomers and F-monomers, have been reported. Both monomers are about 200 bp in length but share no significant sequence homology. Previous studies have identified L1Md elements containing either A or F-monomers but not both. Here we describe three "composite" L1Md elements that contain both types of monomer sequence. Two of these composite L1Md elements are highly homologous and share the same structural rearrangements, implying that they arose from a common ancestor that has the same composite 5' end.
Collapse
Affiliation(s)
- N B Adey
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 72599
| | | | | | | |
Collapse
|
20
|
Adey NB, Comer MB, Edgell MH, Hutchison CA. Nucleotide sequence of a mouse full-length F-type L1 element. Nucleic Acids Res 1991; 19:2497. [PMID: 1645871 PMCID: PMC329465 DOI: 10.1093/nar/19.9.2497] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Affiliation(s)
- N B Adey
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27599
| | | | | | | |
Collapse
|
21
|
Lankenau DH, Huijser P, Jansen E, Miedema K, Hennig W. DNA sequence comparison of micropia transposable elements from Drosophila hydei and Drosophila melanogaster. Chromosoma 1990; 99:111-7. [PMID: 2162752 DOI: 10.1007/bf01735326] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Members of the retrotransposon family micropia were discovered as constituents of wild-type Y chromosomal fertility genes from Drosophila hydei. Several members of the micropia family have subsequently been recovered from Drosophila melanogaster and four micropia elements, micropia-DhMiF2, -DhMiF8, -Dm11 and -Dm2, two each from D. hydei and D. melanogaster, have been totally sequenced (17 kb of micropia sequences and 6.8 kb from insertions). Comparative analysis of micropia sequences revealed a complex pattern of divergence within a single Drosophila genome. The divergence includes deletions, possibly by a slipped mispairing mechanism, insertions of a retroposon, and of another retrotransposon (copia) and "positional nucleotide shuffling" within the tandem repeats of the 3' non-protein-coding region of micropia elements. A 10 bp long sequence of each repeat unit of the 3' tandem repeats of micropia elements is highly conserved and is therefore a candidate of functional importance either in transposition events or in regulatory activity on flanking DNA sequences.
Collapse
Affiliation(s)
- D H Lankenau
- Department of Molecular and Developmental Genetics, Catholic University, Nijmegen, The Netherlands
| | | | | | | | | |
Collapse
|
22
|
Lankenau DH, Huijser P, Jansen E, Miedema K, Hennig W. Micropia: a retrotransposon of Drosophila combining structural features of DNA viruses, retroviruses and non-viral transposable elements. J Mol Biol 1988; 204:233-46. [PMID: 2464689 DOI: 10.1016/0022-2836(88)90572-4] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The retrotransposon micropia was first described from Y-chromosomal fertility genes of Drosophila hydei. Screening a Drosophila melanogaster genomic library yielded several clones representing micropia elements in D. melanogaster. The DNA sequences of two elements from D. hydei (micropia-DhMiF2 and micropia-DhMiF8) and two elements from D. melanogaster (micropia-Dm2 and micropia-Dm11) permitted a detailed analysis of the spatial organization of micropia constituents. Micropia represents the typical gene organization represented by "core"-protein domains followed by a protease, reverse transcriptase, RNase and integrase domain. New features of the micropia family compared with other retrotransposons are: (1) a region of similarity to class I major histocompatibility complex antigens of mammals; (2) only one main open reading frame of about 4000 bases length; (3) a non-protein-coding region of about 500 base-pairs length between the 3' end of the open reading frame and the 5' start of the 3' long terminal repeat. This region includes 32 base-pair tandem repeats; (4) within the long terminal repeats, 82 base-pair tandem repeats with four potential ecdysteroid receptor binding sites. Because micropia combines many evolutionary features of different viruses, non-viral transposable elements, chromosomal genes and repetitive sequence organizations, this retrotransposon may be seen as a "minigenome" reflecting evolutionary principles of the construction of genomic components.
Collapse
Affiliation(s)
- D H Lankenau
- Department of Molecular and Developmental Genetics, Katholieke Universiteit, Nijmegen, The Netherlands
| | | | | | | | | |
Collapse
|
23
|
Lloyd JA, Potter SS. Distinct subfamilies of primate L1Gg retroposons, with some elements carrying tandem repeats in the 5' region. Nucleic Acids Res 1988; 16:6147-56. [PMID: 3399387 PMCID: PMC336853 DOI: 10.1093/nar/16.13.6147] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Two subfamilies of L1 elements, differing dramatically in the first 1.2 kb of sequence at their 5' ends, were identified in the prosimian primate, Galago garnetti. Interesting patterns of sequence similarity were observed between the galago subfamilies, and with the L1s from human and from another prosimian, the slow loris. Furthermore, members of one of the subfamilies have six to eight tandemly repeated units of 73 bp, starting about 730 bp from their 5' ends. Such tandem repeats have not been reported in other primate L1s, but a striking sequence similarity was found between the galago tandem repeats and those previously described at the 5' termini of some mouse L1s [Loeb, D. D. et al. Mol. Cell. Biol. 6, 168-182, 1986]. Although the similar sequence indicates a shared, conserved function, the galago repeats are sub-terminal and therefore cannot serve as portable RNA polymerase II promoters, as has been suggested for the mouse tandem repeats.
Collapse
Affiliation(s)
- J A Lloyd
- Children's Hospital Research Foundation, Cincinnati, OH 45229
| | | |
Collapse
|