1
|
Saha K, Nielsen GI, Nandani R, Kong L, Ye P, An W. YY1 is a transcriptional activator of mouse LINE-1 Tf subfamily. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.03.573552. [PMID: 38260579 PMCID: PMC10802269 DOI: 10.1101/2024.01.03.573552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Long interspersed element type 1 (LINE-1, L1) is an active autonomous transposable element (TE) in the human genome. The first step of L1 replication is transcription, which is controlled by an internal RNA polymerase II promoter in the 5' untranslated region (UTR) of a full-length L1. It has been shown that transcription factor YY1 binds to a conserved sequence motif at the 5' end of the human L1 5'UTR and dictates where transcription initiates but not the level of transcription. Putative YY1-binding motifs have been predicted in the 5'UTRs of two distinct mouse L1 subfamilies, Tf and Gf. Using site-directed mutagenesis, in vitro binding, and gene knockdown assays, we experimentally tested the role of YY1 in mouse L1 transcription. Our results indicate that Tf, but not Gf subfamily, harbors functional YY1-binding sites in its 5'UTR monomers. In contrast to its role in human L1, YY1 functions as a transcriptional activator for the mouse Tf subfamily. Furthermore, YY1-binding motifs are solely responsible for the synergistic interaction between monomers, consistent with a model wherein distant monomers act as enhancers for mouse L1 transcription. The abundance of YY1-binding sites in Tf elements also raise important implications for gene regulation at the genomic level.
Collapse
Affiliation(s)
- Karabi Saha
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| | - Grace I. Nielsen
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| | - Raj Nandani
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| | - Lingqi Kong
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| | - Ping Ye
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| | - Wenfeng An
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD 57007, USA
| |
Collapse
|
2
|
Gerdes P, Chan D, Lundberg M, Sanchez-Luque FJ, Bodea GO, Ewing AD, Faulkner GJ, Richardson SR. Locus-resolution analysis of L1 regulation and retrotransposition potential in mouse embryonic development. Genome Res 2023; 33:1465-1481. [PMID: 37798118 PMCID: PMC10620060 DOI: 10.1101/gr.278003.123] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/21/2023] [Indexed: 10/07/2023]
Abstract
Mice harbor ∼2800 intact copies of the retrotransposon Long Interspersed Element 1 (L1). The in vivo retrotransposition capacity of an L1 copy is defined by both its sequence integrity and epigenetic status, including DNA methylation of the monomeric units constituting young mouse L1 promoters. Locus-specific L1 methylation dynamics during development may therefore elucidate and explain spatiotemporal niches of endogenous retrotransposition but remain unresolved. Here, we interrogate the retrotransposition efficiency and epigenetic fate of source (donor) L1s, identified as mobile in vivo. We show that promoter monomer loss consistently attenuates the relative retrotransposition potential of their offspring (daughter) L1 insertions. We also observe that most donor/daughter L1 pairs are efficiently methylated upon differentiation in vivo and in vitro. We use Oxford Nanopore Technologies (ONT) long-read sequencing to resolve L1 methylation genome-wide and at individual L1 loci, revealing a distinctive "smile" pattern in methylation levels across the L1 promoter region. Using Pacific Biosciences (PacBio) SMRT sequencing of L1 5' RACE products, we then examine DNA methylation dynamics at the mouse L1 promoter in parallel with transcription start site (TSS) distribution at locus-specific resolution. Together, our results offer a novel perspective on the interplay between epigenetic repression, L1 evolution, and genome stability.
Collapse
Affiliation(s)
- Patricia Gerdes
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Dorothy Chan
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Mischa Lundberg
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland 4102, Australia
- Translational Bioinformatics, Commonwealth Scientific and Industrial Research Organisation, Sydney, New South Wales 2113, Australia
| | - Francisco J Sanchez-Luque
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- GENYO. Centre for Genomics and Oncological Research (Pfizer-University of Granada-Andalusian Regional Government), PTS Granada, 18016, Spain
- MRC Human Genetics Unit, Institute of Genetics and Cancer (IGC), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Gabriela O Bodea
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Adam D Ewing
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Sandra R Richardson
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
| |
Collapse
|
3
|
Kong L, Saha K, Hu Y, Tschetter JN, Habben CE, Whitmore LS, Yao C, Ge X, Ye P, Newkirk SJ, An W. Subfamily-specific differential contribution of individual monomers and the tether sequence to mouse L1 promoter activity. Mob DNA 2022; 13:13. [PMID: 35443687 PMCID: PMC9022269 DOI: 10.1186/s13100-022-00269-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 03/28/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The internal promoter in L1 5'UTR is critical for autonomous L1 transcription and initiating retrotransposition. Unlike the human genome, which features one contemporarily active subfamily, four subfamilies (A_I, Gf_I and Tf_I/II) have been amplifying in the mouse genome in the last one million years. Moreover, mouse L1 5'UTRs are organized into tandem repeats called monomers, which are separated from ORF1 by a tether domain. In this study, we aim to compare promoter activities across young mouse L1 subfamilies and investigate the contribution of individual monomers and the tether sequence. RESULTS We observed an inverse relationship between subfamily age and the average number of monomers among evolutionarily young mouse L1 subfamilies. The youngest subgroup (A_I and Tf_I/II) on average carry 3-4 monomers in the 5'UTR. Using a single-vector dual-luciferase reporter assay, we compared promoter activities across six L1 subfamilies (A_I/II, Gf_I and Tf_I/II/III) and established their antisense promoter activities in a mouse embryonic fibroblast cell line and a mouse embryonal carcinoma cell line. Using consensus promoter sequences for three subfamilies (A_I, Gf_I and Tf_I), we dissected the differential roles of individual monomers and the tether domain in L1 promoter activity. We validated that, across multiple subfamilies, the second monomer consistently enhances the overall promoter activity. For individual promoter components, monomer 2 is consistently more active than the corresponding monomer 1 and/or the tether for each subfamily. Importantly, we revealed intricate interactions between monomer 2, monomer 1 and tether domains in a subfamily-specific manner. Furthermore, using three-monomer 5'UTRs, we established a complex nonlinear relationship between the length of the outmost monomer and the overall promoter activity. CONCLUSIONS The laboratory mouse is an important mammalian model system for human diseases as well as L1 biology. Our study extends previous findings and represents an important step toward a better understanding of the molecular mechanism controlling mouse L1 transcription as well as L1's impact on development and disease.
Collapse
Affiliation(s)
- Lingqi Kong
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Karabi Saha
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Yuchi Hu
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Jada N Tschetter
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Chase E Habben
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Leanne S Whitmore
- Department of Immunology, University of Washington, Seattle, WA, 98109, USA
| | - Changfeng Yao
- Anhui University of Traditional Chinese Medicine, Hefei, 230012, Anhui, China
| | - Xijin Ge
- Department of Mathematics & Statistics, South Dakota State University, Brookings, SD, 57007, USA
| | - Ping Ye
- Department of Pharmacy Practice, South Dakota State University, Brookings, SD, 57007, USA
| | - Simon J Newkirk
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Wenfeng An
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA.
| |
Collapse
|
4
|
Floreani L, Ansaloni F, Mangoni D, Agostoni E, Sanges R, Persichetti F, Gustincich S. Analysis of LINE1 Retrotransposons in Huntington’s Disease. Front Cell Neurosci 2022; 15:743797. [PMID: 35095420 PMCID: PMC8795916 DOI: 10.3389/fncel.2021.743797] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 12/27/2021] [Indexed: 12/27/2022] Open
Abstract
Transposable elements (TEs) are mobile genetic elements that made up about half the human genome. Among them, the autonomous non-LTR retrotransposon long interspersed nuclear element-1 (L1) is the only currently active TE in mammals and covers about 17% of the mammalian genome. L1s exert their function as structural elements in the genome, as transcribed RNAs to influence chromatin structure and as retrotransposed elements to shape genomic variation in somatic cells. L1s activity has been shown altered in several diseases of the nervous system. Huntington disease (HD) is a dominantly inherited neurodegenerative disorder caused by an expansion of a CAG repeat in the HTT gene which leads to a gradual loss of neurons most prominently in the striatum and, to a lesser extent, in cortical brain regions. The length of the expanded CAG tract is related to age at disease onset, with longer repeats leading to earlier onset. Here we carried out bioinformatic analysis of public RNA-seq data of a panel of HD mouse models showing that a decrease of L1 RNA expression recapitulates two hallmarks of the disease: it correlates to CAG repeat length and it occurs in the striatum, the site of neurodegeneration. Results were then experimentally validated in HttQ111 knock-in mice. The expression of L1-encoded proteins was independent from L1 RNA levels and differentially regulated in time and tissues. The pattern of expression L1 RNAs in human HD post-mortem brains showed similarity to mouse models of the disease. This work suggests the need for further study of L1s in HD and adds support to the current hypothesis that dysregulation of TEs may be involved in neurodegenerative diseases.
Collapse
Affiliation(s)
- Lavinia Floreani
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
| | - Federico Ansaloni
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
- Central RNA Laboratory, Istituto Italiano di Tecnologia—IIT, Genova, Italy
| | - Damiano Mangoni
- Central RNA Laboratory, Istituto Italiano di Tecnologia—IIT, Genova, Italy
| | - Elena Agostoni
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
| | - Remo Sanges
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
- Central RNA Laboratory, Istituto Italiano di Tecnologia—IIT, Genova, Italy
- *Correspondence: Remo Sanges,
| | - Francesca Persichetti
- Department of Health Sciences, University of Piemonte Orientale “ A. Avogadro,”Novara, Italy
- Francesca Persichetti,
| | - Stefano Gustincich
- Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
- Central RNA Laboratory, Istituto Italiano di Tecnologia—IIT, Genova, Italy
- Stefano Gustincich,
| |
Collapse
|
5
|
Stow EC, Kaul T, deHaro DL, Dem MR, Beletsky AG, Morales ME, Du Q, LaRosa AJ, Yang H, Smither E, Baddoo M, Ungerleider N, Deininger P, Belancio VP. Organ-, sex- and age-dependent patterns of endogenous L1 mRNA expression at a single locus resolution. Nucleic Acids Res 2021; 49:5813-5831. [PMID: 34023901 PMCID: PMC8191783 DOI: 10.1093/nar/gkab369] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 04/21/2021] [Accepted: 04/28/2021] [Indexed: 11/13/2022] Open
Abstract
Expression of L1 mRNA, the first step in the L1 copy-and-paste amplification cycle, is a prerequisite for L1-associated genomic instability. We used a reported stringent bioinformatics method to parse L1 mRNA transcripts and measure the level of L1 mRNA expressed in mouse and rat organs at a locus-specific resolution. This analysis determined that mRNA expression of L1 loci in rodents exhibits striking organ specificity with less than 0.8% of loci shared between organs of the same organism. This organ specificity in L1 mRNA expression is preserved in male and female mice and across age groups. We discovered notable differences in L1 mRNA expression between sexes with only 5% of expressed L1 loci shared between male and female mice. Moreover, we report that the levels of total L1 mRNA expression and the number and spectrum of expressed L1 loci fluctuate with age as independent variables, demonstrating different patterns in different organs and sexes. Overall, our comparisons between organs and sexes and across ages ranging from 2 to 22 months establish previously unforeseen dynamic changes in L1 mRNA expression in vivo. These findings establish the beginning of an atlas of endogenous L1 mRNA expression across a broad range of biological variables that will guide future studies.
Collapse
Affiliation(s)
- Emily C Stow
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Tiffany Kaul
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Dawn L deHaro
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Madeleine R Dem
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Anna G Beletsky
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Maria E Morales
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Qianhui Du
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Alexis J LaRosa
- Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Hanlin Yang
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Emily Smither
- Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| | - Melody Baddoo
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Nathan Ungerleider
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA
| | - Prescott Deininger
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Epidemiology, Tulane School of Public Health and Tropical Medicine, New Orleans, LA 70112 USA
| | - Victoria P Belancio
- Tulane Cancer Center, Tulane Health Sciences Center, 1700 Tulane Ave, New Orleans, LA 70112, USA.,Department of Structural and Cellular Biology, Tulane School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112 USA
| |
Collapse
|
6
|
Gagnier L, Belancio VP, Mager DL. Mouse germ line mutations due to retrotransposon insertions. Mob DNA 2019; 10:15. [PMID: 31011371 PMCID: PMC6466679 DOI: 10.1186/s13100-019-0157-4] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 04/01/2019] [Indexed: 12/24/2022] Open
Abstract
Transposable element (TE) insertions are responsible for a significant fraction of spontaneous germ line mutations reported in inbred mouse strains. This major contribution of TEs to the mutational landscape in mouse contrasts with the situation in human, where their relative contribution as germ line insertional mutagens is much lower. In this focussed review, we provide comprehensive lists of TE-induced mouse mutations, discuss the different TE types involved in these insertional mutations and elaborate on particularly interesting cases. We also discuss differences and similarities between the mutational role of TEs in mice and humans.
Collapse
Affiliation(s)
- Liane Gagnier
- Terry Fox Laboratory, BC Cancer and Department of Medical Genetics, University of British Columbia, V5Z1L3, Vancouver, BC Canada
| | - Victoria P. Belancio
- Department of Structural and Cellular Biology, Tulane University School of Medicine, Tulane Cancer Center, Tulane Center for Aging, New Orleans, LA 70112 USA
| | - Dixie L. Mager
- Terry Fox Laboratory, BC Cancer and Department of Medical Genetics, University of British Columbia, V5Z1L3, Vancouver, BC Canada
| |
Collapse
|
7
|
Zhou M, Smith AD. Subtype classification and functional annotation of L1Md retrotransposon promoters. Mob DNA 2019; 10:14. [PMID: 31007728 PMCID: PMC6454616 DOI: 10.1186/s13100-019-0156-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Accepted: 03/28/2019] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND L1Md retrotransposons are the most abundant and active transposable elements in the mouse genome. The promoters of many L1Md retrotransposons are composed of tandem repeats called monomers. The number of monomers varies between retrotransposon copies, thus making it difficult to annotate L1Md promoters. Duplication of monomers contributes to the maintenance of L1Md promoters during truncation-prone retrotranspositions, but the associated mechanism remains unclear. Since the current classification of monomers is based on limited data, a comprehensive monomer annotation is needed for supporting functional studies of L1Md promoters genome-wide. RESULTS We developed a pipeline for de novo monomer detection and classification. Identified monomers are further classified into subtypes based on their sequence profiles. We applied this pipeline to genome assemblies of various rodent species. A major monomer subtype of the lab mouse was also found in other Mus species, implying that such subtype has emerged in the common ancestor of involved species. We also characterized the positioning pattern of monomer subtypes within individual promoters. Our analyses indicate that the subtype composition of an L1Md promoter can be used to infer its transcriptional activity during male germ cell development. CONCLUSIONS We identified subtypes for all monomer types using comprehensive data, greatly expanding the spectrum of monomer variants. The analysis of monomer subtype positioning provides evidence supporting both previously proposed models of L1Md promoter expansion. The transcription silencing of L1Md promoters differs between promoter types, which supports a model involving distinct suppressive pathways rather than a universal mechanism for retrotransposon repression in gametogenesis.
Collapse
Affiliation(s)
- Meng Zhou
- Molecular and Computational Biology Section, Division of Biological Sciences, University of Southern California, Los Angeles, USA
| | - Andrew D. Smith
- Molecular and Computational Biology Section, Division of Biological Sciences, University of Southern California, Los Angeles, USA
| |
Collapse
|
8
|
Boissinot S, Sookdeo A. The Evolution of LINE-1 in Vertebrates. Genome Biol Evol 2018; 8:3485-3507. [PMID: 28175298 PMCID: PMC5381506 DOI: 10.1093/gbe/evw247] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/03/2016] [Indexed: 12/21/2022] Open
Abstract
The abundance and diversity of the LINE-1 (L1) retrotransposon differ greatly among vertebrates. Mammalian genomes contain hundreds of thousands L1s that have accumulated since the origin of mammals. A single group of very similar elements is active at a time in mammals, thus a single lineage of active families has evolved in this group. In contrast, non-mammalian genomes (fish, amphibians, reptiles) harbor a large diversity of concurrently transposing families, which are all represented by very small number of recently inserted copies. Why the pattern of diversity and abundance of L1 is so different among vertebrates remains unknown. To address this issue, we performed a detailed analysis of the evolution of active L1 in 14 mammals and in 3 non-mammalian vertebrate model species. We examined the evolution of base composition and codon bias, the general structure, and the evolution of the different domains of L1 (5′UTR, ORF1, ORF2, 3′UTR). L1s differ substantially in length, base composition, and structure among vertebrates. The most variation is found in the 5′UTR, which is longer in amniotes, and in the ORF1, which tend to evolve faster in mammals. The highly divergent L1 families of lizard, frog, and fish share species-specific features suggesting that they are subjected to the same functional constraints imposed by their host. The relative conservation of the 5′UTR and ORF1 in non-mammalian vertebrates suggests that the repression of transposition by the host does not act in a sequence-specific manner and did not result in an arms race, as is observed in mammals.
Collapse
|
9
|
Murata Y, Bundo M, Ueda J, Kubota-Sakashita M, Kasai K, Kato T, Iwamoto K. DNA methylation and hydroxymethylation analyses of the active LINE-1 subfamilies in mice. Sci Rep 2017; 7:13624. [PMID: 29051587 PMCID: PMC5648895 DOI: 10.1038/s41598-017-14165-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Accepted: 10/06/2017] [Indexed: 11/17/2022] Open
Abstract
Retrotransposon long interspersed nuclear element-1 (LINE-1) occupies a large proportion of the mammalian genome, comprising approximately 100,000 genomic copies in mice. Epigenetic status of the 5′ untranslated region (5′-UTR) of LINE-1 is critical for its promoter activity. DNA methylation levels in the 5′-UTR of human active LINE-1 subfamily can be measured by well-established methods, such as a pyrosequencing-based assay. However, because of the considerable sequence and structural diversity in LINE-1 among species, methods for such assays should be adapted for the species of interest. Here we developed pyrosequencing-based assays to examine methylcytosine (mC) and hydroxymethylcytosine (hmC) levels of the three active LINE-1 subfamilies in mice (TfI, A, and GfII). Using these assays, we quantified mC and hmC levels in four brain regions and four nonbrain tissues including tail, heart, testis, and ovary. We observed tissue- and subfamily-specific mC and hmC differences. We also found that mC levels were strongly correlated among different brain regions, but mC levels of the testis showed a poor correlation with those of other tissues. Interestingly, mC levels in the A and GfII subfamilies were highly correlated, possibly reflecting their close evolutionary relationship. Our assays will be useful for exploring the epigenetic regulation of the active LINE-1 subfamilies in mice.
Collapse
Affiliation(s)
- Yui Murata
- Department of Molecular Brain Science, Graduate School of Medical Sciences, Kumamoto University, 1-1-1 Honjo, Chuo-ku, Kumamoto City, Kumamoto, 860-8556, Japan
| | - Miki Bundo
- Department of Molecular Brain Science, Graduate School of Medical Sciences, Kumamoto University, 1-1-1 Honjo, Chuo-ku, Kumamoto City, Kumamoto, 860-8556, Japan.,PRESTO, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi City, Saitama, 332-0012, Japan
| | - Junko Ueda
- Laboratory for Molecular Dynamics of Mental Disorders, RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-city, Saitama, 351-0198, Japan
| | - Mie Kubota-Sakashita
- Laboratory for Molecular Dynamics of Mental Disorders, RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-city, Saitama, 351-0198, Japan
| | - Kiyoto Kasai
- Department of Neuropsychiatry, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
| | - Tadafumi Kato
- Laboratory for Molecular Dynamics of Mental Disorders, RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-city, Saitama, 351-0198, Japan
| | - Kazuya Iwamoto
- Department of Molecular Brain Science, Graduate School of Medical Sciences, Kumamoto University, 1-1-1 Honjo, Chuo-ku, Kumamoto City, Kumamoto, 860-8556, Japan.
| |
Collapse
|
10
|
Abstract
Transposable elements have had a profound impact on the structure and function of mammalian genomes. The retrotransposon Long INterspersed Element-1 (LINE-1 or L1), by virtue of its replicative mobilization mechanism, comprises ∼17% of the human genome. Although the vast majority of human LINE-1 sequences are inactive molecular fossils, an estimated 80-100 copies per individual retain the ability to mobilize by a process termed retrotransposition. Indeed, LINE-1 is the only active, autonomous retrotransposon in humans and its retrotransposition continues to generate both intra-individual and inter-individual genetic diversity. Here, we briefly review the types of transposable elements that reside in mammalian genomes. We will focus our discussion on LINE-1 retrotransposons and the non-autonomous Short INterspersed Elements (SINEs) that rely on the proteins encoded by LINE-1 for their mobilization. We review cases where LINE-1-mediated retrotransposition events have resulted in genetic disease and discuss how the characterization of these mutagenic insertions led to the identification of retrotransposition-competent LINE-1s in the human and mouse genomes. We then discuss how the integration of molecular genetic, biochemical, and modern genomic technologies have yielded insight into the mechanism of LINE-1 retrotransposition, the impact of LINE-1-mediated retrotransposition events on mammalian genomes, and the host cellular mechanisms that protect the genome from unabated LINE-1-mediated retrotransposition events. Throughout this review, we highlight unanswered questions in LINE-1 biology that provide exciting opportunities for future research. Clearly, much has been learned about LINE-1 and SINE biology since the publication of Mobile DNA II thirteen years ago. Future studies should continue to yield exciting discoveries about how these retrotransposons contribute to genetic diversity in mammalian genomes.
Collapse
|
11
|
Servant G, Deininger PL. Insertion of Retrotransposons at Chromosome Ends: Adaptive Response to Chromosome Maintenance. Front Genet 2016; 6:358. [PMID: 26779254 PMCID: PMC4700185 DOI: 10.3389/fgene.2015.00358] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 12/10/2015] [Indexed: 01/30/2023] Open
Abstract
The telomerase complex is a specialized reverse transcriptase (RT) that inserts tandem DNA arrays at the linear chromosome ends and contributes to the protection of the genetic information in eukaryotic genomes. Telomerases are phylogenetically related to retrotransposons, encoding also the RT activity required for the amplification of their sequences throughout the genome. Intriguingly the telomerase gene is lost from the Drosophila genome and tandem retrotransposons replace telomeric sequences at the chromosome extremities. This observation suggests the versatility of RT activity in counteracting the chromosome shortening associated with genome replication and that retrotransposons can provide this activity in case of a dysfunctional telomerase. In this review paper, we describe the major classes of retroelements present in eukaryotic genomes in order to point out the differences and similarities with the telomerase complex. In a second part, we discuss the insertion of retroelements at the ends of chromosomes as an adaptive response for dysfunctional telomeres.
Collapse
Affiliation(s)
| | - Prescott L. Deininger
- Tulane Cancer Center, Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LAUSA
| |
Collapse
|
12
|
A unique HMG-box domain of mouse Maelstrom binds structured RNA but not double stranded DNA. PLoS One 2015; 10:e0120268. [PMID: 25807393 PMCID: PMC4373776 DOI: 10.1371/journal.pone.0120268] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Accepted: 02/05/2015] [Indexed: 01/11/2023] Open
Abstract
Piwi-interacting piRNAs are a major and essential class of small RNAs in the animal germ cells with a prominent role in transposon control. Efficient piRNA biogenesis and function require a cohort of proteins conserved throughout the animal kingdom. Here we studied Maelstrom (MAEL), which is essential for piRNA biogenesis and germ cell differentiation in flies and mice. MAEL contains a high mobility group (HMG)-box domain and a Maelstrom-specific domain with a presumptive RNase H-fold. We employed a combination of sequence analyses, structural and biochemical approaches to evaluate and compare nucleic acid binding of mouse MAEL HMG-box to that of canonical HMG-box domain proteins (SRY and HMGB1a). MAEL HMG-box failed to bind double-stranded (ds)DNA but bound to structured RNA. We also identified important roles of a novel cluster of arginine residues in MAEL HMG-box in these interactions. Cumulatively, our results suggest that the MAEL HMG-box domain may contribute to MAEL function in selective processing of retrotransposon RNA into piRNAs. In this regard, a cellular role of MAEL HMG-box domain is reminiscent of that of HMGB1 as a sentinel of immunogenic nucleic acids in the innate immune response.
Collapse
|
13
|
Brečević L, Rinčić M, Krsnik Ž, Sedmak G, Hamid AB, Kosyakova N, Galić I, Liehr T, Borovečki F. Association of new deletion/duplication region at chromosome 1p21 with intellectual disability, severe speech deficit and autism spectrum disorder-like behavior: an all-in approach to solving the DPYD enigma. Transl Neurosci 2015; 6:59-86. [PMID: 28123791 PMCID: PMC4936614 DOI: 10.1515/tnsci-2015-0007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Accepted: 12/29/2014] [Indexed: 12/14/2022] Open
Abstract
We describe an as yet unreported neocentric small supernumerary marker chromosome (sSMC) derived from chromosome 1p21.3p21.2. It was present in 80% of the lymphocytes in a male patient with intellectual disability, severe speech deficit, mild dysmorphic features, and hyperactivity with elements of autism spectrum disorder (ASD). Several important neurodevelopmental genes are affected by the 3.56 Mb copy number gain of 1p21.3p21.2, which may be considered reciprocal in gene content to the recently recognized 1p21.3 microdeletion syndrome. Both 1p21.3 deletions and the presented duplication display overlapping symptoms, fitting the same disorder category. Contribution of coding and non-coding genes to the phenotype is discussed in the light of cellular and intercellular homeostasis disequilibrium. In line with this the presented 1p21.3p21.2 copy number gain correlated to 1p21.3 microdeletion syndrome verifies the hypothesis of a cumulative effect of the number of deregulated genes - homeostasis disequilibrium leading to overlapping phenotypes between microdeletion and microduplication syndromes. Although miR-137 appears to be the major player in the 1p21.3p21.2 region, deregulation of the DPYD (dihydropyrimidine dehydrogenase) gene may potentially affect neighboring genes underlying the overlapping symptoms present in both the copy number loss and copy number gain of 1p21. Namely, the all-in approach revealed that DPYD is a complex gene whose expression is epigenetically regulated by long non-coding RNAs (lncRNAs) within the locus. Furthermore, the long interspersed nuclear element-1 (LINE-1) L1MC1 transposon inserted in DPYD intronic transcript 1 (DPYD-IT1) lncRNA with its parasites, TcMAR-Tigger5b and pair of Alu repeats appears to be the “weakest link” within the DPYD gene liable to break. Identification of the precise mechanism through which DPYD is epigenetically regulated, and underlying reasons why exactly the break (FRA1E) happens, will consequently pave the way toward preventing severe toxicity to the antineoplastic drug 5-fluorouracil (5-FU) and development of the causative therapy for the dihydropyrimidine dehydrogenase deficiency.
Collapse
Affiliation(s)
- Lukrecija Brečević
- Croatian Institute for Brain Research, University of Zagreb Medical School, Šalata 12, 10000 Zagreb, Croatia
- Department for Functional Genomics, Center for Translational and Clinical Research, University of Zagreb Medical School, University Hospital Center Zagreb, Šalata 2, 10000 Zagreb, Croatia
- E-mail: ;
| | - Martina Rinčić
- Croatian Institute for Brain Research, University of Zagreb Medical School, Šalata 12, 10000 Zagreb, Croatia
- Department for Functional Genomics, Center for Translational and Clinical Research, University of Zagreb Medical School, University Hospital Center Zagreb, Šalata 2, 10000 Zagreb, Croatia
- Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, 07743 Jena, Germany
| | - Željka Krsnik
- Croatian Institute for Brain Research, University of Zagreb Medical School, Šalata 12, 10000 Zagreb, Croatia
| | - Goran Sedmak
- Croatian Institute for Brain Research, University of Zagreb Medical School, Šalata 12, 10000 Zagreb, Croatia
| | - Ahmed B. Hamid
- Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, 07743 Jena, Germany
| | - Nadezda Kosyakova
- Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, 07743 Jena, Germany
| | - Ivan Galić
- Center for Rehabilitation Stančić, Stančić bb, 10370 Stančić, Croatia
| | - Thomas Liehr
- Jena University Hospital, Friedrich Schiller University, Institute of Human Genetics, Kollegiengasse 10, 07743 Jena, Germany
| | - Fran Borovečki
- Department for Functional Genomics, Center for Translational and Clinical Research, University of Zagreb Medical School, University Hospital Center Zagreb, Šalata 2, 10000 Zagreb, Croatia
| |
Collapse
|
14
|
Ngamphiw C, Tongsima S, Mutirangura A. Roles of intragenic and intergenic L1s in mouse and human. PLoS One 2014; 9:e113434. [PMID: 25409429 PMCID: PMC4237456 DOI: 10.1371/journal.pone.0113434] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2014] [Accepted: 10/28/2014] [Indexed: 02/02/2023] Open
Abstract
Long INterspersed Element-1 (LINE-1 or L1) is a retrotransposable element that has shaped the evolution of mammalian genomes. There is increasing evidence that transcriptionally active L1 could have been co-opted through evolution to play various roles including X-inactivation, homologous recombination and gene regulation. Here, we compare putatively active L1 distributions in the mouse with human. L1 density is higher in the mouse except for the Y-chromosome. L1 density is the highest in X-chromosome, implying an X-inactivation role. L1 is more common outside genes (intergenic) except for the Y-chromosome in both species. The structure of mouse L1 is distinguished from human L1 by the presence of a 200 bp repeat in the 5' UTR of the former. We found that mouse intragenic L1 has significantly higher repeat copy numbers than intergenic L1, suggesting that this is important for control of L1 expression. Furthermore, a significant association between the presence of intragenic L1s and down-regulated genes in early embryogenesis was found in both species. In conclusion, the distribution of L1 in the mouse genome points to biological roles of L1 in mouse similar to human.
Collapse
Affiliation(s)
- Chumpol Ngamphiw
- Inter-Department Program of Biomedical Sciences, Faculty of Graduate School, Chulalongkorn University, Bangkok, Thailand
- Genome Technology Research Unit, National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Sissades Tongsima
- Genome Technology Research Unit, National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Apiwat Mutirangura
- Center of Excellence in Molecular Genetics of Cancer and Human Diseases, Department of Anatomy, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- * E-mail:
| |
Collapse
|
15
|
Rosser JM, An W. L1 expression and regulation in humans and rodents. Front Biosci (Elite Ed) 2012; 4:2203-25. [PMID: 22202032 DOI: 10.2741/537] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Long interspersed elements type 1 (LINE-1s, or L1s) have impacted mammalian genomes at multiple levels. L1 transcription is mainly controlled by its 5' untranslated region (5'UTR), which differs significantly among active human and rodent L1 families. In this review, L1 expression and its regulation are examined in the context of human and rodent development. First, endogenous L1 expression patterns in three different species-human, rat, and mouse-are compared and contrasted. A detailed account of relevant experimental evidence is presented according to the source material, such as cell lines, tumors, and normal somatic and germline tissues from different developmental stages. Second, factors involved in the regulation of L1 expression at both transcriptional and posttranscriptional levels are discussed. These include transcription factors, DNA methylation, PIWI-interacting RNAs (piRNAs), RNA interference (RNAi), and posttranscriptional host factors. Similarities and differences between human and rodent L1s are highlighted. Third, recent findings from transgenic mouse models of L1 are summarized and contrasted with those from endogenous L1 studies. Finally, the challenges and opportunities for L1 mouse models are discussed.
Collapse
Affiliation(s)
- James M Rosser
- School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164, USA
| | | |
Collapse
|
16
|
Belancio VP, Hedges DJ, Deininger P. Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health. Genome Res 2008; 18:343-58. [PMID: 18256243 DOI: 10.1101/gr.5558208] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transposable elements (TEs) have shared an exceptionally long coexistence with their host organisms and have come to occupy a significant fraction of eukaryotic genomes. The bulk of the expansion occurring within mammalian genomes has arisen from the activity of type I retrotransposons, which amplify in a "copy-and-paste" fashion through an RNA intermediate. For better or worse, the sequences of these retrotransposons are now wedded to the genomes of their mammalian hosts. Although there are several reported instances of the positive contribution of mobile elements to their host genomes, these discoveries have occurred alongside growing evidence of the role of TEs in human disease and genetic instability. Here we examine, with a particular emphasis on human retrotransposon activity, several newly discovered aspects of mammalian retrotransposon biology. We consider their potential impact on host biology as well as their ultimate implications for the nature of the TE-host relationship.
Collapse
Affiliation(s)
- Victoria P Belancio
- Tulane Cancer Center and Department of Epidemiology, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA
| | | | | |
Collapse
|
17
|
Ellis PJI, Ferguson L, Clemente EJ, Affara NA. Bidirectional transcription of a novel chimeric gene mapping to mouse chromosome Yq. BMC Evol Biol 2007; 7:171. [PMID: 17892569 PMCID: PMC2212661 DOI: 10.1186/1471-2148-7-171] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2007] [Accepted: 09/24/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The male-specific region of the mouse Y chromosome long arm (MSYq) contains three known highly multi-copy X-Y homologous gene families, Ssty1/2, Sly and Asty. Deletions on MSYq lead to teratozoospermia and subfertility or infertility, with a sex ratio skew in the offspring of subfertile MSYqdel males RESULTS We report the highly unusual genomic structure of a novel MSYq locus, Orly, and a diverse set of spermatid-specific transcripts arising from copies of this locus. Orly is composed of partial copies of Ssty1, Asty and Sly arranged in sequence. The Ssty1- and Sly-derived segments are in antisense orientation relative to each other, leading to bi-directional transcription of Orly. Genome search and phylogenetic tree analysis is used to determine the order of events in mouse Yq evolution. We find that Orly is the most recent gene to arise on Yq, and that subsequently there was massive expansion in copy number of all Yq-linked genes. CONCLUSION Orly has an unprecedented chimeric structure, and generates both "forward" (Orly) and "reverse" (Orlyos) transcripts arising from the promoters at each end of the locus. The region of overlap of known Orly and Orlyos transcripts is homologous to Sly intron 2. We propose that Orly may be involved in an intragenomic conflict between mouse X and Y chromosomes, and that this process underlies the massive expansion in copy number of the genes on MSYq and their X homologues.
Collapse
Affiliation(s)
- Peter JI Ellis
- Mammalian Molecular Genetics Group, University of Cambridge Department of Pathology, Tennis Court Rd., Cambridge, CB2 1QP, UK
| | - Lydia Ferguson
- Mammalian Molecular Genetics Group, University of Cambridge Department of Pathology, Tennis Court Rd., Cambridge, CB2 1QP, UK
| | - Emily J Clemente
- Mammalian Molecular Genetics Group, University of Cambridge Department of Pathology, Tennis Court Rd., Cambridge, CB2 1QP, UK
| | - Nabeel A Affara
- Mammalian Molecular Genetics Group, University of Cambridge Department of Pathology, Tennis Court Rd., Cambridge, CB2 1QP, UK
| |
Collapse
|
18
|
Penzkofer T, Dandekar T, Zemojtel T. L1Base: from functional annotation to prediction of active LINE-1 elements. Nucleic Acids Res 2005; 33:D498-500. [PMID: 15608246 PMCID: PMC539998 DOI: 10.1093/nar/gki044] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
L1Base is a dedicated database containing putatively active LINE-1 (L1) insertions residing in human and rodent genomes that are as follows: (i) intact in the two open reading frames (ORFs), full-length L1s (FLI-L1s) and (ii) intact ORF2 but disrupted ORF1 (ORF2-L1s). In addition, due to their regulatory potential, the full-length (>6000 bp) non-intact L1s (FLnI-L1s) were also included in the database. Application of a novel annotation methodology, L1Xplorer, allowed in-depth annotation of functional sequence features important for L1 activity, such as transcription factor binding sites and amino acid residues. The L1Base is available online at http://l1base.molgen.mpg.de. In addition, the data stored in the database can be accessed from the Ensembl web browser via a DAS service (http://l1das.molgen.mpg.de:8080/das).
Collapse
Affiliation(s)
- Tobias Penzkofer
- Department of Computational Molecular Biology, Max-Planck-Institute for Molecular Genetics, Ihnestrasse 73, D-14195 Berlin, Germany
| | | | | |
Collapse
|
19
|
Kolosha VO, Martin SL. High-affinity, non-sequence-specific RNA binding by the open reading frame 1 (ORF1) protein from long interspersed nuclear element 1 (LINE-1). J Biol Chem 2003; 278:8112-7. [PMID: 12506113 DOI: 10.1074/jbc.m210487200] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Long interspersed nuclear element 1 (LINE-1 or L1) is an interspersed repeated DNA found in mammalian genomes. L1 achieved its high copy number by retrotransposition, a process that requires the two L1-encoded proteins, ORF1p and ORF2p. The role of ORF1p in the retrotransposition cycle is incompletely understood, but it is known to bind single-stranded nucleic acids and act as a nucleic acid chaperone. This study assesses the nature and specificity of the interaction of ORF1p with RNA. Results of coimmunoprecipitation experiments demonstrate that ORF1p preferentially binds a single T1 nuclease digestion product of 38 nucleotides (nt) within the full-length mouse L1 transcript. The 38-nt fragment is localized within L1 RNA and found to be sufficient for binding by ORF1p but not necessary, because its complement is also efficiently coimmunoprecipitated, as are all sequences 38 nt or longer. Results of nitrocellulose filter-binding assays demonstrate that the binding of ORF1p to RNA does not require divalent cations but is sensitive to the concentration of monovalent cation. Both sense and antisense transcripts bind with apparent K(D)s in the low nanomolar range. The results of both types of assay unambiguously support the conclusion that purified ORF1p from mouse L1 is a high-affinity, non-sequence-specific RNA binding protein.
Collapse
Affiliation(s)
- Vladimir O Kolosha
- Department of Cellular and Structural Biology and Program in Molecular Biology, University of Colorado School of Medicine, Denver 80262, USA
| | | |
Collapse
|
20
|
Benihoud K, Bonardelle D, Soual-Hoebeke E, Durand-Gasselin I, Emilie D, Kiger N, Bobé P. Unusual expression of LINE-1 transposable element in the MRL autoimmune lymphoproliferative syndrome-prone strain. Oncogene 2002; 21:5593-600. [PMID: 12165858 DOI: 10.1038/sj.onc.1205730] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2002] [Revised: 05/22/2002] [Accepted: 06/07/2002] [Indexed: 11/08/2022]
Abstract
LINE-1 are endogenous mobile genetic elements that have dispersed and accumulated in the genomes of eukaryotes via germline transposition, with up to 100,000 copies in mammalian genomes. LINE-1 elements transpose by reverse transcription of their own transcript. Transposition requires synthesis of a full-length, sense-strand transcripts and proteins encoded by open reading frame (ORF) 1 and ORF2. Although severely repressed in most normal tissues, LINE-1 occasionally leads to disease by insertional mutagenesis. In the present study, Northern blot and in situ hybridization analyses revealed a template-strand transcription of LINE-1 ORF2 (encoding reverse transcriptase, RT) in lymphoid organs and the liver from MRL-+/+ and Fas-deficient MRL/lpr strains and their normal ancestors. While these sense transcripts are restricted to the nucleus in hepatocytes, they are also found in the cytoplasm in splenocytes. In contrast to transcription, ORF2 translation was detected only in MRL strains, as shown by the cytoplasmic labelling of splenic cells obtained with a monoclonal antibody recognizing the LINE-1 RT. This antibody coprecipitated two proteins of 45 and 12 kDa from MRL/lpr lymphoid organ lysates that were removed by pretreatment with anti-beta2-microglobulin antiserum, suggesting a structural association between a LINE-1 product and a major histocompatibility complex class I or class I-like molecule.
Collapse
Affiliation(s)
- Karim Benihoud
- INSERM U 267, 14, avenue Paul-Vaillant-Couturier, 94807 Villejuif, France
| | | | | | | | | | | | | |
Collapse
|
21
|
Goodier JL, Ostertag EM, Du K, Kazazian HH. A novel active L1 retrotransposon subfamily in the mouse. Genome Res 2001; 11:1677-85. [PMID: 11591644 PMCID: PMC311137 DOI: 10.1101/gr.198301] [Citation(s) in RCA: 169] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Unlike human L1 retrotransposons, the 5' UTR of mouse L1 elements contains tandem repeats of approximately 200 bp in length called monomers. Multiple L1 subfamilies exist in the mouse which are distinguished by their monomer sequences. We previously described a young subfamily, called the T(F) subfamily, which contains approximately 1800 active elements among its 3000 full-length members. Here we characterize a novel subfamily of mouse L1 elements, G(F), which has unique monomer sequence and unusual patterns of monomer organization. A majority of these G(F) elements also have a unique length polymorphism in ORF1. Polymorphism analysis of G(F) elements in various mouse subspecies and laboratory strains revealed that, like T(F), the G(F) subfamily is young and expanding. About 1500 full-length G(F) elements exist in the diploid mouse genome and, based on the results of a cell culture assay, approximately 400 G(F) elements are potentially capable of retrotransposition. We also tested 14 A-type subfamily elements in the assay and estimate that about 900 active A elements may be present in the mouse genome. Thus, it is now known that there are three large active subfamilies of mouse L1s; T(F), A, and G(F), and that in total approximately 3000 full-length elements are potentially capable of active retrotransposition. This number is in great excess to the number of L1 elements thought to be active in the human genome.
Collapse
Affiliation(s)
- J L Goodier
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA.
| | | | | | | |
Collapse
|
22
|
Busseau I, Berezikov E, Bucheton A. Identification of Waldo-A and Waldo-B, two closely related non-LTR retrotransposons in Drosophila. Mol Biol Evol 2001; 18:196-205. [PMID: 11158378 DOI: 10.1093/oxfordjournals.molbev.a003793] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We have identified two novel, closely related subfamilies of non-long-terminal-repeat (non-LTR) retrotransposons in Drosophila melanogaster, the Waldo-A and Waldo-B subfamilies, that are in the same lineage as site-specific LTR retrotransposons of the R1 clade. Both contain potentially active copies with two large open reading frames, having coding capacities for a nucleoprotein as well as endonuclease and reverse transcriptase activities. Many copies are truncated at the 5' end, and most are surrounded by target site duplications of variable lengths. Elements of both subfamilies have a nonrandom distribution in the genome, often being inserted within or very close to (CA)(n) arrays. At the DNA level, the longest elements of Waldo-A and Waldo-B are 69% identical on their entire length, except for the 5' untranslated regions, which have a mosaic organization, suggesting that one arose from the other following new promoter acquisition. This event occurred before the speciation of the D. melanogaster subgroup of species, since both Waldo-A and Waldo-B coexist in other species of this subgroup.
Collapse
Affiliation(s)
- I Busseau
- Institut de Génétique Humaine, Centre National de la Recherche Scientifique, 141 rue de la Cardonille, 34396 Montpellier cedex 05, France.
| | | | | |
Collapse
|
23
|
Martin SL, Li J, Weisz JA. Deletion analysis defines distinct functional domains for protein-protein and nucleic acid interactions in the ORF1 protein of mouse LINE-1. J Mol Biol 2000; 304:11-20. [PMID: 11071806 DOI: 10.1006/jmbi.2000.4182] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
LINE-1, or L1, is a non-LTR retrotransposon in mammals. Retrotransposition of L1 requires the action of two element-encoded proteins, ORF1p and ORF2p. ORF2p provides essential enzymatic activities for the reverse transcription and integration of a newly transposed copy of L1, whereas the exact role of ORF1p is less well understood. The 43 kDa ORF1p copurifies as a large complex with L1 RNA in extracts of human and mouse cells. Mouse ORF1p purified from Escherichia coli binds RNA and single-stranded DNA in vitro, exhibits nucleic acid chaperone activity, and is capable of protein-protein interaction. In this study we create a series of deletions in the ORF1 sequence, express the truncated proteins and examine their activities to delineate the region of ORF1p responsible for these different functions. By both yeast two-hybrid analysis and GST pull-down assay, the protein-protein interaction domain is defined as a coiled-coil domain that encompasses about one third of the protein near its N terminus. Based on data obtained with UV-cross-linking, electrophoretic mobility-shift assay and an annealing assay, the C-terminal one third of ORF1p is both necessary and sufficient for nucleic acid binding and to promote annealing of complementary oligonucleotides. Separation of these activities into different domains of ORF1p will facilitate detailed biochemical analyses of the structure and function of this protein and understanding of its role during L1 retrotransposition.
Collapse
Affiliation(s)
- S L Martin
- Department of Cellular and Structural Biology, University of Colorado School of Medicine, Denver, CO 80262, USA.
| | | | | |
Collapse
|
24
|
Furano AV. The biological properties and evolutionary dynamics of mammalian LINE-1 retrotransposons. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 2000; 64:255-94. [PMID: 10697412 DOI: 10.1016/s0079-6603(00)64007-2] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Mammalian LINE-1 (L1) elements belong to the superfamily of autonomously replicating retrotransposable elements that lack the long terminal repeated (LTR) sequences typical of retroviruses and retroviral-like retrotransposons. The non-LTR superfamily is very ancient and L1-like elements are ubiquitous in nature, having been found in plants, fungi, invertebrates, and various vertebrate classes from fish to mammals. L1 elements have been replicating and evolving in mammals for at least the past 100 million years and now constitute 20% or more of some mammalian genomes. Therefore, L1 elements presumably have had a profound, perhaps defining, effect on the evolution, structure, and function of mammalian genomes. L1 elements contain regulatory signals and encode two proteins: one is an RNA-binding protein and the second one presumably functions as an integrase-replicase, because it has both endonuclease and reverse transcriptase activities. This work reviews the structure and biological properties of L1 elements, including their regulation, replication, evolution, and interaction with their mammalian hosts. Although each of these processes is incompletely understood, what is known indicates that they represent challenging and fascinating biological phenomena, the resolution of which will be essential for fully understanding the biology of mammals.
Collapse
Affiliation(s)
- A V Furano
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
25
|
DeBerardinis RJ, Kazazian HH. Analysis of the promoter from an expanding mouse retrotransposon subfamily. Genomics 1999; 56:317-23. [PMID: 10087199 DOI: 10.1006/geno.1998.5729] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The mouse genome contains several subfamilies of the retrotransposon L1. One subfamily, TF, contains 4000-5000 full-length members and is expanding due to retrotransposition of a large number of active elements. Here we studied the TF 5' untranslated region (UTR), which contains promoter activity required for subfamily expression. Using reporter assays, we show that promoter activity is derived from TF-specific monomer sequences and is proportional to the number of monomers in the 5' UTR. These data suggest that nearly all full-length TF elements in the mouse genome are currently competent for expression. We aligned the sequences of 53 monomers to generate a consensus TF monomer and determined that most TF elements are truncated near a potential binding site for a transcription initiation factor. We also determined that much of the sequence variation among TF monomers results from transition mutations at CpG dinucleotides, suggesting that genomic TF 5' UTRs are methylated at CpGs.
Collapse
Affiliation(s)
- R J DeBerardinis
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | | |
Collapse
|
26
|
Saxton JA, Martin SL. Recombination between subtypes creates a mosaic lineage of LINE-1 that is expressed and actively retrotransposing in the mouse genome. J Mol Biol 1998; 280:611-22. [PMID: 9677292 DOI: 10.1006/jmbi.1998.1899] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
LINE-1, or L1, elements are retrotransposons that have amplified to high-copy number during the evolution of mammals. L1 appears to amplify in waves, spawning large numbers of progeny such that elements with distinct sequence features dominate the dispersal process in a given window of time. This process generates discrete subfamilies of L1 within mammalian genomes, with the oldest being remnants, or fossils, of earlier waves of amplification. In mice, at least three distinct subfamilies of L1 were distinguished by their unique 5' ends, A, F and V. These subfamilies amplified at distinct times in the evolution of mice, with A being the youngest and V the oldest; both V and F subfamilies were believed extinct. Recent data established that a variant of the F family, TF, is actively retrotransposing. We demonstrate here that members of the TF subfamily are abundantly expressed in mouse cells and encode the major protein constituent of L1 ribonucleoprotein particles. Although members of the TF subfamily are not as numerous in the genomes of laboratory mice as are members of the older A and F subfamilies, they appear to have been activated some time ago during mouse evolution, in the common ancestor of Mus spretus and Mus domesticus. Phylogenetic analysis demonstrates that this modern, active form of TF-type L1 has a composite evolutionary history, showing evidence of multiple recombinations between distinct L1 variants, including members of the A and F subfamilies.
Collapse
Affiliation(s)
- J A Saxton
- Department of Cellular and Structural Biology, B111, University of Colorado School of Medicine, 4200 E. Ninth Ave., Denver, CO 80262, USA
| | | |
Collapse
|
27
|
Martin SL, Li J, Epperson LE, Lieberman B. Functional reverse transcriptases encoded by A-type mouse LINE-1: defining the minimal domain by deletion analysis. Gene 1998; 215:69-75. [PMID: 9666081 DOI: 10.1016/s0378-1119(98)00252-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Long interspersed elements, or LINEs, are retrotransposons that move via an RNA intermediate. In mice, one polymorphic variant of L1 has amplified relatively recently, giving rise to the A-type subfamily in species belonging to the genus and subgenus Mus. Retrotransposition of LINE-1 (L1) requires the function of the L1-encoded reverse transcriptase that is produced from open reading frame 2 (ORF2). Here, we employ a convenient yeast genetic assay to determine the reverse transcriptase activity of the ORF2 obtained from three A-type L1 elements: one, a cDNA from the RNA in ribonucleoprotein particles; another with a purported inactivating mutation; and the third, a hypothetical ancestral construct. Because there are no examples of A-type elements that have transposed recently to inactivate a gene, this assay is the first step towards demonstrating the functional capability of mouse A-type LINE-1 elements. One of the three elements was believed to have been inactivated during evolution by the substitution of leucine for a highly conserved phenylalanine or tryptophan residue among known reverse transcriptases. This mutation did not inactivate the L1 reverse transcriptase in the yeast assay; thus, all three of the elements tested encoded reverse transcriptase activity. We further examined the minimal reverse transcriptase domain within ORF2 by creating a series of deletions. The results demonstrate that removal of the L1 endonuclease domain from the N-terminal region of ORF2 does not affect reverse transcriptase activity as determined by this assay, and that approximately half of the ORF2 coding sequence from mouse A-type L1 elements is required for functional reverse transcriptase.
Collapse
Affiliation(s)
- S L Martin
- University of Colorado School of Medicine, Department of Cellular, Structural Biology, Program in Molecular Biology, 4200 E. Ninth Avenue, Box B111, Denver, CO 80262, USA.
| | | | | | | |
Collapse
|
28
|
Naas TP, DeBerardinis RJ, Moran JV, Ostertag EM, Kingsmore SF, Seldin MF, Hayashizaki Y, Martin SL, Kazazian HH. An actively retrotransposing, novel subfamily of mouse L1 elements. EMBO J 1998; 17:590-7. [PMID: 9430649 PMCID: PMC1170408 DOI: 10.1093/emboj/17.2.590] [Citation(s) in RCA: 151] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Retrotransposition of LINEs and other retroelements increases repetition in mammalian genomes and can cause deleterious mutations. Recent insertions of two full-length L1s, L1spa and L1Orl, caused the disease phenotypes of the spastic and Orleans reeler mice respectively. Here we show that these two recently retrotransposed L1s are nearly identical in sequence, have two open reading frames and belong to a novel subfamily related to the ancient F subfamily. We have named this new subfamily TF (for transposable) and show that many full-length members of this family are present in the mouse genome. The TF 5' untranslated region has promoter activity, and TF-type RNA is abundant in cytoplasmic ribonucleoprotein particles, which are likely intermediates in retrotransposition. Both L1spa and L1Orl have reverse transcriptase activity in a yeast-based assay and retrotranspose at high frequency in cultured cells. Together, our data indicate that the TF subfamily of L1s contains a major class of mobile elements that is expanding in the mouse genome.
Collapse
Affiliation(s)
- T P Naas
- Department of Genetics, University of Pennsylvania, School of Medicine, Philadelphia, PA 19104, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Cabot EL, Angeletti B, Usdin K, Furano AV. Rapid evolution of a young L1 (LINE-1) clade in recently speciated Rattus taxa. J Mol Evol 1997; 45:412-23. [PMID: 9321420 DOI: 10.1007/pl00006246] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
L1 elements are retrotransposons that have been replicating and evolving in mammalian genomes since before the mammalian radiation. Rattus norvegicus shares the young L1mlvi2 clade only with its sister taxon, Rattus cf moluccarius. Here we compared the L1mlvi2 clade in these recently diverged species and found that it evolved rapidly into closely related but distinct clades: the L1mlvi2-rm clade (or subfamily), characterized here from R. cf moluccarius, and the L1mlvi2-rn clade, originally described in R. norvegicus. In addition to other differences, these clades are distinguished by a cluster of amino acid replacement substitutions in ORF I. Both rat species contain the L1mlvi2-rm clade, but the L1mlvi2-rn clade is restricted to R. norvegicus. Therefore, the L1mlvi2-rm clade arose prior to the divergence of R. norvegicus and R. cf moluccarius, and the L1mlvi2-rn clade amplified after their divergence. The total number of L1mlvi2-rm elements in R. cf moluccarius is about the same as the sum of the L1mlvi2-rm and L1mlvi2-rn elements in R. norvegicus. The possibility that L1 amplification is in some way limited so that the two clades compete for replicative supremacy as well as the implications of the other distinguishing characteristic of the L1mlvi2-rn and L1mlvi2-rm clades are discussed.
Collapse
Affiliation(s)
- E L Cabot
- Section on Genomic Structure and Function, NIDDK, NIH, Bethesda, MD 20892-0830, USA.
| | | | | | | |
Collapse
|
30
|
Furano AV, Usdin K. DNA "fossils" and phylogenetic analysis. Using L1 (LINE-1, long interspersed repeated) DNA to determine the evolutionary history of mammals. J Biol Chem 1995; 270:25301-4. [PMID: 7592685 DOI: 10.1074/jbc.270.43.25301] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Affiliation(s)
- A V Furano
- Section on Genomic Structure and Function, NIDDK, National Institutes of Health, Bethesda, Maryland 20893-0830, USA
| | | |
Collapse
|
31
|
Trelogan SA, Martin SL. Tightly regulated, developmentally specific expression of the first open reading frame from LINE-1 during mouse embryogenesis. Proc Natl Acad Sci U S A 1995; 92:1520-4. [PMID: 7878012 PMCID: PMC42551 DOI: 10.1073/pnas.92.5.1520] [Citation(s) in RCA: 133] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
LINE-1 (L1) has achieved its status as a middle repetitive DNA family in mammalian genomes by duplicative transposition. Although transposition may occur in any cell type, expression and transposition of a full-length functional element in the germ line are necessary for evolutionarily significant propagation of L1. An immunohistochemical analysis of adult mouse ovaries and mouse postimplantation embryos revealed expression of L1 open reading frame 1 in the germ line as well as in steroidogenic tissues. These results demonstrate that L1 expression is controlled by a tightly regulated temporal and spatial program of events during development and imply that multiple loci of L1 in the mouse genome are active for expression.
Collapse
Affiliation(s)
- S A Trelogan
- University of Colorado Health Sciences Center, Department of Cellular and Structural Biology, Denver 80262
| | | |
Collapse
|
32
|
Martin SL. Characterization of a LINE-1 cDNA that originated from RNA present in ribonucleoprotein particles: implications for the structure of an active mouse LINE-1. Gene 1995; 153:261-6. [PMID: 7533116 DOI: 10.1016/0378-1119(94)00785-q] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Full-length, sense-strand, long interspersed element-1 (LINE-1 or L1) RNA is found as an RNA-protein complex in mouse embryonal carcinoma cells. Since this complex is a likely intermediate in LINE-1 transposition, its RNA may be enriched for the functional, or active, subset of mouse L1 sequences. For this reason, a cDNA library was constructed from RNA prepared from these ribonucleoprotein particles. The isolation and complete DNA sequence of one clone that is a strong candidate to be a functional version of mouse L1 is reported here. The structure of this element suggests a revision of the predicted sequence of an active mouse L1 and provides a tag that can be used to isolate its locus in the genome.
Collapse
Affiliation(s)
- S L Martin
- Department of Cellular and Structural Biology, University of Colorado School of Medicine, Denver 80262
| |
Collapse
|
33
|
Biessmann H, Kasravi B, Bui T, Fujiwara G, Champion LE, Mason JM. Comparison of two active HeT-A retroposons of Drosophila melanogaster. Chromosoma 1994; 103:90-8. [PMID: 8055715 DOI: 10.1007/bf00352317] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
HeT-A elements are Drosophila melanogaster LINE-like retroposons that transpose to broken chromosome ends by attaching themselves with an oligo(A) tail. Since this family of elements is believed to be involved in the vital function of telomere elongation in Drosophila, it is important to understand their transposition mechanism and the molecular aspects of activity. By comparison of several elements we have defined here the unit length of HeT-A elements to be approximately 6 kb. Also, we studied an active HeT-A element that had transposed very recently to the end of a terminally deleted X chromosome. The 12 kb of newly transposed DNA consisted of a tandem array of three different HeT-A elements joined by oligo(A) tails to each other and to the chromosome end broken in the yellow gene. Such an array may have transposed as a single unit or resulted from rapid successive transpositions of individual HeT-A elements. By sequence comparison with another recently transposed HeT-A element, conserved domains in the single open reading frame (ORF), encoding a gag-like polypeptide, of these elements were defined. We conclude that for transposition an intact ORF is required in cis, while the reverse transcriptase is not encoded on the HeT-A element but is provided in trans. This would make HeT-A elements dependent on an external reverse transcriptase for transposition and establish control of the genome over the activity of HeT-A elements. This distinguishes the Drosophila HeT-A element, which has been implicated in Drosophila telomere elongation, from the other, 'selfish' LINE-like elements.
Collapse
Affiliation(s)
- H Biessmann
- Developmental Biology Center, University of California, Irvine 92717
| | | | | | | | | | | |
Collapse
|
34
|
Adey NB, Tollefsbol TO, Sparks AB, Edgell MH, Hutchison CA. Molecular resurrection of an extinct ancestral promoter for mouse L1. Proc Natl Acad Sci U S A 1994; 91:1569-73. [PMID: 8108446 PMCID: PMC43201 DOI: 10.1073/pnas.91.4.1569] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The F-type subfamily of LINE-1 or L1 retroposons [for long interspersed (repetitive) element 1] was dispersed in the mouse genome several million years ago. This subfamily appears to be both transcriptionally and transpositionally inactive today and therefore may be considered evolutionarily extinct. We hypothesized that these F-type L1s are inactive because of the accumulation of mutations. To test this idea we used phylogenetic analysis to deduce the sequence of a transpositionally active ancestral F-type promoter, resurrected it by chemical synthesis, and showed that it has promoter activity. In contrast, F-type sequences isolated from the modern genome are inactive. This approach, in which the automated DNA synthesizer is used as a "time machine," should have broad application in testing models derived from evolutionary studies.
Collapse
Affiliation(s)
- N B Adey
- Department of Microbiology and Immunology, University of North Carolina at Chapel Hill 27599
| | | | | | | | | |
Collapse
|
35
|
Thayer RE, Singer MF, Fanning TG. Undermethylation of specific LINE-1 sequences in human cells producing a LINE-1-encoded protein. Gene 1993; 133:273-7. [PMID: 7693554 DOI: 10.1016/0378-1119(93)90651-i] [Citation(s) in RCA: 69] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Nucleotide sequences near the 5' ends of some long interspersed elements-1 (LINE-1) from Homo sapiens (L1Hs) are undermethylated in cell lines which produce a L1Hs-encoded protein. In contrast, these sequences are methylated in cell lines with little or no detectable L1Hs expression. The fact that the 5' end of L1Hs is differentially methylated in cells exhibiting different levels of L1Hs expression suggests that the methylation state of this region plays a role in L1Hs expression.
Collapse
Affiliation(s)
- R E Thayer
- Laboratory of Biochemistry, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| | | | | |
Collapse
|
36
|
Stanhope MJ, Tagle DA, Shivji MS, Hattori M, Sakaki Y, Slightom JL, Goodman M. Multiple L1 progenitors in prosimian primates: phylogenetic evidence from ORF1 sequences. J Mol Evol 1993; 37:179-89. [PMID: 8411207 DOI: 10.1007/bf02407354] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
One of the uncertainties regarding the evolution of L1 elements is whether there are numerous progenitor genes. We present phylogenetic evidence from ORF1 sequences of slow loris (Nycticebus coucang) and galago (Galago crassicaudatus) that there were at least two distinct progenitors, active at the same time, in the ancestor of this family of prosimian primates. A maximum parsimony analysis that included representative L1s from human, rabbit, and rodents, along with the prosimian sequences, revealed that one of the galago L1s (Gc11) grouped very strongly with the slow loris sequences. The remaining galago elements formed their own unique and strongly supported clade. An analysis of replacement and silent site changes for each link of the most parsimonious tree indicated that during the descent of the Gc11 sequence approximately two times more synonymous than nonsynonymous substitutions had occurred, implying that the Gc11 founder was functional for some time after the split of galago and slow loris. Strong purifying selection was also evident on the galago branch of the tree. These data indicate that there were two distinct and contemporaneous L1 progenitors in the lorisoid ancestor, evolving under purifying selection, that were retained as functional L1s in the galago lineage (and presumably also in the slow loris). The prosimian ORF1 sequences could be further subdivided into subfamilies. ORF1 sequences from both the galago and slow loris have a premature termination codon near the 3' end, not shared by the other mammalian sequences, that shortens the open reading frame by 288 bp. An analysis of synonymous and nonsynonymous substitutions for the 5' and 3' portions, that included intra- and inter-subfamily comparisons, as well as comparisons among the other mammalian sequences, suggested that this premature stop codon is a prosimian acquisition that has rendered the 3' portion of ORF1 in these primates noncoding.
Collapse
Affiliation(s)
- M J Stanhope
- Department of Anatomy and Cell Biology, Wayne State University, Detroit, MI 48201
| | | | | | | | | | | | | |
Collapse
|
37
|
Pascale E, Liu C, Valle E, Usdin K, Furano AV. The evolution of long interspersed repeated DNA (L1, LINE 1) as revealed by the analysis of an ancient rodent L1 DNA family. J Mol Evol 1993; 36:9-20. [PMID: 8433380 DOI: 10.1007/bf02407302] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
All modern mammals contain a distinctive, highly repeated (> or = 50,000 members) family of long interspersed repeated DNA called the L1 (LINE 1) family. While the modern L1 families were derived from a common ancestor that predated the mammalian radiation approximately 80 million years ago, most of the members of these families were generated within the last 5 million years. However, recently we demonstrated that modern murine (Old World rats and mice) genomes share an older long interspersed repeated DNA family that we called Lx. Here we report our analysis of the DNA sequence of Lx family members and the relationship of this family to the modern L1 families in mouse and rat. The extent of DNA sequence divergence between Lx members indicates that the Lx amplification occurred about 12 million years ago, around the time of the murine radiation. Parsimony analysis revealed that Lx elements were ancestral to both the modern rat and mouse L1 families. However, we found that few if any of the evolutionary intermediates between the Lx and the modern L1 families were extensively amplified. Because the modern L1 families have evolved under selective pressure, the evolutionary intermediates must have been capable of replication. Therefore, replication-competent L1 elements can reside in genomes without undergoing extensive amplification. We discuss the bearing of our findings on the evolution of L1 DNA elements and the mammalian genome.
Collapse
Affiliation(s)
- E Pascale
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892
| | | | | | | | | |
Collapse
|
38
|
Minakami R, Kurose K, Etoh K, Furuhata Y, Hattori M, Sakaki Y. Identification of an internal cis-element essential for the human L1 transcription and a nuclear factor(s) binding to the element. Nucleic Acids Res 1992; 20:3139-45. [PMID: 1320255 PMCID: PMC312450 DOI: 10.1093/nar/20.12.3139] [Citation(s) in RCA: 81] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
L1 (LINE-1) is a long interspersed repetitive sequence derived from a retrotransposon. Transfection studies using the CAT gene as a reporter demonstrated that the first 155bp in the human L1 sequence contains an element(s) responsible for the promoter activity in HeLa cells. The transcription was shown to initiate at the first nucleotide of the L1 sequence in the transgene. Three prominent nuclear protein binding sites were found in the 5' region of the L1 sequence by DNaseI footprint analysis. One of the binding sites, designated as site A located at +3 to +26, was shown to be essential for the L1 transcription because the mutation at the site A caused almost complete loss of the promoter activity. A sequence AAGATGGCC at +11 to +19 in the site A was defined as a target core element for the protein binding. The site A-binding protein (designated TFL1-A) was found in various types of cells including an embryonic teratocarcinoma cell line. These results indicate that an internal short element located at the very 5' terminal of L1 sequence and the nuclear factor binding to the element play a crucial role in the transcription of human L1.
Collapse
Affiliation(s)
- R Minakami
- Research Laboratory for Genetic Information, Kyushu University, Fukuoka, Japan
| | | | | | | | | | | |
Collapse
|
39
|
Schichman SA, Severynse DM, Edgell MH, Hutchison CA. Strand-specific LINE-1 transcription in mouse F9 cells originates from the youngest phylogenetic subgroup of LINE-1 elements. J Mol Biol 1992; 224:559-74. [PMID: 1314898 DOI: 10.1016/0022-2836(92)90544-t] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
LINE-1 (L1) is a mammalian family of highly repeated DNA sequences that are members of a class of transposable elements whose movement involves an RNA intermediate. Both structural and evolutionary data indicate that the L1 family consists of a small number of active transposable elements interspersed with a large number of L1 pseudogenes. In the mouse, the longest, characterized L1 sequences span about 7000 base-pairs and contain two long open reading frames. Two subfamilies of mouse L1 elements, A and F, have been defined on the basis of the type of putative transcriptional regulatory sequence found at the 5' end. In order to identify a transcribed subset of L1 elements in mouse F9 teratocarcinoma cells, we have examined the strand-specificity of L1 transcription by Northern analysis and compared the open reading frame-1 sequences of ten A-type cDNAs with fifteen genomic A-type L1 elements. Transcripts containing A-type sequence are far more abundant than those containing F-type sequence. Although the majority of L1 RNA in F9 cells appears to be transcribed non-specifically from both strands, our results provide evidence for a subpopulation of variable length, strand-specific transcripts arising from A-type transcriptional regulatory sequences. F9 cell cDNA sequences, which share greater than 99.5% sequence identity with one another, represent a homogeneous subset of the genomic L1 population. Examination of genomic mouse L1 sequences reveals three types of length polymorphism in a defined segment of the first open reading frame. Phylogenetic analysis shows a correlation between the type of length polymorphism in the first open reading frame and the relative age of an individual A-type genomic L1 element. Comparison of the cDNA and genomic sequences indicates that the youngest subgroup of A-type L1 elements is preferentially transcribed in F9 cells. This subgroup may be currently dominating the L1 dispersal process in mice.
Collapse
Affiliation(s)
- S A Schichman
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27599
| | | | | | | |
Collapse
|
40
|
Adey NB, Schichman SA, Hutchison CA, Edgell MH. Composite of A and F-type 5' terminal sequences defines a subfamily of mouse LINE-1 elements. J Mol Biol 1991; 221:367-73. [PMID: 1920423 DOI: 10.1016/0022-2836(91)80057-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The 5' terminus of full-length L1 elements contains transcriptional control sequences. In mouse L1 (L1Md) elements, these sequences exist as an array of tandem direct repeats. Two types of repeat units, termed A-monomers and F-monomers, have been reported. Both monomers are about 200 bp in length but share no significant sequence homology. Previous studies have identified L1Md elements containing either A or F-monomers but not both. Here we describe three "composite" L1Md elements that contain both types of monomer sequence. Two of these composite L1Md elements are highly homologous and share the same structural rearrangements, implying that they arose from a common ancestor that has the same composite 5' end.
Collapse
Affiliation(s)
- N B Adey
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 72599
| | | | | | | |
Collapse
|