1
|
Rosser JM, An W. L1 expression and regulation in humans and rodents. Front Biosci (Elite Ed) 2012; 4:2203-25. [PMID: 22202032 DOI: 10.2741/537] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Long interspersed elements type 1 (LINE-1s, or L1s) have impacted mammalian genomes at multiple levels. L1 transcription is mainly controlled by its 5' untranslated region (5'UTR), which differs significantly among active human and rodent L1 families. In this review, L1 expression and its regulation are examined in the context of human and rodent development. First, endogenous L1 expression patterns in three different species-human, rat, and mouse-are compared and contrasted. A detailed account of relevant experimental evidence is presented according to the source material, such as cell lines, tumors, and normal somatic and germline tissues from different developmental stages. Second, factors involved in the regulation of L1 expression at both transcriptional and posttranscriptional levels are discussed. These include transcription factors, DNA methylation, PIWI-interacting RNAs (piRNAs), RNA interference (RNAi), and posttranscriptional host factors. Similarities and differences between human and rodent L1s are highlighted. Third, recent findings from transgenic mouse models of L1 are summarized and contrasted with those from endogenous L1 studies. Finally, the challenges and opportunities for L1 mouse models are discussed.
Collapse
Affiliation(s)
- James M Rosser
- School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164, USA
| | | |
Collapse
|
2
|
Kirilyuk A, Tolstonog GV, Damert A, Held U, Hahn S, Löwer R, Buschmann C, Horn AV, Traub P, Schumann GG. Functional endogenous LINE-1 retrotransposons are expressed and mobilized in rat chloroleukemia cells. Nucleic Acids Res 2007; 36:648-65. [PMID: 18073200 PMCID: PMC2241872 DOI: 10.1093/nar/gkm1045] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
LINE-1 (L1) is a highly successful autonomous non-LTR retrotransposon and a major force shaping mammalian genomes. Although there are about 600 000 L1 copies covering 23% of the rat genome, full-length rat L1s (L1Rn) with intact open reading frames (ORFs) representing functional master copies for retrotransposition have not been identified yet. In conjunction with studies to elucidate the role of L1 retrotransposons in tumorigenesis, we isolated and characterized 10 different cDNAs from transcribed full-length L1Rn elements in rat chloroleukemia (RCL) cells, each encoding intact ORF1 proteins (ORF1p). We identified the first functional L1Rn retrotransposon from this pool of cDNAs, determined its activity in HeLa cells and in the RCL cell line the cDNAs originated from and demonstrate that it is mobilized in the tumor cell line in which it is expressed. Furthermore, we generated monoclonal antibodies directed against L1Rn ORF1 and ORF2-encoded recombinant proteins, analyzed the expression of L1-encoded proteins and found ORF1p predominantly in the nucleus. Our results support the hypothesis that the reported explosive amplification of genomic L1Rn sequences after their transcriptional activation in RCL cells is based on L1 retrotransposition. Therefore, L1 activity might be one cause for genomic instability observed during the progression of leukemia.
Collapse
Affiliation(s)
- Alexander Kirilyuk
- Max-Planck-Institut für Zellbiologie, Rosenhof, D-68526 Ladenburg, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Stuart RO, Bush KT, Nigam SK. Changes in global gene expression patterns during development and maturation of the rat kidney. Proc Natl Acad Sci U S A 2001; 98:5649-54. [PMID: 11331749 PMCID: PMC33267 DOI: 10.1073/pnas.091110798] [Citation(s) in RCA: 141] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2001] [Accepted: 03/06/2001] [Indexed: 11/18/2022] Open
Abstract
We set out to define patterns of gene expression during kidney organogenesis by using high-density DNA array technology. Expression analysis of 8,740 rat genes revealed five discrete patterns or groups of gene expression during nephrogenesis. Group 1 consisted of genes with very high expression in the early embryonic kidney, many with roles in protein translation and DNA replication. Group 2 consisted of genes that peaked in midembryogenesis and contained many transcripts specifying proteins of the extracellular matrix. Many additional transcripts allied with groups 1 and 2 had known or proposed roles in kidney development and included LIM1, POD1, GFRA1, WT1, BCL2, Homeobox protein A11, timeless, pleiotrophin, HGF, HNF3, BMP4, TGF-alpha, TGF-beta2, IGF-II, met, FGF7, BMP4, and ganglioside-GD3. Group 3 consisted of transcripts that peaked in the neonatal period and contained a number of retrotransposon RNAs. Group 4 contained genes that steadily increased in relative expression levels throughout development, including many genes involved in energy metabolism and transport. Group 5 consisted of genes with relatively low levels of expression throughout embryogenesis but with markedly higher levels in the adult kidney; this group included a heterogeneous mix of transporters, detoxification enzymes, and oxidative stress genes. The data suggest that the embryonic kidney is committed to cellular proliferation and morphogenesis early on, followed sequentially by extracellular matrix deposition and acquisition of markers of terminal differentiation. The neonatal burst of retrotransposon mRNA was unexpected and may play a role in a stress response associated with birth. Custom analytical tools were developed including "The Equalizer" and "eBlot," which contain improved methods for data normalization, significance testing, and data mining.
Collapse
Affiliation(s)
- R O Stuart
- Department of Medicine, Division of Nephrology-Hypertension, Cancer Center, University of California at San Diego, La Jolla, CA 92093, USA.
| | | | | |
Collapse
|
4
|
Cabot EL, Angeletti B, Usdin K, Furano AV. Rapid evolution of a young L1 (LINE-1) clade in recently speciated Rattus taxa. J Mol Evol 1997; 45:412-23. [PMID: 9321420 DOI: 10.1007/pl00006246] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
L1 elements are retrotransposons that have been replicating and evolving in mammalian genomes since before the mammalian radiation. Rattus norvegicus shares the young L1mlvi2 clade only with its sister taxon, Rattus cf moluccarius. Here we compared the L1mlvi2 clade in these recently diverged species and found that it evolved rapidly into closely related but distinct clades: the L1mlvi2-rm clade (or subfamily), characterized here from R. cf moluccarius, and the L1mlvi2-rn clade, originally described in R. norvegicus. In addition to other differences, these clades are distinguished by a cluster of amino acid replacement substitutions in ORF I. Both rat species contain the L1mlvi2-rm clade, but the L1mlvi2-rn clade is restricted to R. norvegicus. Therefore, the L1mlvi2-rm clade arose prior to the divergence of R. norvegicus and R. cf moluccarius, and the L1mlvi2-rn clade amplified after their divergence. The total number of L1mlvi2-rm elements in R. cf moluccarius is about the same as the sum of the L1mlvi2-rm and L1mlvi2-rn elements in R. norvegicus. The possibility that L1 amplification is in some way limited so that the two clades compete for replicative supremacy as well as the implications of the other distinguishing characteristic of the L1mlvi2-rn and L1mlvi2-rm clades are discussed.
Collapse
Affiliation(s)
- E L Cabot
- Section on Genomic Structure and Function, NIDDK, NIH, Bethesda, MD 20892-0830, USA.
| | | | | | | |
Collapse
|
5
|
Dawson A, Hartswood E, Paterson T, Finnegan DJ. A LINE-like transposable element in Drosophila, the I factor, encodes a protein with properties similar to those of retroviral nucleocapsids. EMBO J 1997; 16:4448-55. [PMID: 9250689 PMCID: PMC1170071 DOI: 10.1093/emboj/16.14.4448] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
I factors are members of the LINE-like family of transposable elements and move by reverse transcription of an RNA intermediate. Complete I factors contain two open reading frames. The amino acid sequence encoded by the first of these, ORF1, includes the motif CX2CX4HX4C that is characteristic of the nucleocapsid domain of retroviral gag polypeptides followed by a copy of the slightly different sequences CX2CX4HX6C and CX2CX9HX6C. The function of this protein is unknown. We have expressed this protein in Escherichia coli and Spodoptera frugiperda cells and have shown that it binds both DNA and RNA but without any evidence for sequence specificity. The properties of deletion derivatives of the protein indicate that more than one region is responsible for DNA binding and that the CCHC motif is not essential for this. The ORF1 protein expressed in either E. coli or Spodoptera cells forms high molecular weight structures that require the region of the protein including the CCHC motif for their formation. This protein can also accelerate the annealing of complementary single-stranded oligonucleotides. These results suggest that this protein may associate with the RNA transposition intermediates of the I factor to form particles that enter the nucleus during transposition and that it may stimulate both the priming of reverse transcription and integration. This may be generally true for the product of the first open reading frame of LINE-like elements.
Collapse
Affiliation(s)
- A Dawson
- Institute of Cell and Molecular Biology, University of Edinburgh, UK
| | | | | | | |
Collapse
|
6
|
Dukhanina OI, Dene H, Deng AY, Choi CR, Hoebee B, Rapp JP. Linkage map and congenic strains to localize blood pressure QTL on rat chromosome 10. Mamm Genome 1997; 8:229-35. [PMID: 9096100 DOI: 10.1007/s003359900399] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Our purposes were to develop a linkage map for rat Chromosome (Chr) 10, using chromosome-sorted DNA, and to construct congenic strains to localize blood pressure quantitative trait loci (QTL) on Chr 10 with the map. The linkage mapping panel consisted of three F2 populations totaling 418 rats. Thirty-two new and 29 known microsatellite markers were placed on the map, which spanned 88.9 centiMorgans (cM). The average distance between markers was 1.46 cM. No markers were separated by more than 6.8 cM. Four congenic strains were constructed by introgressing various segments of Chr 10 from the Milan normotensive strain (MNS) onto the background of the Dahl salt-sensitive (S) strain. A blood pressure QTL with a strong effect on blood pressure (35-42 mm Hg) when expressed on the S background was localized to a 31-cM region between D10Mco6 and D10Mcol. The region does not include the locus for inducible nitric oxide synthase (Nos2), which had been considered to be a candidate locus for the QTL.
Collapse
Affiliation(s)
- O I Dukhanina
- Department of Physiology and Molecular Medicine, Medical College of Ohio, P.O. Box 10008, Toledo, Ohio 43669, USA
| | | | | | | | | | | |
Collapse
|
7
|
Stanhope MJ, Tagle DA, Shivji MS, Hattori M, Sakaki Y, Slightom JL, Goodman M. Multiple L1 progenitors in prosimian primates: phylogenetic evidence from ORF1 sequences. J Mol Evol 1993; 37:179-89. [PMID: 8411207 DOI: 10.1007/bf02407354] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
One of the uncertainties regarding the evolution of L1 elements is whether there are numerous progenitor genes. We present phylogenetic evidence from ORF1 sequences of slow loris (Nycticebus coucang) and galago (Galago crassicaudatus) that there were at least two distinct progenitors, active at the same time, in the ancestor of this family of prosimian primates. A maximum parsimony analysis that included representative L1s from human, rabbit, and rodents, along with the prosimian sequences, revealed that one of the galago L1s (Gc11) grouped very strongly with the slow loris sequences. The remaining galago elements formed their own unique and strongly supported clade. An analysis of replacement and silent site changes for each link of the most parsimonious tree indicated that during the descent of the Gc11 sequence approximately two times more synonymous than nonsynonymous substitutions had occurred, implying that the Gc11 founder was functional for some time after the split of galago and slow loris. Strong purifying selection was also evident on the galago branch of the tree. These data indicate that there were two distinct and contemporaneous L1 progenitors in the lorisoid ancestor, evolving under purifying selection, that were retained as functional L1s in the galago lineage (and presumably also in the slow loris). The prosimian ORF1 sequences could be further subdivided into subfamilies. ORF1 sequences from both the galago and slow loris have a premature termination codon near the 3' end, not shared by the other mammalian sequences, that shortens the open reading frame by 288 bp. An analysis of synonymous and nonsynonymous substitutions for the 5' and 3' portions, that included intra- and inter-subfamily comparisons, as well as comparisons among the other mammalian sequences, suggested that this premature stop codon is a prosimian acquisition that has rendered the 3' portion of ORF1 in these primates noncoding.
Collapse
Affiliation(s)
- M J Stanhope
- Department of Anatomy and Cell Biology, Wayne State University, Detroit, MI 48201
| | | | | | | | | | | | | |
Collapse
|
8
|
Pascale E, Liu C, Valle E, Usdin K, Furano AV. The evolution of long interspersed repeated DNA (L1, LINE 1) as revealed by the analysis of an ancient rodent L1 DNA family. J Mol Evol 1993; 36:9-20. [PMID: 8433380 DOI: 10.1007/bf02407302] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
All modern mammals contain a distinctive, highly repeated (> or = 50,000 members) family of long interspersed repeated DNA called the L1 (LINE 1) family. While the modern L1 families were derived from a common ancestor that predated the mammalian radiation approximately 80 million years ago, most of the members of these families were generated within the last 5 million years. However, recently we demonstrated that modern murine (Old World rats and mice) genomes share an older long interspersed repeated DNA family that we called Lx. Here we report our analysis of the DNA sequence of Lx family members and the relationship of this family to the modern L1 families in mouse and rat. The extent of DNA sequence divergence between Lx members indicates that the Lx amplification occurred about 12 million years ago, around the time of the murine radiation. Parsimony analysis revealed that Lx elements were ancestral to both the modern rat and mouse L1 families. However, we found that few if any of the evolutionary intermediates between the Lx and the modern L1 families were extensively amplified. Because the modern L1 families have evolved under selective pressure, the evolutionary intermediates must have been capable of replication. Therefore, replication-competent L1 elements can reside in genomes without undergoing extensive amplification. We discuss the bearing of our findings on the evolution of L1 DNA elements and the mammalian genome.
Collapse
Affiliation(s)
- E Pascale
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892
| | | | | | | | | |
Collapse
|
9
|
Translation of the rat LINE bicistronic RNAs in vitro involves ribosomal reinitiation instead of frameshifting. Mol Cell Biol 1992. [PMID: 1380649 DOI: 10.1128/mcb.12.9.4242] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The genomic structure of the rat LINE (L1Rn) DNA element contains two overlapping open reading frames (ORFs) and apparently has a potential to code for a DNA/RNA-binding protein (in ORF1) and a reverse transcriptase (in ORF2). We have characterized a 1,630-bp L1Rn cDNA clone encompassing the overlapping ORFs and a 600-bp genomic fragment derived from a full-length L1Rn member and containing the beginning of ORF1. These DNAs were used to restore in part the ORF1-ORF2 organization of L1Rn after being cloned into the pSP65 vector under the control of SP6 polymerase promoter. To test whether L1Rn ORF1 and ORF2 are expressed as a fusion protein, a series of capped RNAs with progressive truncations containing one or both ORFs were prepared and translated in the rabbit reticulocyte lysate. Our analysis indicates that the expression of a putative reverse transcriptase-encoded L1Rn ORF2 in vitro is regulated by reinitiation or internal initiation of translation but not by ribosomal frameshifting.
Collapse
|
10
|
Ilves H, Kahre O, Speek M. Translation of the rat LINE bicistronic RNAs in vitro involves ribosomal reinitiation instead of frameshifting. Mol Cell Biol 1992; 12:4242-8. [PMID: 1380649 PMCID: PMC360333 DOI: 10.1128/mcb.12.9.4242-4248.1992] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The genomic structure of the rat LINE (L1Rn) DNA element contains two overlapping open reading frames (ORFs) and apparently has a potential to code for a DNA/RNA-binding protein (in ORF1) and a reverse transcriptase (in ORF2). We have characterized a 1,630-bp L1Rn cDNA clone encompassing the overlapping ORFs and a 600-bp genomic fragment derived from a full-length L1Rn member and containing the beginning of ORF1. These DNAs were used to restore in part the ORF1-ORF2 organization of L1Rn after being cloned into the pSP65 vector under the control of SP6 polymerase promoter. To test whether L1Rn ORF1 and ORF2 are expressed as a fusion protein, a series of capped RNAs with progressive truncations containing one or both ORFs were prepared and translated in the rabbit reticulocyte lysate. Our analysis indicates that the expression of a putative reverse transcriptase-encoded L1Rn ORF2 in vitro is regulated by reinitiation or internal initiation of translation but not by ribosomal frameshifting.
Collapse
Affiliation(s)
- H Ilves
- Laboratory of Molecular Genetics, Institute of Chemical Physics and Biophysics, Tartu, Estonia
| | | | | |
Collapse
|
11
|
Volinia S, Scapoli C, Gambari R, Barale R, Barrai I. A set of viral DNA decamers enriched in transcription control signals. Nucleic Acids Res 1991; 19:3733-40. [PMID: 1906607 PMCID: PMC328405 DOI: 10.1093/nar/19.13.3733] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
We studied the frequency distribution of oligonucleotides 10 bp long in a sample of 620 Kb of viral genomes, containing 102 sequences from GenBank, with the aim of detecting transcription control signals. Two thousand three hundred decamers had a frequency 10 times higher than the mean and were subjected to further statistical analysis. For each of the 2300 decamers (parents), we counted the individual frequencies of the 30 decamers differing from the parent by one base mutation (progeny) and then calculated two variance/mean chi squares for the progeny, with and without the parent. We then studied the distribution of the ratio between the two chi squares. Out of 2300 decamers, 10 times more frequent than average, 479 decamers had a chi square ratio of 1.9 or larger. In this final set, which corresponds to less than 0.05% of all possible decamers, 58 decamers were found to contain viral and eukaryotic transcription control elements, like NF-kB, Sp1 and others. Furthermore, this set contains an excess of signals of length 5, 6, 7, 8, 9 and 10, when compared to 150 random sets, bootstrapped from the same viral genomes.
Collapse
Affiliation(s)
- S Volinia
- Dipartimento di Biologia Evolutiva, Università di Ferrara, Italy
| | | | | | | | | |
Collapse
|
12
|
Pascale E, Valle E, Furano AV. Amplification of an ancestral mammalian L1 family of long interspersed repeated DNA occurred just before the murine radiation. Proc Natl Acad Sci U S A 1990; 87:9481-5. [PMID: 2251288 PMCID: PMC55190 DOI: 10.1073/pnas.87.23.9481] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Each mammalian genus examined so far contains 50,000-100,000 members of an L1 (LINE 1) family of long interspersed repeated DNA elements. Current knowledge on the evolution of L1 families presents a paradox because, although L1 families have been in mammalian genomes since before the mammalian radiation approximately 80 million years ago, most members of the L1 families are only a few million years old. Accordingly it has been suggested either that the extensive amplification that characterizes present-day L1 families did not occur in the past or that old members were removed as new ones were generated. However, we show here that an ancestral rodent L1 family was extensively amplified approximately 10 million years ago and that the relics (approximately 60,000 copies) of this amplification have persisted in modern murine genomes (Old World rats and mice). This amplification occurred just before the divergence of modern murine genera from their common ancestor and identifies the murine node in the lineage of modern muroid rodents. Our results suggest that repeated amplification of L1 elements is a feature of the evolution of mammalian genomes and that ancestral amplification events could provide a useful tool for determining mammalian lineages.
Collapse
Affiliation(s)
- E Pascale
- Laboratory of Biochemical Pharmacology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institute of Health, Bethesda, MD 20892
| | | | | |
Collapse
|
13
|
Belmaaza A, Wallenburg JC, Brouillette S, Gusew N, Chartrand P. Genetic exchange between endogenous and exogenous LINE-1 repetitive elements in mouse cells. Nucleic Acids Res 1990; 18:6385-91. [PMID: 1978749 PMCID: PMC332516 DOI: 10.1093/nar/18.21.6385] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The repetitive LINE (L1) elements of the mouse, which are present at about 10(5) copies per genome and share over 80% of sequence homology, were examined for their ability to undergo genetic exchange with exogenous L1 sequences. The exogenous L1 sequences, carried by a shuttle vector, consisted of an internal fragment from L1Md-A2, a previously described member of the L1 family of the mouse. Using an assay that does not require the reconstitution of a selectable marker we found that this vector, in either circular or linear form, acquired DNA sequences from endogenous L1 elements at a frequency of 10(-3) to 10(-4) per rescued vector. Physical analysis of the acquired L1 sequences revealed that distinct endogenous L1 elements acted as donors and that different subfamilies participated. These results demonstrate that L1 elements are readily capable of genetic exchange. Apart from gene conversion events, the acquisition of L1 sequences outside the region of homology suggested that a second mechanism was also involved in the genetic exchange. A model which accounts for this mechanism is presented and its potential implication on the rearrangement of L1 elements is discussed.
Collapse
Affiliation(s)
- A Belmaaza
- Canadian Red Cross Society, Research and Development, Montreal Centre, Quebec
| | | | | | | | | |
Collapse
|
14
|
Vaury C, Abad P, Pelisson A, Lenoir A, Bucheton A. Molecular characteristics of the heterochromatic I elements from a reactive strain of Drosophila melanogaster. J Mol Evol 1990; 31:424-31. [PMID: 2175814 DOI: 10.1007/bf02106056] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
There are two categories of strains in Drosophila melanogaster with respect to the I-R system of hybrid dysgenesis. The inducer strains contain particular transposable elements named I factors. They are not present in the strains of the other category called reactive (R) strains. Defective I elements are present in the pericentromeric regions of both categories of strains. This last subfamily of I sequences has not yet been described in detail and little is known about its origin. In this paper, we report that the defective I elements display an average of 94% of sequence identity with each other and with the transposable I factor. The results suggest that they cannot be the progenitors of the present day I factors, but that each of these two subfamilies started to evolve independently several million years ago. Furthermore, the sequence comparison of these I elements with an active I factor from Drosophila teissieri provides useful information about when the deleted I elements became immobilized.
Collapse
Affiliation(s)
- C Vaury
- Laboratoire de Génétique, Université Blaise Pascal, Aubière, France
| | | | | | | | | |
Collapse
|
15
|
Yokota H, Oishi M. Differential cloning of genomic DNA: cloning of DNA with an altered primary structure by in-gel competitive reassociation. Proc Natl Acad Sci U S A 1990; 87:6398-402. [PMID: 2385598 PMCID: PMC54541 DOI: 10.1073/pnas.87.16.6398] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
A procedure was developed for cloning (anonymous) DNA sequences whose primary structures differ between two DNA samples. The procedure is based upon in-gel competitive DNA reassociation after electrophoresis of a mixture of restriction enzyme-digested target DNA (from which clones are to be isolated) and a large excess of unclonable reference DNA (competitor DNA). Inclusion of polyethylene glycol in the reassociation buffer greatly improved the in-gel reassociation efficiency, which was critical for the practical use of the procedure. Using this technique, we obtained several clones from rat brain (target) DNA, which may have been derived from tissue (brain)-specific altered DNA structures. The details of this procedure and its possible applications are discussed.
Collapse
Affiliation(s)
- H Yokota
- Institute of Applied Microbiology, University of Tokyo, Japan
| | | |
Collapse
|
16
|
Hohjoh H, Minakami R, Sakaki Y. Selective cloning and sequence analysis of the human L1 (LINE-1) sequences which transposed in the relatively recent past. Nucleic Acids Res 1990; 18:4099-104. [PMID: 2165587 PMCID: PMC331165 DOI: 10.1093/nar/18.14.4099] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
L1 (LINE-1), a long interspersed repetitive DNA family of mammalian genomes, is thought to be a sequence family derived from a retrotransposon-like element(s), but its actively transposable unit(s) has not been identified yet. We developed a novel method for selective isolation of the human L1 sequences which transposed in a relatively recent past and may have still retained a feature of the 'active L1' unit. From the inspection of the nucleotide sequences, we conjectured that the 'active L1' or 'nearly active L1' units should have a high content of the CpG dinucleotide sequence, a mutation hot spot sequence, and contain several sites for rare cutters such as BssH II and Nar I at their 5' terminal regions. Using these rare cutter sites as selection markers, the L1 sequences were isolated, which had the high content of CpG at the 5' terminal regions and over 90% homology to L1 transcripts found in a human teratocarcinoma cell line. These L1s were shown to be 'relatively new L1' units which had integrated into chromosomes within these several million years during evolution. From the sequence data of these L1s and L1 cDNA, a consensus sequence of the 5' terminal region of high CpG L1s were constructed. A region of the consensus sequence showed about 69% homology to the 5' terminal region of Drosophila jockey element.
Collapse
Affiliation(s)
- H Hohjoh
- Research Laboratory for Genetic Information, Kyushu University, Fukuoka, Japan
| | | | | |
Collapse
|
17
|
Sibbald PR, Blencowe BJ. A completely conserved rat U6 snRNA pseudogene coding sequence is sandwiched between a cytochrome c retropseudogene and a LINE-like sequence. Nucleic Acids Res 1990; 18:1063. [PMID: 2156224 PMCID: PMC330380 DOI: 10.1093/nar/18.4.1063] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Affiliation(s)
- P R Sibbald
- European Molecular Biology Laboratory, Heidelberg, FRG
| | | |
Collapse
|
18
|
Yokota H, Iwasaki T, Takahashi M, Oishi M. A tissue-specific change in repetitive DNA in rats. Proc Natl Acad Sci U S A 1989; 86:9233-7. [PMID: 2594763 PMCID: PMC298468 DOI: 10.1073/pnas.86.23.9233] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
From a genomic library constructed from an EcoRI digest of Wistar rat brain DNA, we isolated a clone (BL-1) that gave a 0.6-kilobase restriction fragment only in brain (or lens) DNA upon Southern hybridization. The tissue-specific fragment was present in Wistar and other strains (Sprague-Dawley and Donryu) of rats regardless of their sex and age. Sequencing of the clone indicated that it is closely related to a part of the LINE 3 sequence, one of the highly repetitive sequences present throughout mammalian genomes. Polymerase chain reaction using primer sequences in the BL-1 clone indicated that it is derived from an amplified (rear-ranged) sequence, although other explanations are possible. These results suggest that there are tissue-specific changes in DNA primary structure during mammalian developmental processes.
Collapse
Affiliation(s)
- H Yokota
- Furusawa Morpho Gene Project, Research Development Corporation of Japan, Ibaragi
| | | | | | | |
Collapse
|
19
|
Abad P, Vaury C, Pélisson A, Chaboissier MC, Busseau I, Bucheton A. A long interspersed repetitive element--the I factor of Drosophila teissieri--is able to transpose in different Drosophila species. Proc Natl Acad Sci U S A 1989; 86:8887-91. [PMID: 2554335 PMCID: PMC298395 DOI: 10.1073/pnas.86.22.8887] [Citation(s) in RCA: 51] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Long interspersed repetitive elements (LINEs) are transposable elements present in many species. In mammals they are difficult to study because most of them are defective and their transposition frequency is low. The I factor of Drosophila melanogaster is a LINE element that is particularly interesting because its transposition occurs at high frequency during I-R hybrid dysgenesis. This phenomenon occurs when males from the class of inducer strains are crossed with females from the class of reactive strains. Inducer strains contain several complete 5.4-kilobase I factors at various sites on the chromosomal arms. Reactive strains are devoid of complete I factors. Many results indicate that active I factors have invaded the D. melanogaster genome recently. To study the evolutionary history of I elements, we have cloned and sequenced a potentially active I factor from Drosophila teissieri. It is flanked by a target-site duplication and terminates at the 3' end by tandem repeats of the sequence TAA. When introduced into the germ line of a reactive strain of D. melanogaster by P element-mediated transformation, it is able to transpose and induces hybrid dysgenesis. This strengthens the hypothesis of a recent reinvasion of the D. melanogaster genome by active I factors giving rise to the inducer strains. They could have originated by horizontal transfer from another species. Such events also could occur for other LINE elements and might explain the spread of new variants in mammalian genomes. Moreover, the results give a further insight into I factor functional organization.
Collapse
Affiliation(s)
- P Abad
- Station de Recherches de Nématologie et de Génétique Moléculaire des Invertébrés, Institut National de la Recherche Agronomique, Antibes, France
| | | | | | | | | | | |
Collapse
|
20
|
Abstract
Members of two related families of transposable elements, Tx1 and Tx2, were isolated from the genome of Xenopus laevis and characterized. In both families, two versions of the elements were found. The smaller version in each family (Tx1d and Tx2d) consisted largely of two types of 400-base-pair tandem internal repeats. These elements had discrete ends and short inverted terminal repeats characteristic of mobile DNAs that are presumed to move via DNA intermediates, e.g., Drosophila P and maize Ac elements. The longer versions (Tx1c and Tx2c) differed from Tx1d and Tx2d by the presence of a 6.9-kilobase-pair internal segment that included two long open reading frames (ORFs). ORF1 had one cysteine-plus-histidine-rich sequence of the type found in retroviral gag proteins. ORF2 showed more substantial homology to retroviral pol genes and particularly to the analogs of pol found in a subclass of mobile DNAs that are supposed retrotransposons, such as mammalian long interspersed repetitive sequences, Drosophila I factors, silkworm R1 elements, and trypanosome Ingi elements. Thus, the Tx1 elements present a paradox by exhibiting features of two classes of mobile DNAs that are thought to have very different modes of transposition. Two possible resolutions are considered: (i) the composite versions are actually made up of two independent elements, one of the retrotransposon class, which has a high degree of specificity for insertion into a target within the other, P-like element; and (ii) the composite elements are intact, autonomous mobile DNAs, in which the pol-like gene product collaborates with the terminal inverted repeats to cause transposition of the entire unit.
Collapse
|
21
|
Busseau I, Pelisson A, Bucheton A. Characterization of 5' truncated transposed copies of the I factor in Drosophila melanogaster. Nucleic Acids Res 1989; 17:6939-45. [PMID: 2550903 PMCID: PMC318424 DOI: 10.1093/nar/17.17.6939] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
I factors in Drosophila melanogaster are transposable elements structurally related to Mammalian LINEs. Their transposition is activated at high frequencies during I-R hybrid dysgenesis and is associated with the production of mutations of various sorts. Very few of these mutations have been studied at the molecular level; those reported so far result either from chromosomal rearrangements or from insertions of complete I factors. We have analysed three I-R induced yellow mutations and have found that one of them is due to the insertion of an I element very similar to the complete I factor, whereas the other two are due to insertions of I elements that are truncated at their 5' ends; one of them exhibits an unusual 3' end. We discuss possible mechanisms of production of such modified I elements.
Collapse
Affiliation(s)
- I Busseau
- Laboratoire de Génétique, Unité associée au CNRS no. 360, Université Blaise Pascal, Aubiere, France
| | | | | |
Collapse
|
22
|
Demers GW, Matunis MJ, Hardison RC. The L1 family of long interspersed repetitive DNA in rabbits: sequence, copy number, conserved open reading frames, and similarity to keratin. J Mol Evol 1989; 29:3-19. [PMID: 2475641 PMCID: PMC7087506 DOI: 10.1007/bf02106177] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The L1 family of long interspersed repetitive DNA in the rabbit genome (L1Oc) has been studied by determining the sequence of the five L1 repeats in the rabbit beta-like globin gene cluster and by hybridization analysis of other L1 repeats in the genome. L1Oc repeats have a common 3' end that terminates in a poly A addition signal and an A-rich tract, but individual repeats have different 5' ends, indicating a polar truncation from the 5' end during their synthesis or propagation. As a result of the polar truncations, the 5' end of L1Oc is present in about 11,000 copies per haploid genome, whereas the 3' end is present in at least 66,000 copies per haploid genome. One type of L1Oc repeat has internal direct repeats of 78 bp in the 3' untranslated region, whereas other L1Oc repeats have only one copy of this sequence. The longest repeat sequenced, L1Oc5, is 6.5 kb long, and genomic blot-hybridization data using probes from the 5' end of L1Oc5 indicate that a full length L1Oc repeat is about 7.5 kb long, extending about 1 kb 5' to the sequenced region. The L1Oc5 sequence has long open reading frames (ORFs) that correspond to ORF-1 and ORF-2 described in the mouse L1 sequence. In contrast to the overlapping reading frames seen for mouse L1, ORF-1 and ORF-2 are in the same reading frame in rabbit and human L1s, resulting in a discistronic structure. The region between the likely stop codon for ORF-1 and the proposed start codon for ORF-2 is not conserved in interspecies comparisons, which is further evidence that this short region does not encode part of a protein. ORF-1 appears to be a hybrid of sequences, of which the 3' half is unique to and conserved in mammalian L1 repeats. The 5' half of ORF-1 is not conserved between mammalian L1 repeats, but this segment of L1Oc is related significantly to type II cytoskeletal keratin.
Collapse
Affiliation(s)
- G W Demers
- Department of Molecular and Cell Biology, The Pennsylvania State University, University Park 16802
| | | | | |
Collapse
|
23
|
Nur I, Pascale E, Furano AV. The left end of rat L1 (L1Rn, long interspersed repeated) DNA which is a CpG island can function as a promoter. Nucleic Acids Res 1988; 16:9233-51. [PMID: 2459662 PMCID: PMC338703 DOI: 10.1093/nar/16.19.9233] [Citation(s) in RCA: 60] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Here we report that the 600 bp promoter-like region at the left end of a newly isolated and characterized rat L1 DNA element can activate the prokaryotic chloramphenicol acyltransferase gene in a rat cell line. Activation only occurs when the promoter region is oriented to the transferase gene as it is to the L1 protein encoding sequences and is 75% inhibited by methylation of just 5 of the 22 CpGs present in the promoter. The G + C rich promoter contains enough CpGs to qualify it as a CpG island, but in contrast to other CpG islands, genomic L1 promoters are fully methylated in both somatic cell and sperm DNA as judged by restriction enzyme analysis. Partial demethylation of the genomic promoters by treatment with 5-azacytidine failed to produce discrete L1 transcripts. The relationship of methylation to the evolutionary history and fate of the rat L1 promoter is discussed.
Collapse
Affiliation(s)
- I Nur
- Section on Genomic Structure and Function, National Institute of Diabetes, and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | | | |
Collapse
|
24
|
Crozatier M, Vaury C, Busseau I, Pelisson A, Bucheton A. Structure and genomic organization of I elements involved in I-R hybrid dysgenesis in Drosophila melanogaster. Nucleic Acids Res 1988; 16:9199-213. [PMID: 2845368 PMCID: PMC338701 DOI: 10.1093/nar/16.19.9199] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
I-R hybrid dysgenesis in D. melanogaster is controlled by transposable elements known as I factors which terminate at their 3' ends by an A-rich sequence. Inducer strains contain active I factors. Both reactive and inducer stocks possess defective I elements. We have cloned various I elements from both categories of strains. The I elements having recently transposed in inducer strains have a structure closely related to that of active I factors. However we have isolated one such I element that is truncated at its 5' end. The I elements common to reactive and inducer strains are affected by various rearrangements and many point mutations. They do not appear to be simple derivatives of complete I factors.
Collapse
Affiliation(s)
- M Crozatier
- Unité Associée au CNRS no. 360, Université Blaise Pascal, Aubière, France
| | | | | | | | | |
Collapse
|
25
|
Furano AV, Robb SM, Robb FT. The structure of the regulatory region of the rat L1 (L1Rn, long interspersed repeated) DNA family of transposable elements. Nucleic Acids Res 1988; 16:9215-31. [PMID: 2845369 PMCID: PMC338702 DOI: 10.1093/nar/16.19.9215] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Here we report the DNA structure of the left 1.5 kb of two newly isolated full length members of the rat L1 DNA family (L1Rn, long interspersed repeated DNA). In contrast to earlier isolated rat L1 members, both of these contain promoter-like regions that are most likely full length. In addition, the promoter-like region of both members has undergone a partial tandem duplication. A second internal region of the left end of one of the reported members is also tandemly duplicated. The propensity of the left end of rat L1 elements to undergo this form of genetic rearrangement, as well as other structural features revealed by the present work, is discussed in light of the fact that during evolution the otherwise conserved mammalian L1 DNA families have each acquired completely different promoter-like regions. In an accompanying paper [Nur, I., Pascale, E., and Furano, A. V. (1988) Nucleic Acids Res. 16, submitted], we report that one of the rat promoter-like regions can function as a promoter in rat cells when fused to the Escherichia coli chloramphenicol acyltransferase gene.
Collapse
Affiliation(s)
- A V Furano
- Section on Genomic Structure and Function, National Institute of Diabetes, and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | | | |
Collapse
|
26
|
Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Collapse
|
27
|
Lloyd JA, Potter SS. Distinct subfamilies of primate L1Gg retroposons, with some elements carrying tandem repeats in the 5' region. Nucleic Acids Res 1988; 16:6147-56. [PMID: 3399387 PMCID: PMC336853 DOI: 10.1093/nar/16.13.6147] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Two subfamilies of L1 elements, differing dramatically in the first 1.2 kb of sequence at their 5' ends, were identified in the prosimian primate, Galago garnetti. Interesting patterns of sequence similarity were observed between the galago subfamilies, and with the L1s from human and from another prosimian, the slow loris. Furthermore, members of one of the subfamilies have six to eight tandemly repeated units of 73 bp, starting about 730 bp from their 5' ends. Such tandem repeats have not been reported in other primate L1s, but a striking sequence similarity was found between the galago tandem repeats and those previously described at the 5' termini of some mouse L1s [Loeb, D. D. et al. Mol. Cell. Biol. 6, 168-182, 1986]. Although the similar sequence indicates a shared, conserved function, the galago repeats are sub-terminal and therefore cannot serve as portable RNA polymerase II promoters, as has been suggested for the mouse tandem repeats.
Collapse
Affiliation(s)
- J A Lloyd
- Children's Hospital Research Foundation, Cincinnati, OH 45229
| | | |
Collapse
|
28
|
Usdin K, Furano AV. Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA. Proc Natl Acad Sci U S A 1988; 85:4416-20. [PMID: 2837766 PMCID: PMC280440 DOI: 10.1073/pnas.85.12.4416] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed.
Collapse
Affiliation(s)
- K Usdin
- Section on Genomic Structure and Function, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892
| | | |
Collapse
|
29
|
Skowronski J, Fanning TG, Singer MF. Unit-length line-1 transcripts in human teratocarcinoma cells. Mol Cell Biol 1988; 8:1385-97. [PMID: 2454389 PMCID: PMC363295 DOI: 10.1128/mcb.8.4.1385-1397.1988] [Citation(s) in RCA: 134] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Collapse
Affiliation(s)
- J Skowronski
- Laboratory of Biochemistry, National Cancer Institute, Bethesda, Maryland 20892
| | | | | |
Collapse
|
30
|
6;7 chromosomal translocation in spontaneously arising rat immunocytomas: evidence for c-myc breakpoint clustering and correlation between isotypic expression and the c-myc target. Mol Cell Biol 1988. [PMID: 3122026 DOI: 10.1128/mcb.8.1.441] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Our previous studies have shown that spontaneously arising immunocytomas in the LOU/Ws1 strain of rats contain a t(6;7) chromosomal translocation in all seven tumors studied (F. M. Babonits, J. Spira, G. Klein, and H. Bazin, Int. J. Cancer 29:431-437, 1982). We have also shown that the c-myc is located on chromosome 7 (J. Sümegi, J. Spira, H. Bazin, J. Szpirer, G. Levan, and G. Klein, Nature (London) 306:497-499, 1983) and the immunoglobulin H cluster on chromosome 6 (W.S. Pear, G. Wahlström, J. Szpirer, G. Levan, G. Klein, and J. Sümegi, Immunogenetics 23:393-395, 1986). We now report a detailed cytogenetic and molecular analysis of nine additional rat immunocytomas. The t(6;7) chromosomal translocation is found in all tumors. Mapping of the c-myc breakpoints showed that in 10 of 14 tumors, the c-myc breakpoints are clustered in a 1.5-kilobase region upstream of exon 1. In contrast with sporadic Burkitt's lymphoma and mouse plasmacytoma, only 1 of 14 tumors contains the c-myc breakpoints in either exon 1 or intron 1. Analysis of the sequences juxtaposed to the c-myc show that immunoglobulin H switch regions are the targets in at least five tumors and that there is a strong correlation between the secreted immunoglobulin and the c-myc target. Unlike sporadic Burkitt's lymphoma and mouse plasmacytoma, at least two rat immunocytomas show recombination of the c-myc with sequences distinct from immunoglobulin switch regions.
Collapse
|
31
|
Pear WS, Wahlström G, Nelson SF, Axelson H, Szeles A, Wiener F, Bazin H, Klein G, Sümegi J. 6;7 chromosomal translocation in spontaneously arising rat immunocytomas: evidence for c-myc breakpoint clustering and correlation between isotypic expression and the c-myc target. Mol Cell Biol 1988; 8:441-51. [PMID: 3122026 PMCID: PMC363146 DOI: 10.1128/mcb.8.1.441-451.1988] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Our previous studies have shown that spontaneously arising immunocytomas in the LOU/Ws1 strain of rats contain a t(6;7) chromosomal translocation in all seven tumors studied (F. M. Babonits, J. Spira, G. Klein, and H. Bazin, Int. J. Cancer 29:431-437, 1982). We have also shown that the c-myc is located on chromosome 7 (J. Sümegi, J. Spira, H. Bazin, J. Szpirer, G. Levan, and G. Klein, Nature (London) 306:497-499, 1983) and the immunoglobulin H cluster on chromosome 6 (W.S. Pear, G. Wahlström, J. Szpirer, G. Levan, G. Klein, and J. Sümegi, Immunogenetics 23:393-395, 1986). We now report a detailed cytogenetic and molecular analysis of nine additional rat immunocytomas. The t(6;7) chromosomal translocation is found in all tumors. Mapping of the c-myc breakpoints showed that in 10 of 14 tumors, the c-myc breakpoints are clustered in a 1.5-kilobase region upstream of exon 1. In contrast with sporadic Burkitt's lymphoma and mouse plasmacytoma, only 1 of 14 tumors contains the c-myc breakpoints in either exon 1 or intron 1. Analysis of the sequences juxtaposed to the c-myc show that immunoglobulin H switch regions are the targets in at least five tumors and that there is a strong correlation between the secreted immunoglobulin and the c-myc target. Unlike sporadic Burkitt's lymphoma and mouse plasmacytoma, at least two rat immunocytomas show recombination of the c-myc with sequences distinct from immunoglobulin switch regions.
Collapse
Affiliation(s)
- W S Pear
- Department of Tumor Biology, Karolinska Institute, Stockholm, Sweden
| | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Wincker P, Jubier-Maurin V, Roizès G. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies. Nucleic Acids Res 1987; 15:8593-606. [PMID: 3684566 PMCID: PMC306393 DOI: 10.1093/nar/15.21.8593] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies.
Collapse
Affiliation(s)
- P Wincker
- CRBM-L.P 8402 CNRS, U.249 INSERM, Institut de Biologie, Montpellier, France
| | | | | |
Collapse
|
33
|
Wallenburg JC, Nepveu A, Chartrand P. Integration of a vector containing rodent repetitive elements in the rat genome. Nucleic Acids Res 1987; 15:7849-63. [PMID: 2823220 PMCID: PMC306312 DOI: 10.1093/nar/15.19.7849] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
We have previously shown that integration of a polyoma vector containing rodent repetitive elements into rat cellular DNA is non-random (Wallenburg et al. J. Virol. 50: 678-683). Junctions between the polyoma vector and the host DNA occur in the repetitive sequences of the vector about ten times more frequently than would be expected if sequences from the vector were used randomly for integration. In this paper we looked at the host sequences involved in these junctions. Our analysis did not reveal any repetitive or specific sequences and we presume therefore that the repetitive sequences of the vector acted as hot spots for illegitimate recombination. We also analysed the integration mechanism and found that: First, even though the polyoma vector was transfected in the presence of carrier DNA, integration did not involve the formation of a transgenome. Second, in at least one of the clones analysed, integration resulted in deletion of host DNA sequences. Third, the host DNA displaced at the integration site was considerably longer than the integrated segment.
Collapse
Affiliation(s)
- J C Wallenburg
- Département de microbiologie, Faculté de médecine, Centre Hospitalier Universitaire, Sherbrooke, Québec, Canada
| | | | | |
Collapse
|
34
|
Jubier-Maurin V, Wincker P, Cuny G, Roizès G. The relationships between the 5' end repeats and the largest members of the L1 interspersed repeated family in the mouse genome. Nucleic Acids Res 1987; 15:7395-410. [PMID: 3658697 PMCID: PMC306256 DOI: 10.1093/nar/15.18.7395] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Analysis of a few large L1 elements has revealed two types of tandem repeats at the 5' end: A and F. In this study, the relationships between these repeats and a series of large L1 elements has been analysed. Most of cloned L1 repeats were shown to lack either A or F sequences at their 5' ends. F sequences are found less frequently associated than A sequences to the 5' ends of L1 and an evolutionary comparison shows that the A type was introduced more recently during the evolution of the mouse genome than the F type.
Collapse
Affiliation(s)
- V Jubier-Maurin
- CNRS (CRBM-LP 8402), INSERM U.249, Institut de Biologie, Montpellier, France
| | | | | | | |
Collapse
|
35
|
d'Ambrosio E, Furano AV. DNA synthesis arrest sites at the right terminus of rat long interspersed repeated (LINE or L1Rn) DNA family members. Nucleic Acids Res 1987; 15:3155-75. [PMID: 2436148 PMCID: PMC340917 DOI: 10.1093/nar/15.7.3155] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
An approximately equal to 150-bp GC-rich (approximately equal to 60%) region is at the right end of rat long interspersed repeated DNA (LINE or L1Rn) family members. We report here that one of the DNA strands from this region contains several non-palindromic sites that strongly arrest DNA synthesis in vitro by the prokaryotic Klenow and T4 DNA polymerases, the eukaryotic alpha polymerase, and AMV reverse transcriptase. The strongest arrest sites are G-rich (approximately equal to 70%) homopurine stretches of 18 or more residues. Shorter homopurine stretches (12 residues or fewer) did not arrest DNA synthesis even if the stretch contains 11/12 G residues. Arrest of the prokaryotic polymerases was not affected by their respective single strand binding proteins or polymerase accessory proteins. The region of duplex DNA which contains DNA synthesis arrest sites reacts with bromoacetaldehyde when present in negatively supercoiled molecules. By contrast, homopurine stretches that do not arrest DNA synthesis do not react with bromoacetaldehyde. The presence of bromoacetaldehyde-reactive bases in a G-rich homopurine-containing duplex under torsional stress is thought to be caused by base stacking in the homopurine strand. Therefore, we suggest that base-stacked regions of the template arrest DNA synthesis.
Collapse
|
36
|
Furano AV, Somerville CC, Tsichlis PN, D'Ambrosio E. Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random. Nucleic Acids Res 1986; 14:3717-27. [PMID: 3012480 PMCID: PMC339810 DOI: 10.1093/nar/14.9.3717] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition.
Collapse
|