151
|
Alisch RS, Garcia-Perez JL, Muotri AR, Gage FH, Moran JV. Unconventional translation of mammalian LINE-1 retrotransposons. Genes Dev 2006; 20:210-24. [PMID: 16418485 PMCID: PMC1356112 DOI: 10.1101/gad.1380406] [Citation(s) in RCA: 135] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Long Interspersed Element-1 (LINE-1 or L1) retrotransposons encode proteins required for their mobility (ORF1p and ORF2p), yet little is known about how L1 mRNA is translated. Here, we show that ORF2 translation generally initiates from the first in-frame methionine codon of ORF2, and that both ORF1 and the inter-ORF spacer are dispensable for ORF2 translation. Remarkably, changing the ORF2 AUG codon to any other coding triplet is compatible with retrotransposition. However, introducing a premature termination codon in ORF1 or a thermostable hairpin in the inter-ORF spacer reduces ORF2p translation or L1 retrotransposition to approximately 5% of wild-type levels. Similar data obtained from "natural" and codon optimized "synthetic" mouse L1s lead us to propose that ORF2 is translated by an unconventional termination/reinitiation mechanism.
Collapse
Affiliation(s)
- Reid S Alisch
- Department of Human Genetics and Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan 48109-0618, USA
| | | | | | | | | |
Collapse
|
152
|
Piskareva O, Schmatchenko V. DNA polymerization by the reverse transcriptase of the human L1 retrotransposon on its own template in vitro. FEBS Lett 2006; 580:661-8. [PMID: 16412437 DOI: 10.1016/j.febslet.2005.12.077] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2005] [Revised: 12/02/2005] [Accepted: 12/27/2005] [Indexed: 10/25/2022]
Abstract
L1 elements (LINE-1s) account for 17% of the human genome and have achieved this abundance by transpositions via an RNA intermediate, or retrotransposition. Reverse transcription is a crucial event in the retrotransposition of the active human L1 element and is carried out by the L1-encoded ORF2 protein. Previously, we performed biochemical characterization of the human L1 ORF2 protein with reverse transcriptase (RT) activity (referred to as L1 RT), expressed in baculovirus-infected insect cells. In the present study, we describe the properties of DNA- and RNA-dependent DNA synthesis catalyzed by the L1 RT on the L1 templates in vitro. We found that L1 RT synthesized at least 620 of nucleotides per template binding event utilizing L1 RNA in vitro. Under processive conditions the L1 RT synthesized cDNA over 5 times longer than that Moloney murine leukemia virus RT on the heteropolymeric RNA template used in these studies. These data are the first to demonstrate that RT from the human L1 element is a highly processive polymerase among RT enzymes. This report also presents a strong evidence of lack of RNase H activity for the L1 ORF2 protein in vitro, distinguishing L1 RT from retroviral RTs. Finally, we found strong pausing for of the L1 RT during DNA polymerization within the 3' untranslated region of L1 mRNA, that is result of contribution both rGs runs of the polypurine stretch and immediately adjacent stem-loop structure. A mechanism facilitating minus-strand DNA synthesis during reverse transcription of L1 element in vivo is discussed.
Collapse
Affiliation(s)
- Olga Piskareva
- Institute of Biochemistry and Physiology of Microorganisms RAS Pushchino, Prosoekt Nauki 5, 142290 Pushchino, Moscow region, Russia
| | | |
Collapse
|
153
|
Babushok DV, Ostertag EM, Courtney CE, Choi JM, Kazazian HH. L1 integration in a transgenic mouse model. Genome Res 2005; 16:240-50. [PMID: 16365384 PMCID: PMC1361720 DOI: 10.1101/gr.4571606] [Citation(s) in RCA: 90] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
To study integration of the human LINE-1 retrotransposon (L1) in vivo, we developed a transgenic mouse model of L1 retrotransposition that displays de novo somatic L1 insertions at a high frequency, occasionally several insertions per mouse. We mapped 3' integration sites of 51 insertions by Thermal Asymmetric Interlaced PCR (TAIL-PCR). Analysis of integration locations revealed a broad genomic distribution with a modest preference for intergenic regions. We characterized the complete structures of 33 de novo retrotransposition events. Our results highlight the large number of highly truncated L1s, as over 52% (27/51) of total integrants were <1/3 the length of a full-length element. New integrants carry all structural characteristics typical of genomic L1s, including a number with inversions, deletions, and 5'-end microhomologies to the target DNA sequence. Notably, at least 13% (7/51) of all insertions contain a short stretch of extra nucleotides at their 5' end, which we postulate result from template-jumping by the L1-encoded reverse transcriptase. We propose a unified model of L1 integration that explains all of the characteristic features of L1 retrotransposition, such as 5' truncations, inversions, extra nucleotide additions, and 5' boundary and inversion point microhomologies.
Collapse
Affiliation(s)
- Daria V Babushok
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | | | | | | |
Collapse
|
154
|
Khan H, Smit A, Boissinot S. Molecular evolution and tempo of amplification of human LINE-1 retrotransposons since the origin of primates. Genome Res 2005; 16:78-87. [PMID: 16344559 PMCID: PMC1356131 DOI: 10.1101/gr.4001406] [Citation(s) in RCA: 253] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
We investigated the evolution of the families of LINE-1 (L1) retrotransposons that have amplified in the human lineage since the origin of primates. We identified two phases in the evolution of L1. From approximately 70 million years ago (Mya) until approximately 40 Mya, three distinct L1 lineages were simultaneously active in the genome of ancestral primates. In contrast, during the last 40 million years (Myr), i.e., during the evolution of anthropoid primates, a single lineage of families has evolved and amplified. We found that novel (i.e., unrelated) regulatory regions (5'UTR) have been frequently recruited during the evolution of L1, whereas the two open-reading frames (ORF1 and ORF2) have remained relatively conserved. We found that L1 families coexisted and formed independently evolving L1 lineages only when they had different 5'UTRs. We propose that L1 families with different 5'UTR can coexist because they don't rely on the same host-encoded factors for their transcription and therefore do not compete with each other. The most prolific L1 families (families L1PA8 to L1PA3) amplified between 40 and 12 Mya. This period of high activity corresponds to an episode of adaptive evolution in a segment of ORF1. The correlation between the high activity of L1 families and adaptive evolution could result from the coevolution of L1 and a host-encoded repressor of L1 activity.
Collapse
Affiliation(s)
- Hameed Khan
- Department of Biology, Queens College, the City University of New York, Flushing, New York 11367, USA
| | | | | |
Collapse
|
155
|
Buzdin A, Vinogradova T, Lebedev Y, Sverdlov E. Genome-wide experimental identification and functional analysis of human specific retroelements. Cytogenet Genome Res 2005; 110:468-74. [PMID: 16093700 DOI: 10.1159/000084980] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2003] [Accepted: 12/18/2003] [Indexed: 12/24/2022] Open
Abstract
Retroelements (REs) actively reshape genomes through genomic rearrangements, creation of new genes and modulation of the regulatory machinery of existing genes, thus introducing genomic novelties which potentially may be subject to natural selection. Thousands of RE integrations, presumably distinguishing the human and chimpanzee genomes, might well be involved in modern human speciation. In this self-review we describe our recent results on genome-wide identification of human specific RE integrations and their transcriptional activity obtained with three new experimental techniques (TGDA, DiffIR and SDDIR) developed by us for such studies. A new mechanism of the formation of retroelements involving template switches during L1-mediated mRNA reverse transcription, revealed in this research, will also be described in the review.
Collapse
Affiliation(s)
- A Buzdin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | | | | | | |
Collapse
|
156
|
Boissinot S, Furano AV. The recent evolution of human L1 retrotransposons. Cytogenet Genome Res 2005; 110:402-6. [PMID: 16093692 DOI: 10.1159/000084972] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2003] [Accepted: 11/07/2003] [Indexed: 11/19/2022] Open
Abstract
L1 elements are the most successful retrotransposons in mammals and are responsible for at least 30% of human DNA. Far from being indolent genomic parasites, L1 elements have evolved and amplified rapidly during human evolution. Indeed during just the last 25 million years (MY) five distinct L1 families have emerged and generated tens of thousands of copies. The most recently evolved human specific L1 family is currently active and L1 copies have been accumulating in the human genome at about the same rate per generation as the currently active L1 families in Old World rats and mice. At times during the last 25 MY L1 activity constituted a significant enough genetic load to be subject to negative selection. During these same times, and in apparent response to the host, L1 underwent adaptive evolution. Understanding the molecular basis for these evolutionary changes should help illuminate one of the least understood but most important aspects of L1 biology, namely the extent and nature of the interaction between L1 and its host.
Collapse
Affiliation(s)
- S Boissinot
- Department of Biology, Queens College, CUNY, Flushing, NY 11367-1597, USA.
| | | |
Collapse
|
157
|
Martin SL, Li WLP, Furano AV, Boissinot S. The structures of mouse and human L1 elements reflect their insertion mechanism. Cytogenet Genome Res 2005; 110:223-8. [PMID: 16093676 DOI: 10.1159/000084956] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2003] [Accepted: 02/20/2004] [Indexed: 11/19/2022] Open
Abstract
L1 is an abundant, interspersed repeated DNA element of mammalian genomes. It has achieved its high copy number via retrotransposition. Like other non-LTR retrotransposons, L1 insertion into chromosomal DNA apparently occurs by target-site primed reverse transcription, or TPRT. L1 retrotransposition often generates elements with 5' truncations that are flanked by a duplication of the genomic target site (TSD). It is typically assumed that the 5' truncated elements are the consequence of poor processivity of the L1 reverse transcriptase. However, we find that the majority of young L1 elements from both the human and mouse genomes are truncated at sequences that can basepair with the target site. Thus, to whatever extent truncation is a consequence of poor processivity, we suggest that truncation is likely to occur when target site sequence can basepair with L1 sequence. This finding supports a model for insertion that occurs by two sequential TPRT reactions, the second of which relies upon the homology between the target site and L1. Because perfect heteroduplex formation is not required for all insertions, a dynamic relationship between the primer, template and enzyme during reverse transcription is inferred. 5' truncation may be a successful evolutionary strategy that is exploited by L1 as a means to escape host suppression of transposition.
Collapse
Affiliation(s)
- S L Martin
- Department of Cell and Developmental Biology, University of Colorado School of Medicine, Denver, CO 80045, USA.
| | | | | | | |
Collapse
|
158
|
Belshaw R, Dawson ALA, Woolven-Allen J, Redding J, Burt A, Tristem M. Genomewide screening reveals high levels of insertional polymorphism in the human endogenous retrovirus family HERV-K(HML2): implications for present-day activity. J Virol 2005; 79:12507-14. [PMID: 16160178 PMCID: PMC1211540 DOI: 10.1128/jvi.79.19.12507-12514.2005] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2005] [Accepted: 07/01/2005] [Indexed: 11/20/2022] Open
Abstract
The published human genome sequence contains many thousands of endogenous retroviruses (HERVs) but all are defective, containing nonsense mutations or major deletions. Only the HERV-K(HML2) family has been active since the divergence of humans and chimpanzees; it contains many members that are human specific, as well as several that are insertionally polymorphic (an inserted element present only in some human individuals). Here we perform a genomewide survey of insertional polymorphism levels in this family by using the published human genome sequence and a diverse sample of 19 humans. We find that there are 113 human-specific HERV-K(HML2) elements in the human genome sequence, 8 of which are insertionally polymorphic (11 if we extrapolate to those within regions of the genome that were not suitable for amplification). The average rate of accumulation since the divergence with chimpanzees is thus approximately 3.8 x 10(-4) per haploid genome per generation. Furthermore, we find that the number of polymorphic elements is not significantly different from that predicted by a standard population genetic model that assumes constant activity of the family until the present. This suggests to us that the HERV-K(HML2) family may be active in present-day humans. Active (replication-competent) elements are likely to have inserted very recently and to be present at low allele frequencies, and they may be causing disease in the individuals carrying them. This view of the family from a population perspective rather than a genome perspective will inform the current debate about a possible role of HERV-K(HML2) in human disease.
Collapse
Affiliation(s)
- Robert Belshaw
- Department of Biological Sciences, Imperial College, Ascot, Berks, United Kingdom.
| | | | | | | | | | | |
Collapse
|
159
|
Gilbert N, Lutz S, Morrish TA, Moran JV. Multiple fates of L1 retrotransposition intermediates in cultured human cells. Mol Cell Biol 2005; 25:7780-95. [PMID: 16107723 PMCID: PMC1190285 DOI: 10.1128/mcb.25.17.7780-7795.2005] [Citation(s) in RCA: 231] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
LINE-1 (L1) retrotransposons comprise approximately 17% of human DNA, yet little is known about L1 integration. Here, we characterized 100 retrotransposition events in HeLa cells and show that distinct DNA repair pathways can resolve L1 cDNA retrotransposition intermediates. L1 cDNA resolution can lead to various forms of genetic instability including the generation of chimeric L1s, intrachromosomal deletions, intrachromosomal duplications, and intra-L1 rearrangements as well as a possible interchromosomal translocation. The L1 retrotransposition machinery also can mobilize U6 snRNA to new genomic locations, increasing the repertoire of noncoding RNAs that are mobilized by L1s. Finally, we have determined that the L1 reverse transcriptase can faithfully replicate its own transcript and has a base misincorporation error rate of approximately 1/7,000 bases. These data indicate that L1 retrotransposition in transformed human cells can lead to a variety of genomic rearrangements and suggest that host processes act to restrict L1 integration in cultured human cells. Indeed, the initial steps in L1 retrotransposition may define a host/parasite battleground that serves to limit the number of active L1s in the genome.
Collapse
Affiliation(s)
- Nicolas Gilbert
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, 48109-0618, USA
| | | | | | | |
Collapse
|
160
|
Zingler N, Willhoeft U, Brose HP, Schoder V, Jahns T, Hanschmann KMO, Morrish TA, Löwer J, Schumann GG. Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining. Genome Res 2005; 15:780-9. [PMID: 15930490 PMCID: PMC1142468 DOI: 10.1101/gr.3421505] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Insertion of the human non-LTR retrotransposon LINE-1 (L1) into chromosomal DNA is thought to be initiated by a mechanism called target-primed reverse transcription (TPRT). This mechanism readily accounts for the attachment of the 3'-end of an L1 copy to the genomic target, but the subsequent integration steps leading to the attachment of the 5'-end to the chromosomal DNA are still cause for speculation. By applying bioinformatics to analyze the 5' junctions of recent L1 insertions in the human genome, we provide evidence that L1 uses at least two distinct mechanisms to link the 5'-end of the nascent L1 copy to its genomic target. While 5'-truncated L1 elements show a statistically significant preference for short patches of overlapping nucleotides between their target site and the point of truncation, full-length insertions display no distinct bias for such microhomologies at their 5'-ends. In a second genome-wide approach, we analyzed Alu elements to examine whether these nonautonomous retrotransposons, which are thought to be mobilized through L1 proteins, show similar characteristics. We found that Alu elements appear to be predominantly integrated via a pathway not involving overlapping nucleotides. The results indicate that a cellular nonhomologous DNA end-joining pathway may resolve intermediates from incomplete L1 retrotransposition events and thus lead to 5' truncations.
Collapse
Affiliation(s)
- Nora Zingler
- Fachgebiet Pr2/Retroelemente, Paul-Ehrlich-Institut, D-63225 Langen, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
161
|
Ustyugova SV, Amosova AL, Lebedev YB, Sverdlov ED. Cell line fingerprinting using retroelement insertion polymorphism. Biotechniques 2005; 38:561-5. [PMID: 15884674 DOI: 10.2144/05384st02] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Human cell lines are an indispensable tool for functional studies of living entities in their numerous manifestations starting with integral complex systems such as signal pathways and networks, regulation of gene ensembles, epigenetic factors, and finishing with pathological changes and impact of artificially introduced elements, such as various transgenes, on the behavior of the cell. Therefore, it is highly desirable to have reliable cell line identification techniques to make sure that the cell lines to be used in experiments are exactly what is expected. To this end, we developed a set of informative markers based on insertion polymorphism of human retroelements (REs). The set includes 47 pairs of PCR primers corresponding to introns of the human genes with dimorphic LINE1 (L1) and Alu insertions. Using locus-specific PCR assays, we have genotyped 10 human cell lines of various origins. For each of these cell lines, characteristic fingerprints were obtained. An estimated probability that two different cell lines possess the same marker genotype is about 10-18. Therefore, the proposed set of markers provides a reliable tool for cell line identification.
Collapse
Affiliation(s)
- Svetlana V Ustyugova
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | | | | | | |
Collapse
|
162
|
Wheelan SJ, Aizawa Y, Han JS, Boeke JD. Gene-breaking: a new paradigm for human retrotransposon-mediated gene evolution. Genome Res 2005; 15:1073-8. [PMID: 16024818 PMCID: PMC1182219 DOI: 10.1101/gr.3688905] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The L1 retrotransposon is the most highly successful autonomous retrotransposon in mammals. This prolific genome parasite may on occasion benefit its host through genome rearrangements or adjustments of host gene expression. In examining possible effects of L1 elements on host gene expression, we investigated whether a full-length L1 element inserted in the antisense orientation into an intron of a cellular gene may actually split the gene's transcript into two smaller transcripts: (1) a transcript containing the upstream exons and terminating in the major antisense polyadenylation site (MAPS) of the L1, and (2) a transcript derived from the L1 antisense promoter (ASP) that includes the downstream exons of the gene. Bioinformatic analysis and experimental follow-up provide evidence for this L1 "gene-breaking" hypothesis. We identified three human genes apparently "broken" by L1 elements, as well as 12 more candidate genes. Most of the inserted L1 elements in our 15 candidate genes predate the human/chimp divergence. If indeed split, the transcripts of these genes may in at least one case encode potentially interacting proteins, and in another case may encode novel proteins. Gene-breaking represents a new mechanism through which L1 elements remodel mammalian genomes.
Collapse
Affiliation(s)
- Sarah J Wheelan
- Department of Molecular Biology and Genetics, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | |
Collapse
|
163
|
Chen JM, Stenson PD, Cooper DN, Férec C. A systematic analysis of LINE-1 endonuclease-dependent retrotranspositional events causing human genetic disease. Hum Genet 2005; 117:411-27. [PMID: 15983781 DOI: 10.1007/s00439-005-1321-0] [Citation(s) in RCA: 155] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2005] [Accepted: 04/04/2005] [Indexed: 10/25/2022]
Abstract
Diverse long interspersed element-1 (LINE-1 or L1)-dependent mutational mechanisms have been extensively studied with respect to L1 and Alu elements engineered for retrotransposition in cultured cells and/or in genome-wide analyses. To what extent the in vitro studies can be held to accurately reflect in vivo events in the human genome, however, remains to be clarified. We have attempted to address this question by means of a systematic analysis of recent L1-mediated retrotranspositional events that have caused human genetic disease, with a view to providing a more complete picture of how L1-mediated retrotransposition impacts upon the architecture of the human genome. A total of 48 such mutations were identified, including those described as L1-mediated retrotransposons, as well as insertions reported to contain a poly(A) tail: 26 were L1 trans-driven Alu insertions, 15 were direct L1 insertions, four were L1 trans-driven SVA insertions, and three were associated with simple poly(A) insertions. The systematic study of these lesions, when combined with previous in vitro and genome-wide analyses, has strengthened several important conclusions regarding L1-mediated retrotransposition in humans: (a) approximately 25% of L1 insertions are associated with the 3' transduction of adjacent genomic sequences, (b) approximately 25% of the new L1 inserts are full-length, (c) poly(A) tail length correlates inversely with the age of the element, and (d) the length of target site duplication in vivo is rarely longer than 20 bp. Our analysis also suggests that some 10% of L1-mediated retrotranspositional events are associated with significant genomic deletions in humans. Finally, the identification of independent retrotranspositional events that have integrated at the same genomic locations provides new insight into the L1-mediated insertional process in humans.
Collapse
Affiliation(s)
- Jian-Min Chen
- INSERM U613-Génétique Moléculaire et Génétique Epidémiologique, Etablissement Français du Sang-Bretagne, Université de Bretagne Occidentale, Centre Hospitalier Universitaire, Brest, 29220, France.
| | | | | | | |
Collapse
|
164
|
Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SE. Natural genetic variation caused by transposable elements in humans. Genetics 2005; 168:933-51. [PMID: 15514065 PMCID: PMC1448813 DOI: 10.1534/genetics.104.031757] [Citation(s) in RCA: 127] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Transposons and transposon-like repetitive elements collectively occupy 44% of the human genome sequence. In an effort to measure the levels of genetic variation that are caused by human transposons, we have developed a new method to broadly detect transposon insertion polymorphisms of all kinds in humans. We began by identifying 606,093 insertion and deletion (indel) polymorphisms in the genomes of diverse humans. We then screened these polymorphisms to detect indels that were caused by de novo transposon insertions. Our method was highly efficient and led to the identification of 605 nonredundant transposon insertion polymorphisms in 36 diverse humans. We estimate that this represents 25-35% of approximately 2075 common transposon polymorphisms in human populations. Because we identified all transposon insertion polymorphisms with a single method, we could evaluate the relative levels of variation that were caused by each transposon class. The average human in our study was estimated to harbor 1283 Alu insertion polymorphisms, 180 L1 polymorphisms, 56 SVA polymorphisms, and 17 polymorphisms related to other forms of mobilized DNA. Overall, our study provides significant steps toward (i) measuring the genetic variation that is caused by transposon insertions in humans and (ii) identifying the transposon copies that produce this variation.
Collapse
Affiliation(s)
- E Andrew Bennett
- Department of Biochemistry, Emory University School of Medicine, Atlanta, Georgia 30322, USA
| | | | | | | | | |
Collapse
|
165
|
Kim DI, Eo HS, Joh JH. Identification of differentially expressed genes in primary varicose veins. J Surg Res 2005; 123:222-6. [PMID: 15680382 DOI: 10.1016/j.jss.2004.08.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2004] [Indexed: 01/14/2023]
Abstract
BACKGROUND A number of changes in protein expression have been described in primary varicose veins, but the altered gene expressions in this disease are unknown. The aim of this study was to identify differentially expressed genes in primary varicose veins. MATERIALS AND METHODS Total RNAs were isolated from two groups of greater saphenous veins (four primary varicose veins and three normal) and then were reverse transcribed into cDNAs. We used the differential display reverse transcription-polymerase chain reaction technique to screen the differences in the mRNA expression profiles of the groups. RESULTS We found that three cDNAs showed differences in expression patterns between normal and diseased saphenous veins. The cDNAs are prominently expressed only in patients with varicose veins. We identified that the cDNAs had significant similarities to the L1M4 repeat sequence of clone RP11-57L9, clone RP11-299H13, and Alu repetitive sequence of human tropomyosin 4 mRNA. CONCLUSIONS Our results suggest that the screened cDNA clones are useful disease markers in the genetic diagnosis of primary varicose vein and that the L1 and Alu elements possibly participated in the development of primary varicose veins through their expression patterns in genes encoded with structural proteins, such as collagen, elastin, and tropomyosin. Further studies are required to elucidate the potential relationship between repeat sequences and primary varicose veins.
Collapse
Affiliation(s)
- Dong-Ik Kim
- Division of Vascular Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea.
| | | | | |
Collapse
|
166
|
Mamedov IZ, Arzumanyan ES, Amosova AL, Lebedev YB, Sverdlov ED. Whole-genome experimental identification of insertion/deletion polymorphisms of interspersed repeats by a new general approach. Nucleic Acids Res 2005; 33:e16. [PMID: 15673711 PMCID: PMC548376 DOI: 10.1093/nar/gni018] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A new experimental technique for genome-wide detection of integration sites of polymorphic retroelements (REs) is described. The technique allows one to reveal the absence of a retroelement in an individual genome provided that this retroelement is present in at least one of several other genomes under comparison. Since quite a number of genomes are compared simultaneously, the search for polymorphic REs insertions is very efficient. The technique includes two whole-genome selective PCR amplifications of sequences flanking REs: one for a particular genome and another one for a mixture of ten different genomes. A subsequent subtractive hybridization of the obtained amplicons with DNA of a particular genome as driver results in isolation of polymorphic insertions. The technique was successfully applied for identification of 41 new polymorphic human AluYa5/Ya8 insertions. Among them, 18 individual Alu elements first sequenced in this work were not found in the available human genome databases. This result suggests that significant part of polymorphic REs were not identified during genome sequencing and remain to be detected and characterized. The proposed method does not depend on preliminary knowledge of evolutionary history of retroelements and can be applied for identification of insertion/deletion polymorphic markers in genomes of different species.
Collapse
Affiliation(s)
- Ilgar Z Mamedov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences 16/10 Miklukho-Maklaya Street, 117997 Moscow, Russia.
| | | | | | | | | |
Collapse
|
167
|
Mateus Pereira LH, Socorro A, Fernandez I, Masleh M, Vidal D, Bianchi NO, Bonatto SL, Salzano FM, Herrera RJ. Phylogenetic information in polymorphic L1 andAlu insertions from East Asians and Native American populations. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2005; 128:171-84. [PMID: 15761878 DOI: 10.1002/ajpa.10441] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
This study attempts to ascertain genetic affinities between Native American and East Asian populations by analyzing four polymorphic Alu insertions (PAIs) and three L1 polymorphic loci. These two genetic systems demonstrated strong congruence when levels of diversity and genetic distances were considered. Overall, genetic relatedness within Native American groups does not correlate with geographical and linguistic structure, although strong grouping for Native Americans with East Asians was demonstrated, with clear discrimination from African and European groups. Most of the variation was assigned to differences occurring within groups, but the interpopulation variation found for South Amerindians was recognizably higher in comparison to the other sampled groups of populations. Our data suggest that bottleneck events followed by strong influence of genetic drift in the process of the peopling of the Americas may have been determinant factors in delineating the genetic background of present-day South Amerindians. Since no clear subgroups were detected within Native Americans and East Asians, there is no indication of multiple waves in the early colonization of the New World.
Collapse
Affiliation(s)
- L H Mateus Pereira
- Department of Biological Sciences, Florida International University, University Park, Miami, Florida 33199, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
168
|
Han JS, Boeke JD. LINE-1 retrotransposons: Modulators of quantity and quality of mammalian gene expression? Bioessays 2005; 27:775-84. [PMID: 16015595 DOI: 10.1002/bies.20257] [Citation(s) in RCA: 156] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
LINE-1 (L1) retrotransposons are replicating repetitive elements that, by mass, are the most-abundant sequences in the human genome. Over one-third of mammalian genomes are the result, directly or indirectly, of L1 retrotransposition. L1 encodes two proteins: ORF1, an RNA-binding protein, and ORF2, an endonuclease/reverse transcriptase. Both proteins are required for L1 mobilization. Apart from the obvious function of self-replication, it is not clear what other roles, if any, L1 plays within its host. The sheer magnitude of L1 sequences in our genome has fueled speculation that over evolutionary time L1 insertions may structurally modify endogenous genes and regulate gene expression. Here we provide a review of L1 replication and its potential functional consequences.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Molecular Biology and Genetics and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | | |
Collapse
|
169
|
Bhangale TR, Rieder MJ, Livingston RJ, Nickerson DA. Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes. Hum Mol Genet 2004; 14:59-69. [PMID: 15525656 DOI: 10.1093/hmg/ddi006] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Despite being the second most frequent type of polymorphism in the genome, diallelic insertion-deletion polymorphisms (indels) have received far less attention in the study of sequence variation. In this report, we describe an approach that can detect indels in the heterozygous state and can comprehensively identify indels in the target sequence. Using this approach, we identified 2393 indels in a set of 330 candidate genes, i.e. an average of seven indels per gene with about two indels per gene being common (minor allele frequency >or=0.1). We compared the population genetic characteristics of indels with substitutions in this data. Our data supported the findings that deletions occur more frequently in the human genome. 5'-UTR and coding regions of the genes showed a significantly lower diversity for indels compared with other regions, suggesting differences in effects of selection on indels and substitutions. Sequence diversity and pairwise linkage disequilibrium (LD) findings of the different populations were similar to earlier results and included a greater skew towards low-frequency variants and a faster rate of LD decay in the African-descent population compared with the non-African populations. Within populations, the allele frequency spectra and LD-decay profiles for indels were similar to substitutions. Overall, the findings suggest that, although the mechanisms giving rise to indels may be different from those causing substitutions, the evolutionary histories of indels and substitutions are similar, and that indels can play a valuable role in association studies and marker selection strategies.
Collapse
Affiliation(s)
- Tushar R Bhangale
- Department of Bioengineering, University of Washington, Seattle, WA 98195-7730, USA
| | | | | | | |
Collapse
|
170
|
Abstract
To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.
Collapse
Affiliation(s)
- S Alonso
- Dept. Genetics, Physical Anthropology and Animal Physiology, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), Barrio Sarriena s/n. 48940 Leioa, Bizkaia, Spain.
| | | |
Collapse
|
171
|
Waters PD, Dobigny G, Pardini AT, Robinson TJ. LINE-1 distribution in Afrotheria and Xenarthra: implications for understanding the evolution of LINE-1 in eutherian genomes. Chromosoma 2004; 113:137-44. [PMID: 15338236 DOI: 10.1007/s00412-004-0301-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2004] [Revised: 06/04/2004] [Accepted: 06/11/2004] [Indexed: 10/26/2022]
Abstract
Long interspersed nuclear elements (LINEs) comprise about 21% of the human genome (of which L1 is most abundant) and are preferentially accumulated in AT-rich regions, as well as the X and Y chromosomes. Most knowledge of L1 distribution in mammals is restricted to human and mouse. Here we report the first investigation of L1 distribution in the genomes of a wide variety of eutherian mammals, including species in the two basal clades, Afrotheria and Xenarthra. Our results show L1 accumulation on the X of all eutherian mammals, an observation consistent with an ancestral involvement of these elements in the X-inactivation process (the Lyon repeat hypothesis). Surprisingly, conspicuous accumulation of L1 in AT-rich regions of the genome was not observed in any species outside of Euarchontoglires (represented by human, mouse and rabbit). Although several features were common to most species investigated, our comprehensive survey shows that the patterns observed in human and mouse are, in many aspects, far from typical for all mammals. We discuss these findings with reference to models that have previously been proposed to explain the AT distribution bias of L1 in human and mouse, and how this relates to the evolution of these elements in other eutherian genomes.
Collapse
Affiliation(s)
- Paul D Waters
- Evolutionary Genomics Group, Department of Zoology, University of Stellenbosch, Private Bag X1, 7602, Matieland, South Africa
| | | | | | | |
Collapse
|
172
|
Boissinot S, Entezam A, Young L, Munson PJ, Furano AV. The insertional history of an active family of L1 retrotransposons in humans. Genome Res 2004; 14:1221-31. [PMID: 15197167 PMCID: PMC442137 DOI: 10.1101/gr.2326704] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
As humans contain a currently active L1 (LINE-1) non-LTR retrotransposon family (Ta-1), the human genome database likely provides only a partial picture of Ta-1-generated diversity. We used a non-biased method to clone Ta-1 retrotransposon-containing loci from representatives of four ethnic populations. We obtained 277 distinct Ta-1 loci and identified an additional 67 loci in the human genome database. This collection represents approximately 90% of the Ta-1 population in the individuals examined and is thus more representative of the insertional history of Ta-1 than the human genome database, which lacked approximately 40% of our cloned Ta-1 elements. As both polymorphic and fixed Ta-1 elements are as abundant in the GC-poor genomic regions as in ancestral L1 elements, the enrichment of L1 elements in GC-poor areas is likely due to insertional bias rather than selection. Although the chromosomal distribution of Ta-1 inserts is generally a function of chromosomal length and gene density, chromosome 4 significantly deviates from this pattern and has been much more hospitable to Ta-1 insertions than any other chromosome. Also, the intra-chromosomal distribution of Ta-1 elements is not uniform. Ta-1 elements tend to cluster, and the maximal gaps between Ta-1 inserts are larger than would be expected from a model of uniform random insertion.
Collapse
Affiliation(s)
- Stéphane Boissinot
- Section on Genomic Structure and Function, Laboratory of Molecular and Cellular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, Maryland 20892, USA
| | | | | | | | | |
Collapse
|
173
|
Han JS, Boeke JD. A highly active synthetic mammalian retrotransposon. Nature 2004; 429:314-8. [PMID: 15152256 DOI: 10.1038/nature02535] [Citation(s) in RCA: 151] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2004] [Accepted: 03/30/2004] [Indexed: 11/09/2022]
Abstract
LINE-1 (L1) elements are retrotransposons that comprise large fractions of mammalian genomes. Transcription through L1 open reading frames is inefficient owing to an elongation defect, inhibiting the robust expression of L1 RNA and proteins, the substrate and enzyme(s) for retrotransposition. This elongation defect probably controls L1 transposition frequency in mammalian cells. Here we report bypassing this transcriptional defect by synthesizing the open reading frames of L1 from synthetic oligonucleotides, altering 24% of the nucleic acid sequence without changing the amino acid sequence. Such resynthesis led to greatly enhanced steady-state L1 RNA and protein levels. Remarkably, when the synthetic open reading frames were substituted for the wild-type open reading frames in an established retrotransposition assay, transposition levels increased more than 200-fold. This indicates that there are probably no large, rigidly conserved cis-acting nucleic acid sequences required for retrotransposition within L1 coding regions. These synthetic retrotransposons are also the most highly active L1 elements known so far and have potential as practical tools for manipulating mammalian genomes.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Molecular Biology and Genetics and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | |
Collapse
|
174
|
Han JS, Szak ST, Boeke JD. Transcriptional disruption by the L1 retrotransposon and implications for mammalian transcriptomes. Nature 2004; 429:268-74. [PMID: 15152245 DOI: 10.1038/nature02536] [Citation(s) in RCA: 371] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2004] [Accepted: 03/30/2004] [Indexed: 11/08/2022]
Abstract
LINE-1 (L1) elements are the most abundant autonomous retrotransposons in the human genome, accounting for about 17% of human DNA. The L1 retrotransposon encodes two proteins, open reading frame (ORF)1 and the ORF2 endonuclease/reverse transcriptase. L1 RNA and ORF2 protein are difficult to detect in mammalian cells, even in the context of overexpression systems. Here we show that inserting L1 sequences on a transcript significantly decreases RNA expression and therefore protein expression. This decreased RNA concentration does not result from major effects on the transcription initiation rate or RNA stability. Rather, the poor L1 expression is primarily due to inadequate transcriptional elongation. Because L1 is an abundant and broadly distributed mobile element, the inhibition of transcriptional elongation by L1 might profoundly affect expression of endogenous human genes. We propose a model in which L1 affects gene expression genome-wide by acting as a 'molecular rheostat' of target genes. Bioinformatic data are consistent with the hypothesis that L1 can serve as an evolutionary fine-tuner of the human transcriptome.
Collapse
Affiliation(s)
- Jeffrey S Han
- Department of Molecular Biology and Genetics and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | |
Collapse
|
175
|
|
176
|
Boissinot S, Roos C, Furano AV. Different rates of LINE-1 (L1) retrotransposon amplification and evolution in New World monkeys. J Mol Evol 2004; 58:122-30. [PMID: 14743320 DOI: 10.1007/s00239-003-2539-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2003] [Accepted: 08/08/2003] [Indexed: 10/26/2022]
Abstract
LINE-1 (L1) elements constitute the major family of retrotransposons in mammalian genomes. Here we report the first investigation of L1 evolution in New World monkeys (NWM). Two regions of the second open-reading frame were analyzed by two methods in three NWM species, the squirrel monkey (Saimiri sciureus), the tamarin (Saguinus oedipus), and the spider monkey (Ateles paniscus). Since these three species diverged, L1 has amplified in the Saimiri and Saguinus lineages but L1 activity seems to have been strongly reduced in the Ateles lineage. In addition, the active L1 lineage has evolved rapidly in Saimiri and Saguinus, generating species-specific subfamilies. In contrast, we found no evidence for a species-specific subfamily in Ateles, a result consistent with the low L1 activity in this species for the last approximately 25 My.
Collapse
Affiliation(s)
- Stéphane Boissinot
- Section on Genomic Structure and Function, Laboratory of Molecular and Cellular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0830, USA
| | | | | |
Collapse
|
177
|
Bae YA, Kong Y. Evolutionary course of CsRn1 long-terminal-repeat retrotransposon and its heterogeneous integrations into the genome of the liver fluke, Clonorchis sinensis. THE KOREAN JOURNAL OF PARASITOLOGY 2004; 41:209-19. [PMID: 14699262 PMCID: PMC2717513 DOI: 10.3347/kjp.2003.41.4.209] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The evolutionary course of the CsRn1 long-terminal-repeat (LTR) retrotransposon was predicted by conducting a phylogenetic analysis with its paralog LTR sequences. Based on the clustering patterns in the phylogenetic tree, multiple CsRn1 copies could be grouped into four subsets, which were shown to have different integration times. Their differential sequence divergences and heterogeneous integration patterns strongly suggested that these subsets appeared sequentially in the genome of C. sinensis. Members of recently expanding subset showed the lowest level of divergence in their LTR and reverse transcriptase gene sequences. They were also shown to be highly polymorphic among individual genomes of the trematode. The CsRn1 element exhibited a preference for repetitive, agenic chromosomal regions in terms of selecting integration targets. Our results suggested that CsRn1 might induce a considerable degree of intergenomic variation and, thereby, have influenced the evolution of the C. sinensis genome.
Collapse
Affiliation(s)
- Young-An Bae
- Department of Molecular Parasitology and Center for Molecular Medicine, Sungkyunkwan University School of Medicine and Samsung Biomedical Research Institute, Suwon 440-746, Republic of Korea
| | | |
Collapse
|
178
|
Khodosevich K, Lebedev Y, Sverdlov ED. Large-scale determination of the methylation status of retrotransposons in different tissues using a methylation tags approach. Nucleic Acids Res 2004; 32:e31. [PMID: 14973327 PMCID: PMC373429 DOI: 10.1093/nar/gnh035] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2003] [Revised: 01/24/2004] [Accepted: 01/24/2004] [Indexed: 12/13/2022] Open
Abstract
A technique for simultaneous determination of the methylation status of numerous loci containing retroelements (REs) is reported. It is based on the observation that methylated and unmethylated areas in the genome are usually extended, and therefore the methylation of particular methyl-sensitive restriction endonuclease recognition sites might reflect the methylation status of DNA regions around them. The method includes dot-blot hybridization of repeat flanking sequences arrayed on a solid support with specifically amplified flanking regions of presumably unmethylated repeats. A multitude of flanking regions of REs adjacent to unmethylated restriction sites are amplified simultaneously, providing a complex hybridization probe. The technique thus allows the determination of the methylation status of restriction sites, which serve as tags of the methylation status of the surrounding regions. The validity of the technique was confirmed by various means, including bisulfite sequencing. The technique was successfully applied to the identification of methylation patterns of the regions surrounding 38 human-specific HERV-K(HML-2) long terminal repeats in cerebellum- and lymph node-derived genomic DNAs. The described technique can be readily adapted to the use of DNA microarray technology.
Collapse
Affiliation(s)
- Konstantin Khodosevich
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Science, 16/10 Miklukho-Maklaya St., 117997 Moscow, Russia.
| | | | | |
Collapse
|
179
|
Farley AH, Luning Prak ET, Kazazian HH. More active human L1 retrotransposons produce longer insertions. Nucleic Acids Res 2004; 32:502-10. [PMID: 14742665 PMCID: PMC373329 DOI: 10.1093/nar/gkh202] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2003] [Revised: 09/18/2003] [Accepted: 12/10/2003] [Indexed: 11/13/2022] Open
Abstract
The vast majority of L1 insertions are 5' truncated and thus inactive. Yet, the mechanism of 5' truncation is unknown. To examine whether the frequency of L1 retrotransposition is directly correlated with the length of genomic L1 insertions, we used a cell culture assay to measure retrotransposition frequency and a PCR-based assay to measure L1 insertion length. We tested five full-length human L1 elements that retrotranspose at different frequencies: LRE3, L1(RP), L1.3, L1.2A and L1.2B. Our data suggest that L1 insertion length correlates with L1 retrotransposition frequency for insertions >1 kb in length. For two elements, L1(RP) and L1.2A, we found that swapping the reverse transcriptase domains had little effect. Instead, we found that genomic insertion length and retrotransposition frequency are substantially affected by amino acid substitutions at positions 363, 1220 and 1259 in ORF2. We suggest that the region containing residues 1220 and 1259 may be important in the binding of ORF2p to L1 RNA to facilitate reverse transcription.
Collapse
Affiliation(s)
- Alexander H Farley
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, PA, USA
| | | | | |
Collapse
|
180
|
Tishkoff SA, Verrelli BC. Patterns of human genetic diversity: implications for human evolutionary history and disease. Annu Rev Genomics Hum Genet 2003; 4:293-340. [PMID: 14527305 DOI: 10.1146/annurev.genom.4.070802.110226] [Citation(s) in RCA: 234] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Since the completion of the human genome sequencing project, the discovery and characterization of human genetic variation is a principal focus for future research. Comparative studies across ethnically diverse human populations and across human and nonhuman primate species is important for reconstructing human evolutionary history and for understanding the genetic basis of human disease. In this review, we summarize data on patterns of human genetic diversity and the evolutionary forces (mutation, genetic drift, migration, and selection) that have shaped these patterns of variation across both human populations and the genome. African population samples typically have higher levels of genetic diversity, a complex population substructure, and low levels of linkage disequilibrium (LD) relative to non-African populations. We discuss these differences and their implications for mapping disease genes and for understanding how population and genomic diversity have been important in the evolution, differentiation, and adaptation of humans.
Collapse
Affiliation(s)
- Sarah A Tishkoff
- Department of Biology, University of Maryland, College Park, Maryland 20742, USA.
| | | |
Collapse
|
181
|
Ostertag EM, Goodier JL, Zhang Y, Kazazian HH. SVA elements are nonautonomous retrotransposons that cause disease in humans. Am J Hum Genet 2003; 73:1444-51. [PMID: 14628287 PMCID: PMC1180407 DOI: 10.1086/380207] [Citation(s) in RCA: 253] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2003] [Accepted: 09/24/2003] [Indexed: 11/04/2022] Open
Abstract
L1 elements are the only active autonomous retrotransposons in the human genome. The nonautonomous Alu elements, as well as processed pseudogenes, are retrotransposed by the L1 retrotransposition proteins working in trans. Here, we describe another repetitive sequence in the human genome, the SVA element. Our analysis reveals that SVA elements are currently active in the human genome. SVA elements, like Alus and L1s, occasionally insert into genes and cause disease. Furthermore, SVA elements are probably mobilized in trans by active L1 elements.
Collapse
Affiliation(s)
- Eric M Ostertag
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104-6145, USA
| | | | | | | |
Collapse
|
182
|
Lutz SM, Vincent BJ, Kazazian HH, Batzer MA, Moran JV. Allelic heterogeneity in LINE-1 retrotransposition activity. Am J Hum Genet 2003; 73:1431-7. [PMID: 14610717 PMCID: PMC1180405 DOI: 10.1086/379744] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2003] [Accepted: 09/09/2003] [Indexed: 11/03/2022] Open
Abstract
De novo LINE-1 (long interspersed element-1, or L1) retrotransposition events are responsible for approximately 1/1,000 disease-causing mutations in humans. Previously, L1.2 was identified as the likely progenitor of a mutagenic insertion in the factor VIII gene in a patient with hemophilia A. It subsequently was shown to be one of a small number of active L1s in the human genome. Here, we demonstrate that L1.2 is present at an intermediate insertion allele frequency in worldwide human populations and that common alleles (L1.2A and L1.2B) exhibit an approximately 16-fold difference in their ability to retrotranspose in cultured human HeLa cells. Chimera analysis revealed that two amino acid substitutions (S1259L and I1220M) downstream of the conserved cysteine-rich motif in L1 open reading frame 2 are largely responsible for the observed reduction in L1.2A retrotransposition efficiency. Thus, common L1 alleles can vary widely in their retrotransposition potential. We propose that such allelic heterogeneity can influence the potential L1 mutational load present in an individual genome.
Collapse
Affiliation(s)
- Sheila M Lutz
- Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109-0618, USA.
| | | | | | | | | |
Collapse
|
183
|
Buzdin A, Ustyugova S, Gogvadze E, Lebedev Y, Hunsmann G, Sverdlov E. Genome-wide targeted search for human specific and polymorphic L1 integrations. Hum Genet 2003; 112:527-33. [PMID: 12601470 DOI: 10.1007/s00439-002-0904-2] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2002] [Accepted: 12/06/2002] [Indexed: 11/30/2022]
Abstract
Retroelements (REs) occupy up to 40% of the human genome. Newly integrated REs can change the pattern of expression of pre-existing host genes and therefore might play a significant role in evolution. In particular, human- and primate-specific REs could affect the divergence of the Hominoidea superfamily. A comparative genome-wide analysis of RE sites of integration, neighboring genes, and their regulatory interplay in human and ape genomes would be of help in understanding the impact of REs on evolution and genome regulation. We have developed a technique for the genome-wide comparison of the integrations of transposable elements in genomic DNAs of closely related species. The technique called targeted genome differences analysis (TGDA) is also useful for the detection of deletion/insertion polymorphisms of REs. The technique is based on an enhanced version of subtractive hybridization and does not require preliminary knowledge of the genome sequences under comparison. In this report, we describe its application to the detection and analysis of human specific L1 integrations and their polymorphisms. We obtained a library highly enriched in human-specific L1 insertions and identified 24 such new insertions. Many of these insertions are polymorphic in human populations. The total number of human-specific L1 inserts was estimated to be approximately 4000. The results suggest that TGDA is a universal method that can be successfully used for the detection of evolutionary and polymorphic markers in any closely related genomes.
Collapse
Affiliation(s)
- Anton Buzdin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya, 117997 Moscow, Russia.
| | | | | | | | | | | |
Collapse
|
184
|
Brouha B, Schustak J, Badge RM, Lutz-Prigge S, Farley AH, Moran JV, Kazazian HH. Hot L1s account for the bulk of retrotransposition in the human population. Proc Natl Acad Sci U S A 2003; 100:5280-5. [PMID: 12682288 PMCID: PMC154336 DOI: 10.1073/pnas.0831042100] [Citation(s) in RCA: 764] [Impact Index Per Article: 36.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2002] [Accepted: 02/20/2003] [Indexed: 11/18/2022] Open
Abstract
Although LINE-1 (long interspersed nucleotide element-1, L1) retrotransposons comprise 17% of the human genome, an exhaustive search of the December 2001 "freeze" of the haploid human genome working draft sequence (95% complete) yielded only 90 L1s with intact ORFs. We demonstrate that 38 of 86 (44%) L1s are polymorphic as to their presence in human populations. We cloned 82 (91%) of the 90 L1s and found that 40 of the 82 (49%) are active in a cultured cell retrotransposition assay. From these data, we predict that there are 80-100 retrotransposition-competent L1s in an average human being. Remarkably, 84% of assayed retrotransposition capability was present in six highly active L1s (hot L1s). By comparison, four of five full-length L1s involved in recent human insertions had retrotransposition activity comparable to the six hot L1s in the human genome working draft sequence. Thus, our data indicate that most L1 retrotransposition in the human population stems from hot L1s, with the remaining elements playing a lesser role in genome plasticity.
Collapse
Affiliation(s)
- Brook Brouha
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | | | | | | | | | | | | |
Collapse
|
185
|
Badge RM, Alisch RS, Moran JV. ATLAS: a system to selectively identify human-specific L1 insertions. Am J Hum Genet 2003; 72:823-38. [PMID: 12632328 PMCID: PMC1180347 DOI: 10.1086/373939] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2002] [Accepted: 12/30/2002] [Indexed: 11/03/2022] Open
Abstract
Retrotransposition of L1 LINEs (long interspersed elements) continues to sculpt the human genome. However, because recent insertions are dimorphic, they are not fully represented in sequence databases. Here, we have developed a system, termed "ATLAS" (amplification typing of L1 active subfamilies), that enables the selective amplification and display of DNA fragments containing the termini of human-specific L1s and their respective flanking sequences. We demonstrate that ATLAS is robust and that the resultant display patterns are highly reproducible, segregate in Centre d'Etude du Polymorphisme Humain pedigrees, and provide an individual-specific fingerprint. ATLAS also allows the identification of L1s that are absent from current genome databases, and we show that some of these L1s can retrotranspose at high frequencies in cultured human cells. Finally, we demonstrate that ATLAS also can identify single-nucleotide polymorphisms within a subset of older, primate-specific L1s. Thus, ATLAS provides a simple, high-throughput means to assess genetic variation associated with L1 retrotransposons.
Collapse
Affiliation(s)
- Richard M. Badge
- Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; and Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Reid S. Alisch
- Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; and Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - John V. Moran
- Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; and Department of Genetics, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
186
|
Mathews LM, Chi SY, Greenberg N, Ovchinnikov I, Swergold GD. Large differences between LINE-1 amplification rates in the human and chimpanzee lineages. Am J Hum Genet 2003; 72:739-48. [PMID: 12574987 PMCID: PMC1180250 DOI: 10.1086/368275] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2002] [Accepted: 12/09/2002] [Indexed: 11/03/2022] Open
Abstract
The genomic evolution and causes of phenotypic variation among humans and great apes remain largely unknown, although the phylogenetic relationships among them have been extensively explored. Previous studies that focus on differences at the amino acid and nucleotide sequence levels have revealed a high degree of similarity between humans and chimpanzees, suggesting that other types of genomic change may have contributed to the relatively large phenotypic differences between them. For example, the activity of long interspersed element 1 (LINE-1) retrotransposons may impose significant changes on genomic structure and function and, consequently, on phenotype. Here we investigate the relative rates of LINE-1 amplification in the lineages leading to humans, bonobos (Pan paniscus), and chimpanzees (P. troglodytes). Our data indicate that LINE-1 insertions have accumulated at significantly greater rates in bonobos and chimpanzees than in humans, provide insights into the timing of major LINE-1 amplification events during great ape evolution, and identify a Pan-specific LINE-1 subfamily.
Collapse
Affiliation(s)
- Lauren M Mathews
- Division of Molecular Medicine, Department of Medicine, Columbia University, New York, NY 10032, USA
| | | | | | | | | |
Collapse
|
187
|
Salem AH, Myers JS, Otieno AC, Watkins WS, Jorde LB, Batzer MA. LINE-1 preTa elements in the human genome. J Mol Biol 2003; 326:1127-46. [PMID: 12589758 DOI: 10.1016/s0022-2836(03)00032-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The preTa subfamily of long interspersed elements (LINEs) is characterized by a three base-pair "ACG" sequence in the 3' untranslated region, contains approximately 400 members in the human genome, and has low level of nucleotide divergence with an estimated average age of 2.34 million years old suggesting that expansion of the L1 preTa subfamily occurred just after the divergence of humans and African apes. We have identified 362 preTa L1 elements from the draft human genomic sequence, investigated the genomic characteristics of preTa L1 insertions, and screened individual elements across diverse human populations and various non-human primate species using polymerase chain reaction (PCR) assays to determine the phylogenetic origin and levels of human genomic diversity associated with the L1 elements. All of the preTa L1 elements analyzed by PCR were absent from the orthologous positions in non-human primate genomes with 33 (14%) of the L1 elements being polymorphic with respect to insertion presence or absence in the human genome. The newly identified L1 insertion polymorphisms will prove useful as identical by descent genetic markers for the study of human population genetics. We provide evidence that preTa L1 elements show an integration site preference for genomic regions with low GC content. Computational analysis of the preTa L1 elements revealed that 29% of the elements amenable to complete sequence analysis have apparently escaped 5' truncation and are essentially full-length (approximately 6kb). In all, 29 have two intact open reading frames and may be capable of retrotransposition.
Collapse
Affiliation(s)
- Abdel Halim Salem
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | | | | | | | | | | |
Collapse
|
188
|
Prak ETL, Dodson AW, Farkash EA, Kazazian HH. Tracking an embryonic L1 retrotransposition event. Proc Natl Acad Sci U S A 2003; 100:1832-7. [PMID: 12569170 PMCID: PMC149919 DOI: 10.1073/pnas.0337627100] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Long interspersed nuclear elements 1 (L1) are active retrotransposons that reside in many species, including humans and rodents. L1 elements produce an RNA intermediate that is reverse transcribed to DNA and inserted in a new genomic location. We have tagged an active human L1 element (L1(RP)) with a gene encoding enhanced GFP (EGFP). Expression of GFP occurs only if L1-EGFP has undergone a cycle of transcription, reverse transcription, and integration into a transcriptionally permissive genomic region. We show here that L1-EGFP can undergo retrotransposition in vivo and produce fluorescence in mouse testis. The retrotransposition event characterized here has occurred at a very early stage in the development of an L1-EGFP transgenic founder mouse.
Collapse
Affiliation(s)
- Eline T Luning Prak
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, 405B Stellar Chance Laboratories, 422 Curie Boulevard, Philadelphia, PA 19104, USA.
| | | | | | | |
Collapse
|
189
|
Birth of ‘human-specific’ genes during primate evolution. CONTEMPORARY ISSUES IN GENETICS AND EVOLUTION 2003. [DOI: 10.1007/978-94-010-0229-5_9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
190
|
Ostertag EM, DeBerardinis RJ, Goodier JL, Zhang Y, Yang N, Gerton GL, Kazazian HH. A mouse model of human L1 retrotransposition. Nat Genet 2002; 32:655-60. [PMID: 12415270 DOI: 10.1038/ng1022] [Citation(s) in RCA: 167] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2002] [Accepted: 09/24/2002] [Indexed: 11/09/2022]
Abstract
The L1 retrotransposon has had an immense impact on the size and structure of the human genome through a variety of mechanisms, including insertional mutagenesis. To study retrotransposition in a living organism, we created a mouse model of human L1 retrotransposition. Here we show that L1 elements can retrotranspose in male germ cells, and that expression of a human L1 element under the control of its endogenous promoter is restricted to testis and ovary. In the mouse line with the highest level of L1 expression, we found two de novo L1 insertions in 135 offspring. Both insertions were structurally indistinguishable from natural endogenous insertions. This suggests that an individual L1 element can have substantial mutagenic potential. In addition to providing a valuable in vivo model of retrotransposition in mammals, these mice are an important step in the development of a new random mutagenesis system.
Collapse
Affiliation(s)
- Eric M Ostertag
- Department of Genetics and University of Pennsylvania School of Medicine, 475 Clinical Research Bldg., 415 Curie Blvd., Philadelphia, Pennsylvania 19104, USA
| | | | | | | | | | | | | |
Collapse
|
191
|
Pavlícek A, Paces J, Zíka R, Hejnar J. Length distribution of long interspersed nucleotide elements (LINEs) and processed pseudogenes of human endogenous retroviruses: implications for retrotransposition and pseudogene detection. Gene 2002; 300:189-94. [PMID: 12468100 DOI: 10.1016/s0378-1119(02)01047-8] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Deciphering the human genome includes reliable identification and structural characterization of individual retrotransposon elements. The most active group of autonomous transposable elements, the long interspersed nuclear elements (LINE), transpose themselves as well as other RNAs, including those of human endogenous retroviruses (HERV). During this transposition, however, the LINE-encoded reverse transcriptase (RT) often abortively dissociates from the RNA template, leaving a prematurely terminated, 5' truncated copy. We have analyzed the length distributions of LINEs and of processed pseudogenes derived from HERV-W. As expected, we have found that the majority of 5' truncated LINEs and HERV-W processed pseudogenes show a prevalence of very short elements terminated close to the 3' end. On the other hand, the number of complete elements is far above the expectation. The characteristic distribution in both cases indicates two important conclusions: (i) dissociation of LINE RT from the template cannot be fully explained by low processivity of RT modelled as a stochastic, Poisson-type process. (ii) Currently cited numbers of pseudogenes within the human genome are underestimated, since a large percentage of pseudogenes are terminated in the 3' untranslated region and remain undetectable in translated homology searches of protein databases against the human genome.
Collapse
Affiliation(s)
- Adam Pavlícek
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo nam. 2, Prague 6, CZ-16637, Czech Republic
| | | | | | | |
Collapse
|
192
|
Abstract
The eukaryotic genome has undergone a series of epidemics of amplification of mobile elements that have resulted in most eukaryotic genomes containing much more of this 'junk' DNA than actual coding DNA. The majority of these elements utilize an RNA intermediate and are termed retroelements. Most of these retroelements appear to amplify in evolutionary waves that insert in the genome and then gradually diverge. In humans, almost half of the genome is recognizably derived from retroelements, with the two elements that are currently actively amplifying, L1 and Alu, making up about 25% of the genome and contributing extensively to disease. The mechanisms of this amplification process are beginning to be understood, although there are still more questions than answers. Insertion of new retroelements may directly damage the genome, and the presence of multiple copies of these elements throughout the genome has longer-term influences on recombination events in the genome and more subtle influences on gene expression.
Collapse
Affiliation(s)
- Prescott L Deininger
- Tulane Cancer Center, Department of Environmental Health Sciences, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA.
| | | |
Collapse
|
193
|
Roy-Engel AM, Salem AH, Oyeniran OO, Deininger L, Hedges DJ, Kilroy GE, Batzer MA, Deininger PL. Active Alu element "A-tails": size does matter. Genome Res 2002; 12:1333-44. [PMID: 12213770 PMCID: PMC186649 DOI: 10.1101/gr.384802] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Long and short interspersed elements (LINEs and SINEs) are retroelements that make up almost half of the human genome. L1 and Alu represent the most prolific human LINE and SINE families, respectively. Only a few Alu elements are able to retropose, and the factors determining their retroposition capacity are poorly understood. The data presented in this paper indicate that the length of Alu "A-tails" is one of the principal factors in determining the retropositional capability of an Alu element. The A stretches of the Alu subfamilies analyzed, both old (Alu S and J) and young (Ya5), had a Poisson distribution of A-tail lengths with a mean size of 21 and 26, respectively. In contrast, the A-tails of very recent Alu insertions (disease causing) were all between 40 and 97 bp in length. The L1 elements analyzed displayed a similar tendency, in which the "disease"-associated elements have much longer A-tails (mean of 77) than do the elements even from the young Ta subfamily (mean of 41). Analysis of the draft sequence of the human genome showed that only about 1000 of the over one million Alu elements have tails of 40 or more adenosine residues in length. The presence of these long A stretches shows a strong bias toward the actively amplifying subfamilies, consistent with their playing a major role in the amplification process. Evaluation of the 19 Alu elements retrieved from the draft sequence of the human genome that are identical to the Alu Ya5a2 insert in the NF1 gene showed that only five have tails with 40 or more adenosine residues. Sequence analysis of the loci with the Alu elements containing the longest A-tails (7 of the 19) from the genomes of the NF1 patient and the father revealed that there are at least two loci with A-tails long enough to serve as source elements within our model. Analysis of the A-tail lengths of 12 Ya5a2 elements in diverse human population groups showed substantial variability in both the Alu A-tail length and sequence homogeneity. On the basis of these observations, a model is presented for the role of A-tail length in determining which Alu elements are active.
Collapse
Affiliation(s)
- Astrid M Roy-Engel
- Tulane Cancer Center, SL-66, Department of Environmental Health Sciences, Tulane University-Health Sciences Center, New Orleans, Louisiana 70112, USA
| | | | | | | | | | | | | | | |
Collapse
|
194
|
Symer DE, Connelly C, Szak ST, Caputo EM, Cost GJ, Parmigiani G, Boeke JD. Human l1 retrotransposition is associated with genetic instability in vivo. Cell 2002; 110:327-38. [PMID: 12176320 DOI: 10.1016/s0092-8674(02)00839-5] [Citation(s) in RCA: 336] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Retrotransposons have shaped eukaryotic genomes for millions of years. To analyze the consequences of human L1 retrotransposition, we developed a genetic system to recover many new L1 insertions in somatic cells. Forty-two de novo integrants were recovered that faithfully mimic many aspects of L1s that accumulated since the primate radiation. Their structures experimentally demonstrate an association between L1 retrotransposition and various forms of genetic instability. Numerous L1 element inversions, extra nucleotide insertions, exon deletions, a chromosomal inversion, and flanking sequence comobilization (called 5' transduction) were identified. In a striking number of integrants, short identical sequences were shared between the donor and the target site's 3' end, suggesting a mechanistic model that helps explain the structure of L1 insertions.
Collapse
Affiliation(s)
- David E Symer
- Department of Molecular Biology and Genetics, John Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | | | | | |
Collapse
|
195
|
Ovchinnikov I, Rubin A, Swergold GD. Tracing the LINEs of human evolution. Proc Natl Acad Sci U S A 2002; 99:10522-7. [PMID: 12138175 PMCID: PMC124962 DOI: 10.1073/pnas.152346799] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2002] [Accepted: 06/10/2002] [Indexed: 11/18/2022] Open
Abstract
The amplification of DNA by LINE-1 (L1) retrotransposons has created a large fraction of the human genome. To better understand their role in human evolution we endeavored to delineate the L1 elements that have amplified since the emergence of the hominid lineage. We used an approach based on shared sequence variants to trace backwards from the currently amplifying Ta subfamily. The newly identified groups of insertions account for much of the molecular evolution of human L1s. We report the identification of a L1 subfamily that amplified both before and after the divergence of humans from our closest extant relatives. Progressively more modern groups of L1s include greater numbers of insertions. Our data are consistent with the hypothesis that the rate of L1 amplification has been increasing during recent human evolution.
Collapse
Affiliation(s)
- Igor Ovchinnikov
- Division of Molecular Medicine, Department of Medicine, Columbia University, 600 West 168th Street, New York, NY 10032, USA
| | | | | |
Collapse
|
196
|
Brouha B, Meischl C, Ostertag E, de Boer M, Zhang Y, Neijens H, Roos D, Kazazian HH. Evidence consistent with human L1 retrotransposition in maternal meiosis I. Am J Hum Genet 2002; 71:327-36. [PMID: 12094329 PMCID: PMC379165 DOI: 10.1086/341722] [Citation(s) in RCA: 102] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2002] [Accepted: 05/10/2002] [Indexed: 11/04/2022] Open
Abstract
We have used a unique polymorphic 3' transduction to show that a human L1, or LINE-1 (long interspersed nucleotide element-1), retrotransposition event most likely occurred in the maternal primary oocyte during meiosis I. We characterized a truncated L1 retrotransposon with a 3' transduction that was inserted, in a Dutch male patient, into the X-linked gene CYBB, thereby causing chronic granulomatous disease. We used the unique flanking sequence to localize the precursor L1 locus, LRE3, to chromosome 2q24.1. In a cell culture assay, the retrotransposition frequency of LRE3 is greater than that for any other element that has been tested to date. The patient's mother had two LRE3 alleles that differed slightly in the 3'-flanking genomic DNA. The patient had a single LRE3 allele that was identical to one of the maternal alleles; however, the patient's insertion matched the maternal LRE3 allele that he did not inherit. Other data indicate that there is only a small chance that the father (unavailable for analysis) carries the precursor LRE3 allele. In addition, paternal origin of the insertion would have required that an LRE3 mRNA transcribed before meiosis II be carried separately from its precursor LRE3 allele in the fertilizing sperm. Since the mother carries a potential precursor allele and the insertion was on the patient's maternal X chromosome, it is highly likely that the insertion originated during maternal meiosis I.
Collapse
Affiliation(s)
- Brook Brouha
- Department of Genetics, University of Pennsylvania School of Medicine, 475 Clinical Research Building, 415 Curie Boulevard, Philadelphia, PA 19104, USA
| | | | | | | | | | | | | | | |
Collapse
|
197
|
Myers JS, Vincent BJ, Udall H, Watkins WS, Morrish TA, Kilroy GE, Swergold GD, Henke J, Henke L, Moran JV, Jorde LB, Batzer MA. A comprehensive analysis of recently integrated human Ta L1 elements. Am J Hum Genet 2002; 71:312-26. [PMID: 12070800 PMCID: PMC379164 DOI: 10.1086/341718] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2002] [Accepted: 05/09/2002] [Indexed: 11/04/2022] Open
Abstract
The Ta (transcribed, subset a) subfamily of L1 LINEs (long interspersed elements) is characterized by a 3-bp ACA sequence in the 3' untranslated region and contains approximately 520 members in the human genome. Here, we have extracted 468 Ta L1Hs (L1 human specific) elements from the draft human genomic sequence and screened individual elements using polymerase-chain-reaction (PCR) assays to determine their phylogenetic origin and levels of human genomic diversity. One hundred twenty-four of the elements amenable to complete sequence analysis were full length ( approximately 6 kb) and have apparently escaped any 5' truncation. Forty-four of these full-length elements have two intact open reading frames and may be capable of retrotransposition. Sequence analysis of the Ta L1 elements showed a low level of nucleotide divergence with an estimated age of 1.99 million years, suggesting that expansion of the L1 Ta subfamily occurred after the divergence of humans and African apes. A total of 262 Ta L1 elements were screened with PCR-based assays to determine their phylogenetic origin and the level of human genomic variation associated with each element. All of the Ta L1 elements analyzed by PCR were absent from the orthologous positions in nonhuman primate genomes, except for a single element (L1HS72) that was also present in the common (Pan troglodytes) and pygmy (P. paniscus) chimpanzee genomes. Sequence analysis revealed that this single exception is the product of a gene conversion event involving an older preexisting L1 element. One hundred fifteen (45%) of the Ta L1 elements were polymorphic with respect to insertion presence or absence and will serve as identical-by-descent markers for the study of human evolution.
Collapse
Affiliation(s)
- Jeremy S. Myers
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Bethaney J. Vincent
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Hunt Udall
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - W. Scott Watkins
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Tammy A. Morrish
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Gail E. Kilroy
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Gary D. Swergold
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Jurgen Henke
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Lotte Henke
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - John V. Moran
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Lynn B. Jorde
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| | - Mark A. Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge; Departments of Pathology, Genetics, Biochemistry, and Molecular Biology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Health Sciences Center, New Orleans; Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City; Departments of Human Genetics and Internal Medicine, University of Michigan Medical School, Ann Arbor; Division of Molecular Medicine, Department of Medicine, Columbia University, New York; and Institut für Blutgruppenforschung, Cologne
| |
Collapse
|
198
|
Mamedov I, Batrak A, Buzdin A, Arzumanyan E, Lebedev Y, Sverdlov ED. Genome-wide comparison of differences in the integration sites of interspersed repeats between closely related genomes. Nucleic Acids Res 2002; 30:e71. [PMID: 12136119 PMCID: PMC135772 DOI: 10.1093/nar/gnf071] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2002] [Revised: 05/29/2002] [Accepted: 05/29/2002] [Indexed: 11/14/2022] Open
Abstract
A technique for genome-wide detection of differences in the integration site positions of interspersed repeats in related genomes (DiffIR) is described. The technique is based on a whole- genome selective PCR amplification of the repeats' flanking regions followed by a differential hybridization screening of the arrayed library of the selected amplicons. The technique was successfully applied to the comparison of the integration sites in the human and chimpanzee genomes, allowing us to discover 11 new human-specific integrations of human endogenous retrovirus, K family (HML-2) long terminal repeats.
Collapse
Affiliation(s)
- Ilgar Mamedov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya Street, 117997 Moscow, Russia.
| | | | | | | | | | | |
Collapse
|
199
|
Abstract
A review was made in relation to the molecular variability present in North, Central, and South American Indian populations. It involved results from ancient DNA, mitochondrial DNA in extant populations, HLA and other autosomal markers, X and Y chromosome variation, as well as data from parasitic viruses which could show coevolutionary changes. The questions considered were their origin, ways in which the early colonization of the continent took place, types and levels of the variability which developed, peculiarities of the Amerindian evolutionary processes, and eventual genetic heterogeneity which evolved in different geographical areas. Although much information is already available, it is highly heterogeneous in relation to populations and types of genetic systems investigated. Unfortunately, the present trend of favoring essentially applied research suggest that the situation will not basically improve in the future.
Collapse
Affiliation(s)
- Francisco M Salzano
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS.
| |
Collapse
|
200
|
Abstract
To gauge the processes that might direct the length of introns, I studied the balance of indels (insertions or deletions, determined using Alu and LINE1 retroposon repeats) and the density of these repeats in the introns of the human genome. The indel balance is biased in favour of deletions and correlated with the divergence of repeats. At fixed repeat divergence, the indel bias correlated with the intron size: the shorter the intron, the more deletions were favoured over insertions. This correlation with the intron size was stronger than with the gene-wide or isochore-wide parameters. The density of repeats (the number of repeats in a unit of intron length) correlated positively with the intron size. Thus, quite different mechanisms, the indel bias and the integration and/or persistence of retroposons, act in the same direction in regards to intron size, which suggests selection for the size of individual introns.
Collapse
Affiliation(s)
- Alexander E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave. 4, St Petersburg 194064, Russia.
| |
Collapse
|