1
Omole AD, Czuppon P. Maintenance of long-term transposable element activity through regulation by nonautonomous elements. Genetics 2025:iyae209. [PMID: 39810601 DOI: 10.1093/genetics/iyae209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/04/2024] [Accepted: 12/10/2024] [Indexed: 01/16/2025]
Abstract
Transposable elements are DNA sequences that can move and replicate within genomes. Broadly, there are 2 types: autonomous elements, which encode the necessary enzymes for transposition, and nonautonomous elements, which rely on the enzymes produced by autonomous elements for their transposition. Nonautonomous elements have been proposed to regulate the numbers of transposable elements, which is a possible explanation for the persistence of transposition activity over long evolutionary times. However, previous modeling studies indicate that interactions between autonomous and nonautonomous elements usually result in the extinction of one type. Here, we study a stochastic model that allows for the stable coexistence of autonomous and nonautonomous retrotransposons. We determine the conditions for this coexistence and derive an analytical expression for the stationary distribution of their copy numbers, showing that nonautonomous elements regulate stochastic fluctuations and the number of autonomous elements in stationarity. We find that the stationary variances of each element can be expressed as a function of the average copy numbers and their covariance, enabling data comparison and model validation. These results suggest that continued transposition activity of transposable elements, regulated by nonautonomous elements, is a possible evolutionary outcome that could for example explain the long coevolutionary history of autonomous LINE1 and nonautonomous Alu element transposition in the human ancestry.
Affiliation(s)
- Adekanmi Daniel Omole
  - Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany
- Peter Czuppon
  - Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany
2
Moldovan JB, Kopera HC, Liu Y, Garcia-Canadas M, Catalina P, Leone PE, Sanchez L, Kitzman JO, Kidd JM, Garcia-Perez JL, Moran JV. Variable patterns of retrotransposition in different HeLa strains provide mechanistic insights into SINE RNA mobilization processes. Nucleic Acids Res 2024; 52:7761-7779. [PMID: 38850156 PMCID: PMC11260458 DOI: 10.1093/nar/gkae448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2024] [Revised: 05/08/2024] [Accepted: 05/14/2024] [Indexed: 06/10/2024]
Abstract
Alu elements are non-autonomous Short INterspersed Elements (SINEs) derived from the 7SL RNA gene that are present at over one million copies in human genomic DNA. Alu mobilizes by a mechanism known as retrotransposition, which requires the Long INterspersed Element-1 (LINE-1) ORF2-encoded protein (ORF2p). Here, we demonstrate that HeLa strains differ in their capacity to support Alu retrotransposition. Human Alu elements retrotranspose efficiently in HeLa-HA and HeLa-CCL2 (Alu-permissive) strains, but not in HeLa-JVM or HeLa-H1 (Alu-nonpermissive) strains. A similar pattern of retrotransposition was observed for other 7SL RNA-derived SINEs and tRNA-derived SINEs. In contrast, mammalian LINE-1s, a zebrafish LINE, a human SINE-VNTR-Alu (SVA) element, and an L1 ORF1-containing mRNA can retrotranspose in all four HeLa strains. Using an in vitro reverse transcriptase-based assay, we show that Alu RNAs associate with ORF2p and are converted into cDNAs in both Alu-permissive and Alu-nonpermissive HeLa strains, suggesting that 7SL- and tRNA-derived SINEs use strategies to 'hijack' L1 ORF2p that are distinct from those used by SVA elements and ORF1-containing mRNAs. These data further suggest ORF2p associates with the Alu RNA poly(A) tract in both Alu-permissive and Alu-nonpermissive HeLa strains, but that Alu retrotransposition is blocked after this critical step in Alu-nonpermissive HeLa strains.
Affiliation(s)
- John B Moldovan
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Huira C Kopera
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Ying Liu
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Marta Garcia-Canadas
  - Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
- Paola E Leone
  - Genetics and Genomics Laboratory, SOLCA Hospital, Quito, Ecuador
- Laura Sanchez
  - Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
- Jacob O Kitzman
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
  - Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Jeffrey M Kidd
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
  - Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Jose Luis Garcia-Perez
  - Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
- John V Moran
  - Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
  - Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
3
Ustyantsev IG, Kosushkin SA, Borodulina OR, Vassetzky NS, Kramerov DA. Ere, a Family of Short Interspersed Elements in the Genomes of Odd-Toed Ungulates (Perissodactyla). Animals (Basel) 2024; 14:1982. [PMID: 38998094 PMCID: PMC11240701 DOI: 10.3390/ani14131982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/07/2024] [Revised: 07/01/2024] [Accepted: 07/03/2024] [Indexed: 07/14/2024]
Abstract
Short Interspersed Elements (SINEs) are eukaryotic retrotransposons transcribed by RNA polymerase III (pol III). Many mammalian SINEs (T+ SINEs) contain a polyadenylation signal (AATAAA), a pol III transcription terminator, and an A-rich tail in their 3'-end. The RNAs of such SINEs have the capacity for AAUAAA-dependent polyadenylation, which is unique to pol III-generated transcripts. The structure, evolution, and polyadenylation of the Ere SINE of ungulates (horses, rhinos, and tapirs) were investigated in this study. A bioinformatics analysis revealed the presence of up to ~4 × 10^5 Ere copies in representatives of all three families. These copies can be classified into two large subfamilies, EreA and EreB, the former distinguished by an additional 60 bp sequence. The 3'-end of numerous EreA and all EreB copies exhibit a 50 bp sequence designated as a terminal domain (TD). The Ere family can be further subdivided into subfamilies EreA_0TD, EreA_1TD, EreB_1TD, and EreB_2TD, depending on the presence and number of terminal domains (TDs). Only EreA_0TD copies can be assigned to T+ SINEs as they contain the AATAAA signal and the TCTTT transcription terminator. The analysis of young Ere copies identified by comparison with related perissodactyl genomes revealed that EreA_0TD and, to a much lesser extent, EreB_2TD have retained retrotranspositional activity in the recent evolution of equids and rhinoceroses. The targeted mutagenesis and transfection of HeLa cells were used to identify sequences in equine EreA_0TD that are critical for the polyadenylation of its pol III transcripts. In addition to AATAAA and the transcription terminator, two sites in the 3' half of EreA, termed the β and τ signals, were found to be essential for this process. The evolution of Ere, with a particular focus on the emergence of T+ SINEs, as well as the polyadenylation signals are discussed in comparison with other T+ SINEs.
Affiliation(s)
- Ilia G. Ustyantsev
  - Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
- Sergey A. Kosushkin
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
- Olga R. Borodulina
  - Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
- Nikita S. Vassetzky
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
- Dmitri A. Kramerov
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
4
Moldovan JB, Kopera HC, Liu Y, Garcia-Canadas M, Catalina P, Leone PE, Sanchez L, Kitzman JO, Kidd JM, Garcia-Perez JL, Moran JV. Variable patterns of retrotransposition in different HeLa strains provide mechanistic insights into SINE RNA mobilization processes. bioRxiv 2024:2024.05.03.592410. [PMID: 38746229 PMCID: PMC11092746 DOI: 10.1101/2024.05.03.592410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/16/2024]
Abstract
Alu elements are non-autonomous Short INterspersed Elements (SINEs) derived from the 7SL RNA gene that are present at over one million copies in human genomic DNA. Alu mobilizes by a mechanism known as retrotransposition, which requires the Long INterspersed Element-1 (LINE-1 or L1) ORF2-encoded protein (ORF2p). Here, we demonstrate that HeLa strains differ in their capacity to support Alu retrotransposition. Human Alu elements retrotranspose efficiently in HeLa-HA and HeLa-CCL2 (Alu-permissive) strains, but not in HeLa-JVM or HeLa-H1 (Alu-nonpermissive) strains. A similar pattern of retrotransposition was observed for other 7SL RNA-derived SINEs and tRNA-derived SINEs. In contrast, mammalian LINE-1s, a zebrafish LINE, a human SINE-VNTR-Alu (SVA) element, and an L1 ORF1-containing messenger RNA can retrotranspose in all four HeLa strains. Using an in vitro reverse transcriptase-based assay, we show that Alu RNAs associate with ORF2p and are converted into cDNAs in both Alu-permissive and Alu-nonpermissive HeLa strains, suggesting that 7SL- and tRNA-derived SINE RNAs use strategies to 'hijack' L1 ORF2p that are distinct from those used by SVA elements and ORF1-containing mRNAs. These data further suggest ORF2p associates with the Alu RNA poly(A) tract in both Alu-permissive and Alu-nonpermissive HeLa strains, but that Alu retrotransposition is blocked after this critical step in Alu-nonpermissive HeLa strains.
5
Lawson HA, Liang Y, Wang T. Transposable elements in mammalian chromatin organization. Nat Rev Genet 2023; 24:712-723. [PMID: 37286742 DOI: 10.1038/s41576-023-00609-6] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Accepted: 04/24/2023] [Indexed: 06/09/2023]
Abstract
Transposable elements (TEs) are mobile DNA elements that comprise almost 50% of mammalian genomic sequence. TEs are capable of making additional copies of themselves that integrate into new positions in host genomes. This unique property has had an important impact on mammalian genome evolution and on the regulation of gene expression because TE-derived sequences can function as cis-regulatory elements such as enhancers, promoters and silencers. Now, advances in our ability to identify and characterize TEs have revealed that TE-derived sequences also regulate gene expression by both maintaining and shaping 3D genome architecture. Studies are revealing how TEs contribute raw sequence that can give rise to the structures that shape chromatin organization, and thus gene expression, allowing for species-specific genome innovation and evolutionary novelty.
Affiliation(s)
- Heather A Lawson
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
- Yonghao Liang
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
  - Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO, USA
- Ting Wang
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
  - Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO, USA
  - McDonnell Genome Institute, Washington University School of Medicine, Saint Louis, MO, USA
6
Cao J, Yu T, Xu B, Hu Z, Zhang XO, Theurkauf W, Weng Z. Epigenetic and chromosomal features drive transposon insertion in Drosophila melanogaster. Nucleic Acids Res 2023; 51:2066-2086. [PMID: 36762470 PMCID: PMC10018349 DOI: 10.1093/nar/gkad054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/02/2022] [Revised: 01/12/2023] [Accepted: 02/07/2023] [Indexed: 02/11/2023]
Abstract
Transposons are mobile genetic elements prevalent in the genomes of most species. The distribution of transposons within a genome reflects the actions of two opposing processes: initial insertion site selection, and selective pressure from the host. By analyzing whole-genome sequencing data from transposon-activated Drosophila melanogaster, we identified 43 316 de novo and 237 germline insertions from four long-terminal-repeat (LTR) transposons, one LINE transposon (I-element), and one DNA transposon (P-element). We found that all transposon types favored insertion into promoters de novo, but otherwise displayed distinct insertion patterns. De novo and germline P-element insertions preferred replication origins, often landing in a narrow region around transcription start sites and in regions of high chromatin accessibility. De novo LTR transposon insertions preferred regions with high H3K36me3, promoters and exons of active genes; within genes, LTR insertion frequency correlated with gene expression. De novo I-element insertion density increased with distance from the centromere. Germline I-element and LTR transposon insertions were depleted in promoters and exons, suggesting strong selective pressure to remove transposons from functional elements. Transposon movement is associated with genome evolution and disease; therefore, our results can improve our understanding of genome and disease biology.
Affiliation(s)
- Jichuan Cao
  - The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
- Tianxiong Yu
  - Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Bo Xu
  - The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
- Zhongren Hu
  - The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
- Xiao-ou Zhang
  - The School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
- William E Theurkauf
  - Program in Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Zhiping Weng
  - Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
7
Kosushkin SA, Ustyantsev IG, Borodulina OR, Vassetzky NS, Kramerov DA. Tail Wags Dog’s SINE: Retropositional Mechanisms of Can SINE Depend on Its A-Tail Structure. Biology (Basel) 2022; 11:biology11101403. [PMID: 36290307 PMCID: PMC9599045 DOI: 10.3390/biology11101403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/28/2022] [Revised: 09/17/2022] [Accepted: 09/22/2022] [Indexed: 11/25/2022]
Abstract
Simple Summary: The genomes of higher organisms including humans are invaded by millions of repetitive elements (transposons), which can sometimes be deleterious or beneficial for hosts. Many aspects of the mechanisms underlying the expansion of transposons in the genomes remain unclear. Short retrotransposons (SINEs) are one of the most abundant classes of genomic repeats. Their amplification relies on two major processes: transcription and reverse transcription. Here, short retrotransposons of dogs and other canids called Can SINE were analyzed. Their amplification was extraordinarily active in the wolf and, particularly, dog breeds relative to other canids. We also studied a variation of their transcription mechanism involving the polyadenylation of transcripts. An analysis of specific signals involved in this process allowed us to conclude that Can SINEs could alternate amplification with and without polyadenylation in their evolution. Understanding the mechanisms of transposon replication can shed light on the mechanisms of genome function.
Abstract: SINEs, non-autonomous short retrotransposons, are widespread in mammalian genomes. Their transcripts are generated by RNA polymerase III (pol III). Transcripts of certain SINEs can be polyadenylated, which requires polyadenylation and pol III termination signals in their sequences. Our sequence analysis divided Can SINEs in canids into four subfamilies, older a1 and a2 and younger b1 and b2. Can_b2 and to a lesser extent Can_b1 remained retrotranspositionally active, while the amplification of Can_a1 and Can_a2 ceased long ago. An extraordinarily high Can amplification was revealed in different dog breeds. Functional polyadenylation signals were analyzed in Can subfamilies, particularly in fractions of recently amplified, i.e., active copies. The transcription of various Can constructs transfected into HeLa cells proposed AATAAA and (TC)n as functional polyadenylation signals. Our analysis indicates that older Can subfamilies (a1, a2, and b1) with an active transcription terminator were amplified by the T+ mechanism (with polyadenylation of pol III transcripts). In the currently active Can_b2 subfamily, the amplification mechanisms with (T+) and without the polyadenylation of pol III transcripts (T−) irregularly alternate. The active transcription terminator tends to shorten, which renders it nonfunctional and favors a switch to the T− retrotransposition. The activity of a truncated terminator is occasionally restored by its elongation, which rehabilitates the T+ retrotransposition for a particular SINE copy.
8
Analysis of SINE Families B2, Dip, and Ves with Special Reference to Polyadenylation Signals and Transcription Terminators. Int J Mol Sci 2021; 22:ijms22189897. [PMID: 34576060 PMCID: PMC8466645 DOI: 10.3390/ijms22189897] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 08/16/2021] [Revised: 09/05/2021] [Accepted: 09/06/2021] [Indexed: 01/09/2023]
Abstract
Short Interspersed Elements (SINEs) are eukaryotic non-autonomous retrotransposons transcribed by RNA polymerase III (pol III). The 3′-terminus of many mammalian SINEs has a polyadenylation signal (AATAAA), pol III transcription terminator, and A-rich tail. The RNAs of such SINEs can be polyadenylated, which is unique for pol III transcripts. Here, B2 (mice and related rodents), Dip (jerboas), and Ves (vespertilionid bats) SINE families were thoroughly studied. They were divided into subfamilies reliably distinguished by relatively long indels. The age of SINE subfamilies can be estimated, which allows us to reconstruct their evolution. The youngest and most active variants of SINE subfamilies were given special attention. The shortest pol III transcription terminators are TCTTT (B2), TATTT (Ves and Dip), and the rarer TTTT. The last nucleotide of the terminator is often not transcribed; accordingly, the truncated terminator of its descendant becomes nonfunctional. The incidence of complete transcription of the TCTTT terminator is twice higher compared to TTTT and thus functional terminators are more likely preserved in daughter SINE copies. Young copies have long poly(A) tails; however, they gradually shorten in host generations. Unexpectedly, the tail shortening below A10 increases the incidence of terminator elongation by Ts thus restoring its efficiency. This process can be critical for the maintenance of SINE activity in the genome.
9
Chen D, Cremona MA, Qi Z, Mitra RD, Chiaromonte F, Makova KD. Human L1 Transposition Dynamics Unraveled with Functional Data Analysis. Mol Biol Evol 2021; 37:3576-3600. [PMID: 32722770 DOI: 10.1093/molbev/msaa194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/14/2022]
Abstract
Long INterspersed Elements-1 (L1s) constitute >17% of the human genome and still actively transpose in it. Characterizing L1 transposition across the genome is critical for understanding genome evolution and somatic mutations. However, to date, L1 insertion and fixation patterns have not been studied comprehensively. To fill this gap, we investigated three genome-wide data sets of L1s that integrated at different evolutionary times: 17,037 de novo L1s (from an L1 insertion cell-line experiment conducted in-house), and 1,212 polymorphic and 1,205 human-specific L1s (from public databases). We characterized 49 genomic features (proxying chromatin accessibility, transcriptional activity, replication, recombination, etc.) in the ±50 kb flanks of these elements. These features were contrasted between the three L1 data sets and L1-free regions using state-of-the-art Functional Data Analysis statistical methods, which treat high-resolution data as mathematical functions. Our results indicate that de novo, polymorphic, and human-specific L1s are surrounded by different genomic features acting at specific locations and scales. This led to an integrative model of L1 transposition, according to which L1s preferentially integrate into open-chromatin regions enriched in non-B DNA motifs, whereas they are fixed in regions largely free of purifying selection (depleted of genes and noncoding most conserved elements). Intriguingly, our results suggest that L1 insertions modify local genomic landscape by extending CpG methylation and increasing mononucleotide microsatellite density. Altogether, our findings substantially facilitate understanding of L1 integration and fixation preferences, pave the way for uncovering their role in aging and cancer, and inform their use as mutagenesis tools in genetic studies.
Affiliation(s)
- Di Chen
  - Intercollege Graduate Degree Program in Genetics, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA
- Marzia A Cremona
  - Department of Statistics, The Pennsylvania State University, University Park, PA
  - Department of Operations and Decision Systems, Université Laval, Québec, Canada
- Zongtai Qi
  - Department of Genetics and Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO
- Robi D Mitra
  - Department of Genetics and Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO
- Francesca Chiaromonte
  - Department of Statistics, The Pennsylvania State University, University Park, PA
  - EMbeDS, Sant'Anna School of Advanced Studies, Pisa, Italy
  - The Huck Institutes of the Life Sciences, Center for Medical Genomics, The Pennsylvania State University, University Park, PA
- Kateryna D Makova
  - The Huck Institutes of the Life Sciences, Center for Medical Genomics, The Pennsylvania State University, University Park, PA
  - Department of Biology, The Pennsylvania State University, University Park, PA
10
Ustyantsev IG, Borodulina OR, Kramerov DA. Identification of nucleotide sequences and some proteins involved in polyadenylation of RNA transcribed by Pol III from SINEs. RNA Biol 2020; 18:1475-1488. [PMID: 33258402 DOI: 10.1080/15476286.2020.1857942] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 10/22/2022]
Abstract
We have previously reported that not only transcripts of RNA polymerase II (pol II), but also one type of RNA transcribed by RNA polymerase III (pol III), undergo AAUAAA-dependent polyadenylation. Such an unusual feature is inherent in Short Interspersed Elements (SINEs) from genomes of certain mammals. For polyadenylation of its transcript, SINE should contain, besides an AATAAA hexamer and a transcription terminator, two specific regions: β, located downstream of box B of a promoter, and τ, preceding AATAAA. Here, using nucleotide substitutions in SINEs B2 (mouse) and Ves (bat), we identified nucleotides of β regions necessary for polyadenylation of their transcripts. These sequences (β signals) are the following: ACCACATgg in B2 and GGGCATGT in Ves. Using this approach, we identified τ signal of SINE B2 (GCTACagTGTACTTACAT), where TGTA tetramer is most important for polyadenylation. In Ves, τ region is a long polypyrimidine motif which is able to interact with PTB protein in Ves transcripts. We demonstrated by knockdown that B2 and Ves transcript polyadenylation is performed by canonical poly(A) polymerase with the participation of proteins CPSF-160 and Fip1, the known factors of mRNA polyadenylation. We also showed that a factor CFIm partaking in polyadenylation of many mRNAs, is involved only in polyadenylation of B2 transcripts. CFIm seems to interact with τ signal of B2 RNA and thereby facilitates the recruiting of other proteins engaged in polyadenylation. Thus, SINEs utilize at least some proteins involved in polyadenylation of pol II transcripts to polyadenylate their pol III transcripts.
Affiliation(s)
- Ilia G Ustyantsev
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Olga R Borodulina
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Dmitri A Kramerov
  - Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
11
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. Plant J 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Affiliation(s)
- Anja Kögler
  - Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
- Kathrin M Seibt
  - Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
- Tony Heitkam
  - Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
- Kristin Morgenstern
  - Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
- Birgit Reiche
  - Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
- Heino Wolf
  - Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
- Doris Krabel
  - Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
- Thomas Schmidt
  - Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
12
Lu JY, Shao W, Chang L, Yin Y, Li T, Zhang H, Hong Y, Percharde M, Guo L, Wu Z, Liu L, Liu W, Yan P, Ramalho-Santos M, Sun Y, Shen X. Genomic Repeats Categorize Genes with Distinct Functions for Orchestrated Regulation. Cell Rep 2020; 30:3296-3311.e5. [PMID: 32160538 PMCID: PMC7195444 DOI: 10.1016/j.celrep.2020.02.048] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Received: 03/11/2019] [Revised: 11/11/2019] [Accepted: 02/10/2020] [Indexed: 11/06/2022]
Abstract
Repetitive elements are abundantly distributed in mammalian genomes. Here, we reveal a striking association between repeat subtypes and gene function. SINE, L1, and low-complexity repeats demarcate distinct functional categories of genes and may dictate the time and level of gene expression by providing binding sites for different regulatory proteins. Importantly, imaging and sequencing analysis show that L1 repeats sequester a large set of genes with specialized functions in nucleolus- and lamina-associated inactive domains that are depleted of SINE repeats. In addition, L1 transcripts bind extensively to its DNA in embryonic stem cells (ESCs). Depletion of L1 RNA in ESCs leads to relocation of L1-enriched chromosomal segments from inactive domains to the nuclear interior and de-repression of L1-associated genes. These results demonstrate a role of L1 DNA and RNA in gene silencing and suggest a general theme of genomic repeats in orchestrating the function, regulation, and expression of their host genes.
Affiliation(s)
- J Yuyang Lu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Wen Shao
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Chang
- State Key Laboratory of Membrane Biology, School of Life Sciences, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Yafei Yin
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Tong Li
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Hui Zhang
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Yantao Hong
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Michelle Percharde
- MRC London Institute of Medical Sciences (LMS), London W120NN, UK; Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London W120NN, UK
| | - Lerui Guo
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Zhongyang Wu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lichao Liu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Wei Liu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Pixi Yan
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Miguel Ramalho-Santos
- Lunenfeld-Tanenbaum Research Institute and Department of Molecular Genetics, University of Toronto, Toronto, ON M5T 3H7, Canada
| | - Yujie Sun
- State Key Laboratory of Membrane Biology, School of Life Sciences, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Xiaohua Shen
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
|
13
|
Gagnier L, Belancio VP, Mager DL. Mouse germ line mutations due to retrotransposon insertions. Mob DNA 2019; 10:15. [PMID: 31011371 PMCID: PMC6466679 DOI: 10.1186/s13100-019-0157-4] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Received: 01/29/2019] [Accepted: 04/01/2019] [Indexed: 12/24/2022] Open
Abstract
Transposable element (TE) insertions are responsible for a significant fraction of spontaneous germ line mutations reported in inbred mouse strains. This major contribution of TEs to the mutational landscape in mouse contrasts with the situation in human, where their relative contribution as germ line insertional mutagens is much lower. In this focussed review, we provide comprehensive lists of TE-induced mouse mutations, discuss the different TE types involved in these insertional mutations and elaborate on particularly interesting cases. We also discuss differences and similarities between the mutational role of TEs in mice and humans.
Affiliation(s)
- Liane Gagnier
- Terry Fox Laboratory, BC Cancer and Department of Medical Genetics, University of British Columbia, V5Z1L3, Vancouver, BC Canada
| | - Victoria P. Belancio
- Department of Structural and Cellular Biology, Tulane University School of Medicine, Tulane Cancer Center, Tulane Center for Aging, New Orleans, LA 70112 USA
| | - Dixie L. Mager
- Terry Fox Laboratory, BC Cancer and Department of Medical Genetics, University of British Columbia, V5Z1L3, Vancouver, BC Canada
| |
|
14
|
Sultana T, van Essen D, Siol O, Bailly-Bechet M, Philippe C, Zine El Aabidine A, Pioger L, Nigumann P, Saccani S, Andrau JC, Gilbert N, Cristofari G. The Landscape of L1 Retrotransposons in the Human Genome Is Shaped by Pre-insertion Sequence Biases and Post-insertion Selection. Mol Cell 2019; 74:555-570.e7. [PMID: 30956044 DOI: 10.1016/j.molcel.2019.02.036] [Citation(s) in RCA: 88] [Impact Index Per Article: 14.7] [Received: 08/01/2018] [Revised: 01/28/2019] [Accepted: 02/25/2019] [Indexed: 01/10/2023]
Abstract
L1 retrotransposons are transposable elements and major contributors of genetic variation in humans. Where L1 integrates into the genome can directly impact human evolution and disease. Here, we experimentally induced L1 retrotransposition in cells and mapped integration sites at nucleotide resolution. At local scales, L1 integration is mostly restricted by genome sequence biases and the specificity of the L1 machinery. At regional scales, L1 shows a broad capacity for integration into all chromatin states, in contrast to other known mobile genetic elements. However, integration is influenced by the replication timing of target regions, suggesting a link to host DNA replication. The distribution of new L1 integrations differs from those of preexisting L1 copies, which are significantly reshaped by natural selection. Our findings reveal that the L1 machinery has evolved to efficiently target all genomic regions and underline a predominant role for post-integrative processes on the distribution of endogenous L1 elements.
Affiliation(s)
- Tania Sultana
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | | | - Oliver Siol
- Institut de Génétique Humaine, University of Montpellier, CNRS, Montpellier, France
| | | | | | - Amal Zine El Aabidine
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Léo Pioger
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Pilvi Nigumann
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | - Simona Saccani
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | - Jean-Christophe Andrau
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Nicolas Gilbert
- Institut de Génétique Humaine, University of Montpellier, CNRS, Montpellier, France; Institut de Médecine Régénératrice et de Biothérapie, Inserm U1183, CHU Montpellier, Montpellier, France
| | | |
|
15
|
Kent TV, Uzunović J, Wright SI. Coevolution between transposable elements and recombination. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0458. [PMID: 29109221 DOI: 10.1098/rstb.2016.0458] [Citation(s) in RCA: 169] [Impact Index Per Article: 24.1] [Accepted: 04/18/2017] [Indexed: 12/24/2022] Open
Abstract
One of the most striking patterns of genome structure is the tight, typically negative, association between transposable elements (TEs) and meiotic recombination rates. While this is a highly recurring feature of eukaryotic genomes, the mechanisms driving correlations between TEs and recombination remain poorly understood, and distinguishing cause versus effect is challenging. Here, we review the evidence for a relation between TEs and recombination, and discuss the underlying evolutionary forces. Evidence to date suggests that overall TE densities correlate negatively with recombination, but the strength of this correlation varies across element types, and the pattern can be reversed. Results suggest that heterogeneity in the strength of selection against ectopic recombination and gene disruption can drive TE accumulation in regions of low recombination, but there is also strong evidence that the regulation of TEs can influence local recombination rates. We hypothesize that TE insertion polymorphism may be important in driving within-species variation in recombination rates in surrounding genomic regions. Furthermore, the interaction between TEs and recombination may create positive feedback, whereby TE accumulation in non-recombining regions contributes to the spread of recombination suppression. Further investigation of the coevolution between recombination and TEs has important implications for our understanding of the evolution of recombination rates and genome structure. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Tyler V Kent
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| | - Jasmina Uzunović
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| |
|
16
|
Ade CM, Derbes RS, Wagstaff BJ, Linker SB, White TB, Deharo D, Belancio VP, Ivics Z, Roy-Engel AM. Evaluating different DNA binding domains to modulate L1 ORF2p-driven site-specific retrotransposition events in human cells. Gene 2017; 642:188-198. [PMID: 29154869 DOI: 10.1016/j.gene.2017.11.033] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 11/06/2017] [Accepted: 11/11/2017] [Indexed: 12/28/2022]
Abstract
DNA binding domains (DBDs) have been used with great success to impart targeting capabilities to a variety of proteins, creating highly useful genomic tools. We evaluated the ability of five types of DBDs and strategies (AAV Rep proteins, Cre, TAL effectors, zinc finger proteins, and Cas9/gRNA system) to target the L1 ORF2 protein to drive retrotransposition of Alu inserts to specific sequences in the human genome. First, we find that the L1 ORF2 protein tolerates the addition of protein domains at both the amino- and carboxy-terminus. Although in some instances retrotransposition efficiencies slightly diminished, all fusion proteins containing an intact ORF2 were capable of driving retrotransposition. Second, the stability of individual ORF2 fusion proteins varies and is difficult to predict. Third, DBDs that require the formation of multimers for target recognition are unlikely to modify targeting of ORF2p-driven insertions. Fourth, the more components needed to assemble into a complex to drive targeted retrotransposition, the less likely the strategy will increase targeted insertions. Fifth, the abundance of target sequences present in the genome will likely dictate the effectiveness and efficiency of targeted insertions. Lastly, the cleavage capabilities of Cas9 (or a Cas9 nickase variant) are unable to substitute for the L1 ORF2 endonuclease domain functions, suggesting that the endonuclease domain has alternate functions needed for retrotransposition. From these studies, we conclude that the most critical component for the modification of the human L1 ORF2 protein to drive targeted insertions is the selection of the DBD, due to the varying functional requirements and impacts on protein stability.
Affiliation(s)
- Catherine M Ade
- Department of Cellular and Molecular Biology, Tulane University, USA
| | - Rebecca S Derbes
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University Health Sciences Center and LCRC, 1700 Tulane Ave., New Orleans, LA 70112, USA
| | - Bradley J Wagstaff
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University Health Sciences Center and LCRC, 1700 Tulane Ave., New Orleans, LA 70112, USA
| | - Sara B Linker
- Laboratory of Genetics, The Salk Institute for Biological Studies, 10010 N Torrey Pines Road, La Jolla, CA 92037-1002, USA
| | - Travis B White
- Sloan Kettering Institute for Cancer Research, New York, NY 10065, USA
| | - Dawn Deharo
- Department of Structural and Cellular Biology, Tulane University School of Medicine, Tulane Cancer Center, Tulane Center for Aging, New Orleans, LA 70112, USA
| | - Victoria P Belancio
- Department of Structural and Cellular Biology, Tulane University School of Medicine, Tulane Cancer Center, Tulane Center for Aging, New Orleans, LA 70112, USA
| | - Zoltán Ivics
- Division of Medical Biotechnology, Paul-Ehrlich-Institute, Langen, Germany
| | - Astrid M Roy-Engel
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University Health Sciences Center and LCRC, 1700 Tulane Ave., New Orleans, LA 70112, USA.
| |
|
17
|
Integration site selection by retroviruses and transposable elements in eukaryotes. Nat Rev Genet 2017; 18:292-308. [PMID: 28286338 DOI: 10.1038/nrg.2017.7] [Citation(s) in RCA: 153] [Impact Index Per Article: 19.1] [Indexed: 12/12/2022]
Abstract
Transposable elements and retroviruses are found in most genomes, can be pathogenic and are widely used as gene-delivery and functional genomics tools. Exploring whether these genetic elements target specific genomic sites for integration and how this preference is achieved is crucial to our understanding of genome evolution, somatic genome plasticity in cancer and ageing, host-parasite interactions and genome engineering applications. High-throughput profiling of integration sites by next-generation sequencing, combined with large-scale genomic data mining and cellular or biochemical approaches, has revealed that the insertions are usually non-random. The DNA sequence, chromatin and nuclear context, and cellular proteins cooperate in guiding integration in eukaryotic genomes, leading to a remarkable diversity of insertion site distribution and evolutionary strategies.
|
18
|
Evsikov AV, Marín de Evsikova C. Friend or Foe: Epigenetic Regulation of Retrotransposons in Mammalian Oogenesis and Early Development. The Yale Journal of Biology and Medicine 2016; 89:487-497. [PMID: 28018140 PMCID: PMC5168827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
Abstract
Epigenetics is the study of phenotypic variation arising from developmental and environmental factors regulating gene transcription at molecular, cellular, and physiological levels. A naturally occurring biological process driven by epigenetics is the egg-to-embryo developmental transition, when two fully differentiated adult cells - egg and sperm - revert to an early stem cell type with totipotency but subsequently differentiate into pluripotent embryonic stem cells that give rise to any cell type. Transposable elements (TEs) are active in mammalian oocytes and early embryos, and this activity, albeit counterintuitive because TEs can lead to genomic instability in somatic cells, correlates with successful development. TEs bridge genetic and epigenetic landscapes because TEs are genetic elements whose silencing and de-repression are regulated by epigenetic mechanisms that are sensitive to environmental factors. Ultimately, transposition events can change the size, content, and function of mammalian genomes. Thus, TEs act beyond mutagenic agents reshuffling the genomes, and epigenetic regulation of TEs may act as a proximate mechanism by which evolutionary forces increase a species' hidden reserve of epigenetic and phenotypic variability, facilitating the adaptation of genomes to their environment.
Affiliation(s)
- Alexei V. Evsikov
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, 12901 Bruce B Downs Blvd., MDC07, Tampa, FL 33612. Corresponding author; tel. (813) 974 6922, fax 813-974-7357
| | - Caralina Marín de Evsikova
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, 12901 Bruce B Downs Blvd., MDC07, Tampa, FL 33612. Corresponding author; tel. (813) 974 2248, fax 813-974-7357
| |
|
19
|
Muñoz-Lopez M, Vilar-Astasio R, Tristan-Ramos P, Lopez-Ruiz C, Garcia-Pérez JL. Study of Transposable Elements and Their Genomic Impact. Methods Mol Biol 2016; 1400:1-19. [PMID: 26895043 DOI: 10.1007/978-1-4939-3372-3_1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Indexed: 12/18/2022]
Abstract
Transposable elements (TEs) have been considered traditionally as junk DNA, i.e., DNA sequences that despite representing a high proportion of genomes had no evident cellular functions. However, over the last decades, it has become undeniable that not only TE-derived DNA sequences have (and had) a fundamental role during genome evolution, but also TEs have important implications in the origin and evolution of many genomic disorders. This concise review provides a brief overview of the different types of TEs that can be found in genomes, as well as a list of techniques and methods used to study their impact and mobilization. Some of these techniques will be covered in detail in this Method Book.
Affiliation(s)
- Martin Muñoz-Lopez
- Department of Human DNA Variability, Pfizer/University of Granada and Andalusian Regional Government Center for Genomics and Oncology (GENYO), Avda Ilustracion 114, PTS Granada, 18016, Granada, Spain.
| | - Raquel Vilar-Astasio
- Department of Human DNA Variability, Pfizer/University of Granada and Andalusian Regional Government Center for Genomics and Oncology (GENYO), Avda Ilustracion 114, PTS Granada, 18016, Granada, Spain
| | - Pablo Tristan-Ramos
- Department of Human DNA Variability, Pfizer/University of Granada and Andalusian Regional Government Center for Genomics and Oncology (GENYO), Avda Ilustracion 114, PTS Granada, 18016, Granada, Spain
| | - Cesar Lopez-Ruiz
- Department of Human DNA Variability, Pfizer/University of Granada and Andalusian Regional Government Center for Genomics and Oncology (GENYO), Avda Ilustracion 114, PTS Granada, 18016, Granada, Spain
| | - Jose L Garcia-Pérez
- Genyo (Center for Genomics and Oncological Research), Pfizer/Universidad de Granada/Junta de Andalucia, PTS Granada, Spain; Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Edinburgh, UK
| |
|
20
|
Conserved 3' UTR stem-loop structure in L1 and Alu transposons in human genome: possible role in retrotransposition. BMC Genomics 2016; 17:992. [PMID: 27914481 PMCID: PMC5135761 DOI: 10.1186/s12864-016-3344-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 07/20/2016] [Accepted: 11/25/2016] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND In the process of retrotransposition, LINEs use their own machinery for copying and inserting themselves into new genomic locations, while SINEs are parasitic and require the machinery of LINEs. The exact mechanism of how a LINE-encoded reverse transcriptase (RT) recognizes its own and SINE RNA remains unclear. However, it was shown for the stringent-type LINEs that recognition of a stem-loop at the 3'UTR by RT is essential for retrotransposition. For the relaxed-type LINEs it is believed that the poly-A tail is a common recognition element between LINE and SINE RNA. However, polyadenylation is a property of any messenger RNA, and how the LINE RT recognizes transposon and non-transposon RNAs remains an open question. It is likely that RNA secondary structures play an important role in RNA recognition by LINE-encoded proteins. RESULTS Here we selected a set of L1 and Alu elements from the human genome and investigated their sequences for the presence of position-specific stem-loop structures. We found highly conserved stem-loop positions at the 3'UTR. Comparative structural analyses of a human L1 3'UTR stem-loop showed a similarity to 3'UTR stem-loops of the stringent-type LINEs, which were experimentally shown to be recognized by LINE RT. The consensus stem-loop structure consists of a 5-7 bp loop and an 8-10 bp stem with a bulge at a distance of 4-6 bp from the loop. The results show that a stem-loop with a bulge exists at the 3'-end of Alu. We also found conserved stem-loop positions at the 5'UTR and at the end of ORF2 and discuss their possible role. CONCLUSIONS Here we present evidence for the presence of a highly conserved 3'UTR stem-loop structure in L1 and Alu retrotransposons in the human genome. Both stem-loops show structural similarity to the stem-loops of the stringent-type LINEs experimentally confirmed as essential for retrotransposition. Here we hypothesize that both L1 and Alu RNA are recognized by L1 RT via the 3'-end RNA stem-loop structure. Other conserved stem-loop positions in L1 suggest possible functions in protein-RNA interactions, but to date no experimental evidence has been reported.
|
21
|
Campos-Sánchez R, Cremona MA, Pini A, Chiaromonte F, Makova KD. Integration and Fixation Preferences of Human and Mouse Endogenous Retroviruses Uncovered with Functional Data Analysis. PLoS Comput Biol 2016; 12:e1004956. [PMID: 27309962 PMCID: PMC4911145 DOI: 10.1371/journal.pcbi.1004956] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Received: 01/15/2016] [Accepted: 04/29/2016] [Indexed: 01/24/2023] Open
Abstract
Endogenous retroviruses (ERVs), the remnants of retroviral infections in the germ line, occupy ~8% and ~10% of the human and mouse genomes, respectively, and affect their structure, evolution, and function. Yet we still have a limited understanding of how the genomic landscape influences integration and fixation of ERVs. Here we conducted a genome-wide study of the most recently active ERVs in the human and mouse genome. We investigated 826 fixed and 1,065 in vitro HERV-Ks in human, and 1,624 fixed and 242 polymorphic ETns, as well as 3,964 fixed and 1,986 polymorphic IAPs, in mouse. We quantitated >40 human and mouse genomic features (e.g., non-B DNA structure, recombination rates, and histone modifications) in ±32 kb of these ERVs' integration sites and in control regions, and analyzed them using Functional Data Analysis (FDA) methodology. In one of the first applications of FDA in genomics, we identified genomic scales and locations at which these features display their influence, and how they work in concert, to provide signals essential for integration and fixation of ERVs. The investigation of ERVs of different evolutionary ages (young in vitro and polymorphic ERVs, older fixed ERVs) allowed us to disentangle integration vs. fixation preferences. As a result of these analyses, we built a comprehensive model explaining the uneven distribution of ERVs along the genome. We found that ERVs integrate in late-replicating AT-rich regions with abundant microsatellites, mirror repeats, and repressive histone marks. Regions favoring fixation are depleted of genes and evolutionarily conserved elements, and have low recombination rates, reflecting the effects of purifying selection and ectopic recombination removing ERVs from the genome. In addition to providing these biological insights, our study demonstrates the power of exploiting multiple scales and localization with FDA. These powerful techniques are expected to be applicable to many other genomic investigations.
Affiliation(s)
- Rebeca Campos-Sánchez
- Genetics Graduate Program, The Huck Institutes of the Life Sciences, Penn State University, University Park, Pennsylvania, United States of America
| | - Marzia A. Cremona
- MOX—Modeling and Scientific Computing, Department of Mathematics, Politecnico di Milano, Milano, Italy
- Department of Statistics, Penn State University, University Park, Pennsylvania, United States of America
| | - Alessia Pini
- MOX—Modeling and Scientific Computing, Department of Mathematics, Politecnico di Milano, Milano, Italy
| | - Francesca Chiaromonte
- Department of Statistics, Penn State University, University Park, Pennsylvania, United States of America
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, Pennsylvania, United States of America
| | - Kateryna D. Makova
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, Pennsylvania, United States of America
- Department of Biology, Penn State University, University Park, Pennsylvania, United States of America
| |
|
22
|
Ade C, Roy-Engel AM. SINE Retrotransposition: Evaluation of Alu Activity and Recovery of De Novo Inserts. Methods Mol Biol 2016; 1400:183-201. [PMID: 26895055 DOI: 10.1007/978-1-4939-3372-3_13] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 12/03/2022]
Abstract
Mobile element activity is of great interest due to its impact on genomes. However, the types of mobile elements that inhabit any given genome are remarkably varied. Among the different varieties of mobile elements, the Short Interspersed Elements (SINEs) populate many genomes, including those of many mammalian species. Although SINEs are parasites of Long Interspersed Elements (LINEs), SINEs have been highly successful in both the primate and rodent genomes. When comparing copy numbers in mammals, SINEs have been vastly more successful than other nonautonomous elements, such as the retropseudogenes and SVA. Interestingly, in the human genome the copy number of Alu (a primate SINE) outnumbers LINE-1 (L1) copies 2 to 1. Estimates suggest that the retrotransposition rate for Alu is tenfold higher than that of LINE-1, with about 1 insert in every twenty births. Furthermore, Alu-induced mutagenesis is responsible for the majority of the documented instances of human retroelement insertion-induced disease. However, little is known about what contributes to these observed differences between SINEs and LINEs. The development of an assay to monitor SINE retrotransposition in culture has become an important tool for the elucidation of some of these differences. In this chapter, we present details of the SINE retrotransposition assay and the recovery of de novo inserts. We also focus on the nuances that are unique to the SINE assay.
Affiliation(s)
- Catherine Ade
- Department of Epidemiology, Tulane Cancer Center, SL-66, Tulane University Health Sciences Center, 1430 Tulane Ave., New Orleans, LA, 70112, USA
| | - Astrid M Roy-Engel
- Department of Epidemiology, Tulane Cancer Center, SL-66, Tulane University Health Sciences Center, 1430 Tulane Ave., New Orleans, LA, 70112, USA.
| |
|
23
|
Polyadenylation of RNA transcribed from mammalian SINEs by RNA polymerase III: Complex requirements for nucleotide sequences. Biochimica et Biophysica Acta - Gene Regulatory Mechanisms 2015; 1859:355-65. [PMID: 26700565 DOI: 10.1016/j.bbagrm.2015.12.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Received: 09/08/2015] [Revised: 12/09/2015] [Accepted: 12/11/2015] [Indexed: 01/08/2023]
Abstract
It is generally accepted that only transcripts synthesized by RNA polymerase II (e.g., mRNA) are subject to AAUAAA-dependent polyadenylation. However, we previously showed that RNA transcribed by RNA polymerase III (pol III) from mouse B2 SINE could be polyadenylated in an AAUAAA-dependent manner. Many species of mammalian SINEs end with the pol III transcriptional terminator (TTTTT) and contain AATAAA hexamers in their A-rich tail. Such SINEs were united into Class T(+), whereas SINEs lacking the terminator and AATAAA sequences were classified as T(-). Here we studied the structural features of SINE pol III transcripts that are necessary for their polyadenylation. Eight and six SINE families from classes T(+) and T(-), respectively, were analyzed. The replacement of AATAAA with AACAAA in T(+) SINEs abolished the RNA polyadenylation. Interestingly, insertion of the polyadenylation signal (AATAAA) and pol III transcription terminator in T(-) SINEs did not result in polyadenylation. The detailed analysis of three T(+) SINEs (B2, DIP, and VES) revealed areas important for the polyadenylation of their pol III transcripts: the polyadenylation signal and terminator in the A-rich tail, the β region positioned immediately downstream of the box B of the pol III promoter, and the τ region located upstream of the tail. In DIP and VES (but not in B2), the τ region is a polypyrimidine motif which is also characteristic of many other T(+) SINEs. Most likely, SINEs of different mammals acquired these structural features independently as a result of parallel evolution.
|
24
|
Konkel MK, Walker JA, Hotard AB, Ranck MC, Fontenot CC, Storer J, Stewart C, Marth GT, Batzer MA. Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project. Genome Biol Evol 2015; 7:2608-22. [PMID: 26319576 PMCID: PMC4607524 DOI: 10.1093/gbe/evv167] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Accepted: 08/23/2015] [Indexed: 12/17/2022] Open
Abstract
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University
| | | | - Ashley B Hotard
- Department of Biological Sciences, Louisiana State University
| | - Megan C Ranck
- Department of Biological Sciences, Louisiana State University
| | | | - Jessica Storer
- Department of Biological Sciences, Louisiana State University Department of Molecular, Cellular and Developmental Biology, The Ohio State University
| | - Chip Stewart
- Department of Biology, Boston College Cancer Genome Computational Analysis, Cambridge, MA
| | - Gabor T Marth
- Department of Biology, Boston College Eccles Institute of Human Genetics, University of Utah
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University
| |
Collapse
|
25
|
Makova KD, Hardison RC. The effects of chromatin organization on variation in mutation rates in the genome. Nat Rev Genet 2015; 16:213-23. [PMID: 25732611 PMCID: PMC4500049 DOI: 10.1038/nrg3890] [Citation(s) in RCA: 160] [Impact Index Per Article: 16.0] [Indexed: 12/12/2022]
Abstract
The variation in local rates of mutations can affect both the evolution of genes and their function in normal and cancer cells. Deciphering the molecular determinants of this variation will be aided by the elucidation of distinct types of mutations, as they differ in regional preferences and in associations with genomic features. Chromatin organization contributes to regional variation in mutation rates, but its contribution differs among mutation types. In both germline and somatic mutations, base substitutions are more abundant in regions of closed chromatin, perhaps reflecting error accumulation late in replication. By contrast, a distinctive mutational state with very high levels of insertions and deletions (indels) and substitutions is enriched in regions of open chromatin. These associations indicate an intricate interplay between the nucleotide sequence of DNA and its dynamic packaging into chromatin, and have important implications for current biomedical research. This Review focuses on recent studies showing associations between chromatin state and mutation rates, including pairwise and multivariate investigations of germline and somatic (particularly cancer) mutations.
Affiliation(s)
- Kateryna D Makova
  - Department of Biology, Huck Institute for Genome Sciences, The Pennsylvania State University, University Park, State College, Pennsylvania 16802, USA
- Ross C Hardison
  - Department of Biochemistry and Molecular Biology, Huck Institute for Genome Sciences, The Pennsylvania State University, University Park, State College, Pennsylvania 16802, USA
|
26
|
Lee J, Kim YJ, Mun S, Kim HS, Han K. Identification of human-specific AluS elements through comparative genomics. Gene 2014; 555:208-16. [PMID: 25447892 DOI: 10.1016/j.gene.2014.11.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 04/24/2014] [Revised: 11/03/2014] [Accepted: 11/05/2014] [Indexed: 01/08/2023]
Abstract
Mobile elements account for ~45% of the human genome. Among them is the Alu element, which makes up 10% of the human genome (>1.1 million copies). Several studies of Alu elements have reported that they are frequently involved in human genetic diseases and genomic rearrangements. In this study, we investigated the AluS subfamily, which is a relatively old Alu subfamily and has the highest copy number in primate genomes. Previously, a set of 263 human-specific AluS insertions was identified in the human genome. To validate these, we compared each of the human-specific AluS loci with its pre-insertion site in other primate genomes, including chimpanzee, gorilla, and orangutan. We obtained 24 putative human-specific AluS candidates via in silico analysis and manual inspection, and then tried to verify them using PCR amplification and DNA sequencing. Through PCR product sequencing, we were able to detect two instances of near-parallel Alu insertions in nearby sites that led to computational false negatives. Finally, we computationally and experimentally verified 23 human-specific AluS elements. We reported three alternative Alu insertion events, which are accompanied by filler DNA and/or Alu retrotransposition-mediated deletion. Bisulfite sequencing was carried out to examine DNA methylation levels of human-specific AluS elements. The results showed that fixed AluS elements are hypermethylated compared with polymorphic elements, indicating a possible relation between DNA methylation and Alu fixation in the human genome.
Affiliation(s)
- Jae Lee
  - Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
- Yun-Ji Kim
  - Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
  - DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
- Seyoung Mun
  - Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
  - DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
- Heui-Soo Kim
  - Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea
- Kyudong Han
  - Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
  - DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
|
27
|
|
28
|
Dynamic Alu methylation during normal development, aging, and tumorigenesis. Biomed Res Int 2014; 2014:784706. [PMID: 25243180 PMCID: PMC4163490 DOI: 10.1155/2014/784706] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Received: 04/10/2014] [Accepted: 08/16/2014] [Indexed: 12/15/2022]
Abstract
DNA methylation primarily occurs on CpG dinucleotides and plays an important role in transcriptional regulation during tissue development and cell differentiation. Over 25% of CpG dinucleotides in the human genome reside within Alu elements, the most abundant human repeats. The methylation of Alu elements is an important mechanism to suppress Alu transcription and subsequent retrotransposition. Decades of studies revealed that Alu methylation is highly dynamic during early development and aging. Recently, many environmental factors were shown to have a great impact on Alu methylation. In addition, aberrant Alu methylation has been documented to be an early event in many tumors, and Alu methylation levels have been associated with tumor aggressiveness. Assessment of Alu methylation has become an important approach for early diagnosis and/or prognosis of cancer. This review focuses on dynamic Alu methylation during development, aging, and tumorigenesis. The causes and consequences of Alu methylation changes are discussed.
|
29
|
Campos-Sánchez R, Kapusta A, Feschotte C, Chiaromonte F, Makova KD. Genomic landscape of human, bat, and ex vivo DNA transposon integrations. Mol Biol Evol 2014; 31:1816-32. [PMID: 24809961 DOI: 10.1093/molbev/msu138] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Indexed: 02/07/2023] Open
Abstract
The integration and fixation preferences of DNA transposons, one of the major classes of eukaryotic transposable elements, have never been evaluated comprehensively on a genome-wide scale. Here, we present a detailed study of the distribution of DNA transposons in the human and bat genomes. We studied three groups of DNA transposons that integrated at different evolutionary times: 1) ancient (>40 My) and currently inactive human elements, 2) younger (<40 My) bat elements, and 3) ex vivo integrations of piggyBat and Sleeping Beauty elements in HeLa cells. Although the distribution of ex vivo elements reflected integration preferences, the distribution of human and (to a lesser extent) bat elements was also affected by selection. We used regression techniques (linear, negative binomial, and logistic regression models with multiple predictors) applied to 20-kb and 1-Mb windows to investigate how the genomic landscape in the vicinity of DNA transposons contributes to their integration and fixation. Our models indicate that genomic landscape explains 16-79% of variability in DNA transposon genome-wide distribution. Importantly, we not only confirmed previously identified predictors (e.g., DNA conformation and recombination hotspots) but also identified several novel predictors (e.g., signatures of double-strand breaks and telomere hexamer). Ex vivo integrations showed a bias toward actively transcribed regions. Older DNA transposons were located in genomic regions scarce in most conserved elements, likely reflecting purifying selection. Our study highlights how DNA transposons are integral to the evolution of bat and human genomes, and has implications for the development of DNA transposon assays for gene therapy and mutagenesis applications.
Affiliation(s)
- Rebeca Campos-Sánchez
  - Genetics Program, The Huck Institutes of the Life Sciences, Penn State University, University Park, PA
- Aurélie Kapusta
  - Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
- Cédric Feschotte
  - Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
- Francesca Chiaromonte
  - Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PA
  - Department of Statistics, Penn State University, University Park, PA
- Kateryna D Makova
  - Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PA
  - Department of Biology, Penn State University, University Park, PA
|
30
|
Linker S, Hedges D. Linear decay of retrotransposon antisense bias across genes is contingent upon tissue specificity. PLoS One 2013; 8:e79402. [PMID: 24244495 PMCID: PMC3828378 DOI: 10.1371/journal.pone.0079402] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 06/11/2013] [Accepted: 09/28/2013] [Indexed: 12/23/2022] Open
Abstract
Retrotransposons comprise approximately half of the human genome and contribute to chromatin structure, regulatory motifs, and protein-coding sequences. Since retrotransposon insertions can disrupt functional genetic elements as well as introduce new sequence motifs to a region, they have the potential to affect the function of genes that harbour insertions as well as those nearby. Partly as a result of these effects, the distribution of retrotransposons across the genome is non-uniform and there are observed imbalances in the orientation of insertions with respect to the transcriptional direction of the containing gene. Although some of the factors underlying the observed distributions are understood, much of the variability remains unexplained. Detailed characterization of retrotransposon density in genes could help inform predictions of the functional consequence of de novo as well as polymorphic insertions. In order to characterize the relationship between genes and inserted elements, we have examined the distribution of retrotransposons and their internal motifs within tissue-specific and housekeeping genes. We have identified that the previously established retrotransposon antisense bias decays at a linear rate across genes, resulting in an equal density of sense and antisense retrotransposons near the 3'-UTR. In addition, the decay of antisense bias across genes is less pronounced among tissue-specific genes. Our results provide support for the scenario in which this linear decay in antisense bias is established by natural selection shortly after retrotransposon integration, and that total antisense bias observed is above and beyond any bias introduced by the integration process itself. Finally, we provide an example of a retrotransposon acting as an eQTL on a coincident gene, highlighting one of several possible avenues through which insertions may modulate gene function.
Affiliation(s)
- Sara Linker
  - Hussman Institute for Human Genomics, Dr John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- Dale Hedges
  - Division of Human Genetics, Department of Internal Medicine, The Ohio State University, Columbus, Ohio, United States of America
|
31
|
David M, Mustafa H, Brudno M. Detecting Alu insertions from high-throughput sequencing data. Nucleic Acids Res 2013; 41:e169. [PMID: 23921633 PMCID: PMC3783187 DOI: 10.1093/nar/gkt612] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing technologies have allowed for the cataloguing of variation in personal human genomes. In this manuscript, we present alu-detect, a tool that combines read-pair and split-read information to detect novel Alus and their precise breakpoints directly from either whole-genome or whole-exome sequencing data while also identifying insertions directly in the vicinity of existing Alus. To set the parameters of our method, we use simulation of a faux reference, which allows us to compute the precision and recall of various parameter settings using real sequencing data. Applying our method to 100 bp paired Illumina data from seven individuals, including two trios, we detected on average 1519 novel Alus per sample. Based on the faux-reference simulation, we estimate that our method has 97% precision and 85% recall. We identify 808 novel Alus not previously described in other studies. We also demonstrate the use of alu-detect to study the local sequence and global location preferences for novel Alu insertions.
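As a reading aid for the precision and recall figures quoted in this abstract, the following is a minimal sketch of how the two metrics are defined for a set of insertion calls. The function name and the counts are hypothetical; the counts are chosen only so that the result matches the reported 97% and 85%, and they are not taken from the alu-detect paper.

```python
def precision_recall(true_positives, false_positives, false_negatives):
    """Return (precision, recall) for a set of insertion calls.

    precision = fraction of reported calls that are real insertions
    recall    = fraction of real insertions that were reported
    """
    called = true_positives + false_positives
    real = true_positives + false_negatives
    precision = true_positives / called if called else 0.0
    recall = true_positives / real if real else 0.0
    return precision, recall


# Hypothetical counts (an assumption for illustration, not data from the study):
# 1000 simulated insertions, 850 recovered, 26 spurious calls.
precision, recall = precision_recall(
    true_positives=850, false_positives=26, false_negatives=150
)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In a faux-reference setup such as the one the abstract describes, the set of real insertions is known by construction, which is what makes both quantities computable on real sequencing data.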
Affiliation(s)
- Matei David
  - Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, ON M5S 3G4, Canada
  - Centre for Computational Medicine, Genetics and Genome Biology Program, The Hospital for Sick Children, 555 University Avenue, Toronto, ON M5G 1X8, Canada
|
32
|
Grandi FC, An W. Non-LTR retrotransposons and microsatellites: Partners in genomic variation. Mob Genet Elements 2013; 3:e25674. [PMID: 24195012 PMCID: PMC3812793 DOI: 10.4161/mge.25674] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 04/22/2013] [Revised: 07/07/2013] [Accepted: 07/09/2013] [Indexed: 01/10/2023] Open
Abstract
The human genome is laden with both non-LTR (long-terminal repeat) retrotransposons and microsatellite repeats. Both types of sequences are able to, either actively or passively, mutagenize the genomes of human individuals and are therefore poised to dynamically alter the human genomic landscape across generations. Non-LTR retrotransposons, such as L1 and Alu, are a major source of new microsatellites, which are born both concurrently and subsequently to L1 and Alu integration into the genome. Likewise, the mutation dynamics of microsatellite repeats have a direct impact on the fitness of their non-LTR retrotransposon parent owing to microsatellite expansion and contraction. This review explores the interactions and dynamics between non-LTR retrotransposons and microsatellites in the context of genomic variation and evolution.
Affiliation(s)
- Fiorella C Grandi
  - School of Molecular Biosciences and Center for Reproductive Biology; Washington State University; Pullman, WA USA
|
33
|
Monot C, Kuciak M, Viollet S, Mir AA, Gabus C, Darlix JL, Cristofari G. The specificity and flexibility of L1 reverse transcription priming at imperfect T-tracts. PLoS Genet 2013; 9:e1003499. [PMID: 23675310 PMCID: PMC3649969 DOI: 10.1371/journal.pgen.1003499] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Received: 10/25/2012] [Accepted: 03/22/2013] [Indexed: 01/18/2023] Open
Abstract
L1 retrotransposons have a prominent role in reshaping mammalian genomes. To replicate, the L1 ribonucleoprotein particle (RNP) first uses its endonuclease (EN) to nick the genomic DNA. The newly generated DNA end is subsequently used as a primer to initiate reverse transcription within the L1 RNA poly(A) tail, a process known as target-primed reverse transcription (TPRT). Prior studies demonstrated that most L1 insertions occur into sequences related to the L1 EN consensus sequence (degenerate 5′-TTTT/A-3′ sites) and are frequently preceded by imperfect T-tracts. However, it is currently unclear whether, and to what degree, the liberated 3′-hydroxyl extremity on the genomic DNA needs to be accessible and complementary to the poly(A) tail of the L1 RNA for efficient priming of reverse transcription. Here, we employed a direct assay for the initiation of L1 reverse transcription to define the molecular rules that guide this process. First, efficient priming is detected with as few as 4 matching nucleotides at the primer 3′ end. Second, L1 RNP can tolerate terminal mismatches if they are compensated within the 10 last bases of the primer by an increased number of matching nucleotides. Not all terminal mismatches are equally detrimental to DNA extension, a C being extended at higher levels than an A or a G. Third, efficient priming in the context of duplex DNA requires a 3′ overhang. This suggests the possible existence of additional DNA processing steps, which generate a single-stranded 3′ end to allow L1 reverse transcription. Based on these data we propose that the specificity of L1 reverse transcription initiation contributes, together with the specificity of the initial EN cleavage, to the distribution of new L1 insertions within the human genome.
Jumping genes are DNA sequences present in the genome of most living organisms. They contribute to genome dynamics and occasionally result in hereditary genetic diseases or cancer. L1 elements are the only autonomously active jumping genes in the human genome. They replicate through an RNA-mediated copy-and-paste mechanism by cleaving the host genome and then using this new DNA end as a primer to reverse transcribe their own RNA, generating a new L1 DNA copy. The molecular determinants that influence L1 target site choice are not fully understood. Here we present a quantitative assay to measure the influence of DNA target site sequence and structure on the reverse transcription step. By testing more than 65 potential DNA primers, we observe that not all sites are equally extended by the L1 machinery, and we define the rules guiding this process. In particular, we highlight the importance of partial sequence complementarity between the target site and the L1 RNA extremity, but also the high level of flexibility of this process, since detrimental terminal mismatches can be compensated by an increasing number of interacting nucleotides. We propose that this mechanism contributes to the distribution of new L1 insertions within the human genome.
Affiliation(s)
- Clément Monot
  - INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
- Monika Kuciak
  - INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
- Sébastien Viollet
  - INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
- Ashfaq Ali Mir
  - INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
- Caroline Gabus
  - Ecole Normale Supérieure de Lyon, Human Virology Department, INSERM U758, Lyon, France
- Jean-Luc Darlix
  - Ecole Normale Supérieure de Lyon, Human Virology Department, INSERM U758, Lyon, France
- Gaël Cristofari
  - INSERM, U1081, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - CNRS, UMR 7284, Institute for Research on Cancer and Aging, Nice (IRCAN), Nice, France
  - University of Nice-Sophia-Antipolis, Faculty of Medicine, Nice, France
|
34
|
Macfarlane CM, Collier P, Rahbari R, Beck CR, Wagstaff JF, Igoe S, Moran JV, Badge RM. Transduction-specific ATLAS reveals a cohort of highly active L1 retrotransposons in human populations. Hum Mutat 2013; 34:974-85. [PMID: 23553801 DOI: 10.1002/humu.22327] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Received: 10/18/2012] [Accepted: 03/15/2013] [Indexed: 11/09/2022]
Abstract
Long INterspersed Element-1 (LINE-1 or L1) retrotransposons are the only autonomously active transposable elements in the human genome. The average human genome contains ∼80-100 active L1s, but only a subset of these L1s are highly active or 'hot'. Human L1s are closely related in sequence, making it difficult to decipher progenitor/offspring relationships using traditional phylogenetic methods. However, L1 mRNAs can sometimes bypass their own polyadenylation signal and instead utilize fortuitous polyadenylation signals in 3' flanking genomic DNA. Retrotransposition of the resultant mRNAs then results in lineage-specific sequence "tags" (i.e., 3' transductions) that mark the descendants of active L1 progenitors. Here, we developed a method (Transduction-Specific Amplification Typing of L1 Active Subfamilies or TS-ATLAS) that exploits L1 3' transductions to identify active L1 lineages in a genome-wide context. TS-ATLAS enabled the characterization of a putative active progenitor of one L1 lineage that includes the disease-causing L1 insertion L1RP, and the identification of new retrotransposition events within two other "hot" L1 lineages. Intriguingly, the analysis of the newly discovered transduction lineage members suggests that L1 polyadenylation, even within a lineage, is highly stochastic. Thus, TS-ATLAS provides a new tool to explore the dynamics of L1 lineage evolution and retrotransposon biology.
|
35
|
Hackett PB, Largaespada DA, Switzer KC, Cooper LJN. Evaluating risks of insertional mutagenesis by DNA transposons in gene therapy. Transl Res 2013; 161:265-83. [PMID: 23313630 PMCID: PMC3602164 DOI: 10.1016/j.trsl.2012.12.005] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Received: 10/30/2012] [Revised: 12/10/2012] [Accepted: 12/11/2012] [Indexed: 12/30/2022]
Abstract
Investigational therapy can be successfully undertaken using viral- and nonviral-mediated ex vivo gene transfer. Indeed, recent clinical trials have established the potential for genetically modified T cells to improve and restore health. Recently, the Sleeping Beauty (SB) transposon/transposase system has been applied in clinical trials to stably insert a chimeric antigen receptor (CAR) to redirect T-cell specificity. We discuss the context in which the SB system can be harnessed for gene therapy and describe the human application of SB-modified CAR(+) T cells. We have focused on theoretical issues relating to insertional mutagenesis in the context of human genomes that are naturally subjected to remobilization of transposons and the experimental evidence over the last decade of employing SB transposons for defining genes that induce cancer. These findings are put into the context of the use of SB transposons in the treatment of human disease.
Affiliation(s)
- Perry B Hackett
  - Department of Genetics Cell Biology and Development, Center for Genome Engineering and Masonic Cancer Center, University of Minnesota, Minneapolis, MN 55455, USA
|