1
|
Moldovan JB, Kopera HC, Liu Y, Garcia-Canadas M, Catalina P, Leone PE, Sanchez L, Kitzman JO, Kidd JM, Garcia-Perez JL, Moran JV. Variable patterns of retrotransposition in different HeLa strains provide mechanistic insights into SINE RNA mobilization processes. Nucleic Acids Res 2024:gkae448. [PMID: 38850156 DOI: 10.1093/nar/gkae448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/08/2024] [Accepted: 05/14/2024] [Indexed: 06/10/2024] Open
Abstract
Alu elements are non-autonomous Short INterspersed Elements (SINEs) derived from the 7SL RNA gene that are present at over one million copies in human genomic DNA. Alu mobilizes by a mechanism known as retrotransposition, which requires the Long INterspersed Element-1 (LINE-1) ORF2-encoded protein (ORF2p). Here, we demonstrate that HeLa strains differ in their capacity to support Alu retrotransposition. Human Alu elements retrotranspose efficiently in HeLa-HA and HeLa-CCL2 (Alu-permissive) strains, but not in HeLa-JVM or HeLa-H1 (Alu-nonpermissive) strains. A similar pattern of retrotransposition was observed for other 7SL RNA-derived SINEs and tRNA-derived SINEs. In contrast, mammalian LINE-1s, a zebrafish LINE, a human SINE-VNTR-Alu (SVA) element, and an L1 ORF1-containing mRNA can retrotranspose in all four HeLa strains. Using an in vitro reverse transcriptase-based assay, we show that Alu RNAs associate with ORF2p and are converted into cDNAs in both Alu-permissive and Alu-nonpermissive HeLa strains, suggesting that 7SL- and tRNA-derived SINEs use strategies to 'hijack' L1 ORF2p that are distinct from those used by SVA elements and ORF1-containing mRNAs. These data further suggest ORF2p associates with the Alu RNA poly(A) tract in both Alu-permissive and Alu-nonpermissive HeLa strains, but that Alu retrotransposition is blocked after this critical step in Alu-nonpermissive HeLa strains.
Collapse
Affiliation(s)
- John B Moldovan
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Huira C Kopera
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ying Liu
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Marta Garcia-Canadas
- Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
| | | | - Paola E Leone
- Genetics and Genomics Laboratory, SOLCA Hospital, Quito, Ecuador
| | - Laura Sanchez
- Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
| | - Jacob O Kitzman
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jeffrey M Kidd
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jose Luis Garcia-Perez
- Department of Genomic Medicine, GENYO, Centre for Genomics and Oncological Research, Pfizer-University of Granada-Andalusian Regional Government, PTS Granada 18016, Spain
| | - John V Moran
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
2
|
Fernández-Suárez E, González-Del Pozo M, Méndez-Vidal C, Martín-Sánchez M, Mena M, de la Morena-Barrio B, Corral J, Borrego S, Antiñolo G. Long-read sequencing improves the genetic diagnosis of retinitis pigmentosa by identifying an Alu retrotransposon insertion in the EYS gene. Mob DNA 2024; 15:9. [PMID: 38704576 PMCID: PMC11069205 DOI: 10.1186/s13100-024-00320-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 04/10/2024] [Indexed: 05/06/2024] Open
Abstract
BACKGROUND Biallelic variants in EYS are the major cause of autosomal recessive retinitis pigmentosa (arRP) in certain populations, a clinically and genetically heterogeneous disease that may lead to legal blindness. EYS is one of the largest genes (~ 2 Mb) expressed in the retina, in which structural variants (SVs) represent a common cause of disease. However, their identification using short-read sequencing (SRS) is not always feasible. Here, we conducted targeted long-read sequencing (T-LRS) using adaptive sampling of EYS on the MinION sequencing platform (Oxford Nanopore Technologies) to definitively diagnose an arRP family, whose affected individuals (n = 3) carried the heterozygous pathogenic deletion of exons 32-33 in the EYS gene. As this was a recurrent variant identified in three additional families in our cohort, we also aimed to characterize the known deletion at the nucleotide level to assess a possible founder effect. RESULTS T-LRS in family A unveiled a heterozygous AluYa5 insertion in the coding exon 43 of EYS (chr6(GRCh37):g.64430524_64430525ins352), which segregated with the disease in compound heterozygosity with the previously identified deletion. Visual inspection of previous SRS alignments using IGV revealed several reads containing soft-clipped bases, accompanied by a slight drop in coverage at the Alu insertion site. This prompted us to develop a simplified program using grep command to investigate the recurrence of this variant in our cohort from SRS data. Moreover, LRS also allowed the characterization of the CNV as a ~ 56.4kb deletion spanning exons 32-33 of EYS (chr6(GRCh37):g.64764235_64820592del). The results of further characterization by Sanger sequencing and linkage analysis in the four families were consistent with a founder variant. CONCLUSIONS To our knowledge, this is the first report of a mobile element insertion into the coding sequence of EYS, as a likely cause of arRP in a family. Our study highlights the value of LRS technology in characterizing and identifying hidden pathogenic SVs, such as retrotransposon insertions, whose contribution to the etiopathogenesis of rare diseases may be underestimated.
Collapse
Affiliation(s)
- Elena Fernández-Suárez
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain
| | - María González-Del Pozo
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain
| | - Cristina Méndez-Vidal
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain
| | - Marta Martín-Sánchez
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain
| | - Marcela Mena
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain
| | - Belén de la Morena-Barrio
- Servicio de Hematología y Oncología Médica, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, IMIB-Pascual Parrilla, CIBERER-ISCIII, Murcia, Spain
| | - Javier Corral
- Servicio de Hematología y Oncología Médica, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, IMIB-Pascual Parrilla, CIBERER-ISCIII, Murcia, Spain
| | - Salud Borrego
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain.
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain.
| | - Guillermo Antiñolo
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville (IBiS), University Hospital Virgen del Rocío/CSIC, University of Seville, Seville, Spain.
- Center for Biomedical Network Research On Rare Diseases (CIBERER), Seville, Spain.
| |
Collapse
|
3
|
Moldovan JB, Kopera HC, Liu Y, Garcia-Canadas M, Catalina P, Leone PE, Sanchez L, Kitzman JO, Kidd JM, Garcia-Perez JL, Moran JV. Variable patterns of retrotransposition in different HeLa strains provide mechanistic insights into SINE RNA mobilization processes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.03.592410. [PMID: 38746229 PMCID: PMC11092746 DOI: 10.1101/2024.05.03.592410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Alu elements are non-autonomous Short INterspersed Elements (SINEs) derived from the 7SL RNA gene that are present at over one million copies in human genomic DNA. Alu mobilizes by a mechanism known as retrotransposition, which requires the Long INterspersed Element-1 (LINE-1 or L1) ORF2 -encoded protein (ORF2p). Here, we demonstrate that HeLa strains differ in their capacity to support Alu retrotransposition. Human Alu elements retrotranspose efficiently in HeLa-HA and HeLa-CCL2 ( Alu -permissive) strains, but not in HeLa-JVM or HeLa-H1 ( Alu -nonpermissive) strains. A similar pattern of retrotransposition was observed for other 7SL RNA -derived SINEs and tRNA -derived SINEs. In contrast, mammalian LINE-1s, a zebrafish LINE, a human SINE-VNTR - Alu ( SVA ) element, and an L1 ORF1 -containing messenger RNA can retrotranspose in all four HeLa strains. Using an in vitro reverse transcriptase-based assay, we show that Alu RNAs associate with ORF2p and are converted into cDNAs in both Alu -permissive and Alu -nonpermissive HeLa strains, suggesting that 7SL - and tRNA -derived SINE RNAs use strategies to 'hijack' L1 ORF2p that are distinct from those used by SVA elements and ORF1 -containing mRNAs. These data further suggest ORF2p associates with the Alu RNA poly(A) tract in both Alu -permissive and Alu -nonpermissive HeLa strains, but that Alu retrotransposition is blocked after this critical step in Alu -nonpermissive HeLa strains.
Collapse
|
4
|
Borodulina OR, Ustyantsev IG, Kramerov DA. SINEs as Potential Expression Cassettes: Impact of Deletions and Insertions on Polyadenylation and Lifetime of B2 and Ves SINE Transcripts Generated by RNA Polymerase III. Int J Mol Sci 2023; 24:14600. [PMID: 37834047 PMCID: PMC10572872 DOI: 10.3390/ijms241914600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023] Open
Abstract
Short Interspersed Elements (SINEs) are common in the genomes of most multicellular organisms. They are transcribed by RNA polymerase III from an internal promoter comprising boxes A and B. As transcripts of certain SINEs from mammalian genomes can be polyadenylated, such transcripts should contain the AATAAA sequence as well as those called β- and τ-signals. One of the goals of this work was to evaluate how autonomous and independent other SINE parts are β- and τ-signals. Extended regions outside of β- and τ-signals were deleted from SINEs B2 and Ves and the derived constructs were used to transfect HeLa cells in order to evaluate the relative levels of their transcripts as well as their polyadenylation efficiency. If the deleted regions affected boxes A and B, the 5'-flanking region of the U6 RNA gene with the external promoter was inserted upstream. Such substitution of the internal promoter in B2 completely restored its transcription. Almost all tested deletions/substitutions did not reduce the polyadenylation capacity of the transcripts, indicating a weak dependence of the function of β- and τ-signals on the neighboring sequences. A similar analysis of B2 and Ves constructs containing a 55-bp foreign sequence inserted between β- and τ-signals showed an equal polyadenylation efficiency of their transcripts compared to those of constructs without the insertion. The acquired poly(A)-tails significantly increased the lifetime and thus the cellular level of such transcripts. The data obtained highlight the potential of B2 and Ves SINEs as cassettes for the expression of relatively short sequences for various applications.
Collapse
Affiliation(s)
| | | | - Dmitri A. Kramerov
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., Moscow 119991, Russia; (O.R.B.); (I.G.U.)
| |
Collapse
|
5
|
Storer JM, Walker JA, Beckstrom TO, Batzer MA. Extensive Independent Amplification of Platy-1 Retroposons in Tamarins, Genus Saguinus. Genes (Basel) 2023; 14:1436. [PMID: 37510341 PMCID: PMC10378772 DOI: 10.3390/genes14071436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/05/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Platy-1 retroposons are short interspersed elements (SINEs) unique to platyrrhine primates. Discovered in the common marmoset (Callithrix jacchus) genome in 2016, these 100 bp mobile element insertions (MEIs) appeared to be novel drivers of platyrrhine evolution, with over 2200 full-length members across 62 different subfamilies, and strong evidence of ongoing proliferation in C. jacchus. Subsequent characterization of Platy-1 elements in Aotus, Saimiri and Cebus genera, suggested that the widespread mobilization detected in marmoset (family Callithrichidae) was perhaps an anomaly. Two additional Callithrichidae genomes are now available, a scaffold level genome assembly for Saguinus imperator (tamarin; SagImp_v1) and a chromosome-level assembly for Saguinus midas (Midas tamarin; ASM2_v1). Here, we report that each tamarin genome contains over 11,000 full-length Platy-1 insertions, about 1150 are shared by both Saguinus tamarins, 7511 are unique to S. imperator, and another 8187 are unique to S. midas. Roughly 325 are shared among the three callithrichids. We identified six new Platy-1 subfamilies derived from Platy-1-8, with the youngest new subfamily, Platy-1-8c_Saguinus, being the primary source of the Saguinus amplification burst. This constitutes the largest expansion of Platy-1 MEIs reported to date and the most extensive independent SINE amplification between two closely related species.
Collapse
Affiliation(s)
- Jessica M. Storer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Jerilyn A. Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | - Thomas O. Beckstrom
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
- Department of Oral and Maxillofacial Surgery, University of Washington, 1959 NE Pacific Street, Health Sciences Building B-241, Seattle, WA 98195, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| |
Collapse
|
6
|
Helm BM, Smith AM, Schmit K, Landis BJ, Vatta M, Ware SM. Disruption of FBN1 by an Alu element insertion: A novel genetic cause of Marfan syndrome. Eur J Med Genet 2023; 66:104775. [PMID: 37264881 DOI: 10.1016/j.ejmg.2023.104775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 02/17/2023] [Accepted: 04/29/2023] [Indexed: 06/03/2023]
Abstract
Alu elements are retrotransposons with ubiquitous presence in the human genome that have contributed to human genomic diversity and health. These approximately 300-bp sequences can cause or mediate disease by disrupting coding/splicing regions in the germline, by insertional mutagenesis in somatic cells, and in promoting formation of copy-number variants. Alu elements may also disrupt epigenetic regulation by affecting non-coding regulatory regions. There are increasing reports of apparently sporadic and inherited genetic disorders caused by Alu-related gene disruption, but Marfan syndrome resulting from Alu element insertion has not been previously described. We report a family with classic features of Marfan syndrome whose previous FBN1 genetic testing was inconclusive. Using contemporary next-generation sequencing and bioinformatics analysis, a pathogenic/disruptive Alu insertion occurring in the coding region of the FBN1 gene was identified (c.6564_6565insAlu; p. Glu2189fs) and was confirmed and specified further with Sanger sequencing. This identified the molecular basis of disease in the family that was missed using previous genetic testing technologies and highlights a novel pathogenic mechanism for Marfan syndrome. This case adds to the growing literature of Mendelian diseases caused by Alu retrotransposition, and it also shows the growing capability of genomic technologies for detecting atypical mutation events.
Collapse
Affiliation(s)
- Benjamin M Helm
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA; Department of Epidemiology, Indiana University Fairbanks School of Public Health, Indianapolis, IN, USA.
| | - Amanda M Smith
- Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kelly Schmit
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA.
| | - Benjamin J Landis
- Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA.
| | | | - Stephanie M Ware
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA; Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN, USA.
| |
Collapse
|
7
|
Kosushkin SA, Ustyantsev IG, Borodulina OR, Vassetzky NS, Kramerov DA. Tail Wags Dog’s SINE: Retropositional Mechanisms of Can SINE Depend on Its A-Tail Structure. BIOLOGY 2022; 11:biology11101403. [PMID: 36290307 PMCID: PMC9599045 DOI: 10.3390/biology11101403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 09/17/2022] [Accepted: 09/22/2022] [Indexed: 11/25/2022]
Abstract
Simple Summary The genomes of higher organisms including humans are invaded by millions of repetitive elements (transposons), which can sometimes be deleterious or beneficial for hosts. Many aspects of the mechanisms underlying the expansion of transposons in the genomes remain unclear. Short retrotransposons (SINEs) are one of the most abundant classes of genomic repeats. Their amplification relies on two major processes: transcription and reverse transcription. Here, short retrotransposons of dogs and other canids called Can SINE were analyzed. Their amplification was extraordinarily active in the wolf and, particularly, dog breeds relative to other canids. We also studied a variation of their transcription mechanism involving the polyadenylation of transcripts. An analysis of specific signals involved in this process allowed us to conclude that Can SINEs could alternate amplification with and without polyadenylation in their evolution. Understanding the mechanisms of transposon replication can shed light on the mechanisms of genome function. Abstract SINEs, non-autonomous short retrotransposons, are widespread in mammalian genomes. Their transcripts are generated by RNA polymerase III (pol III). Transcripts of certain SINEs can be polyadenylated, which requires polyadenylation and pol III termination signals in their sequences. Our sequence analysis divided Can SINEs in canids into four subfamilies, older a1 and a2 and younger b1 and b2. Can_b2 and to a lesser extent Can_b1 remained retrotranspositionally active, while the amplification of Can_a1 and Can_a2 ceased long ago. An extraordinarily high Can amplification was revealed in different dog breeds. Functional polyadenylation signals were analyzed in Can subfamilies, particularly in fractions of recently amplified, i.e., active copies. The transcription of various Can constructs transfected into HeLa cells proposed AATAAA and (TC)n as functional polyadenylation signals. Our analysis indicates that older Can subfamilies (a1, a2, and b1) with an active transcription terminator were amplified by the T+ mechanism (with polyadenylation of pol III transcripts). In the currently active Can_b2 subfamily, the amplification mechanisms with (T+) and without the polyadenylation of pol III transcripts (T−) irregularly alternate. The active transcription terminator tends to shorten, which renders it nonfunctional and favors a switch to the T− retrotransposition. The activity of a truncated terminator is occasionally restored by its elongation, which rehabilitates the T+ retrotransposition for a particular SINE copy.
Collapse
|
8
|
Kurosaki T, Ashizawa T. The genetic and molecular features of the intronic pentanucleotide repeat expansion in spinocerebellar ataxia type 10. Front Genet 2022; 13:936869. [PMID: 36199580 PMCID: PMC9528567 DOI: 10.3389/fgene.2022.936869] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 08/25/2022] [Indexed: 11/13/2022] Open
Abstract
Spinocerebellar ataxia type 10 (SCA10) is characterized by progressive cerebellar neurodegeneration and, in many patients, epilepsy. This disease mainly occurs in individuals with Indigenous American or East Asian ancestry, with strong evidence supporting a founder effect. The mutation causing SCA10 is a large expansion in an ATTCT pentanucleotide repeat in intron 9 of the ATXN10 gene. The ATTCT repeat is highly unstable, expanding to 280–4,500 repeats in affected patients compared with the 9–32 repeats in normal individuals, one of the largest repeat expansions causing neurological disorders identified to date. However, the underlying molecular basis of how this huge repeat expansion evolves and contributes to the SCA10 phenotype remains largely unknown. Recent progress in next-generation DNA sequencing technologies has established that the SCA10 repeat sequence has a highly heterogeneous structure. Here we summarize what is known about the structure and origin of SCA10 repeats, discuss the potential contribution of variant repeats to the SCA10 disease phenotype, and explore how this information can be exploited for therapeutic benefit.
Collapse
Affiliation(s)
- Tatsuaki Kurosaki
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY, United States
- Center for RNA Biology, University of Rochester, Rochester, NY, United States
- *Correspondence: Tatsuaki Kurosaki, ; Tetsuo Ashizawa,
| | - Tetsuo Ashizawa
- Stanley H. Appel Department of Neurology, Houston Methodist Research Institute and Weil Cornell Medical College at Houston Methodist Houston, TX, United States
- *Correspondence: Tatsuaki Kurosaki, ; Tetsuo Ashizawa,
| |
Collapse
|
9
|
Laine P, Rowell WJ, Paulin L, Kujawa S, Raterman D, Mayhew G, Wendt J, Burgess DL, Partonen T, Paunio T, Auvinen P, Ekholm JM. Alu element in the RNA binding motif protein, X-linked 2 (RBMX2) gene found to be linked to bipolar disorder. PLoS One 2021; 16:e0261170. [PMID: 34914762 PMCID: PMC8675739 DOI: 10.1371/journal.pone.0261170] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 11/24/2021] [Indexed: 11/23/2022] Open
Abstract
Objective We have used long-read single molecule, real-time (SMRT) sequencing to fully characterize a ~12Mb genomic region on chromosome Xq24-q27, significantly linked to bipolar disorder (BD) in an extended family from a genetic sub-isolate. This family segregates BD in at least four generations with 24 affected individuals. Methods We selected 16 family members for targeted sequencing. The selected individuals either carried the disease haplotype, were non-carriers of the disease haplotype, or served as married-in controls. We designed hybrid capture probes enriching for 5-9Kb fragments spanning the entire 12Mb region that were then sequenced to screen for candidate structural variants (SVs) that could explain the increased risk for BD in this extended family. Results Altogether, 201 variants were detected in the critically linked region. Although most of these represented common variants, three variants emerged that showed near-perfect segregation among all BD type I affected individuals. Two of the SVs were identified in or near genes belonging to the RNA Binding Motif Protein, X-Linked (RBMX) gene family—a 330bp Alu (subfamily AluYa5) deletion in intron 3 of the RBMX2 gene and an intergenic 27bp tandem repeat deletion between the RBMX and G protein-coupled receptor 101 (GPR101) genes. The third SV was a 50bp tandem repeat insertion in intron 1 of the Coagulation Factor IX (F9) gene. Conclusions Among the three genetically linked SVs, additional evidence supported the Alu element deletion in RBMX2 as the leading candidate for contributing directly to the disease development of BD type I in this extended family.
Collapse
Affiliation(s)
- Pia Laine
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | | | - Lars Paulin
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Steve Kujawa
- Pacific Biosciences, Menlo Park, CA, United States of America
| | - Denise Raterman
- Roche Sequencing Solutions, Madison, WI, United States of America
| | - George Mayhew
- Roche Sequencing Solutions, Madison, WI, United States of America
| | - Jennifer Wendt
- Roche Sequencing Solutions, Madison, WI, United States of America
| | | | - Timo Partonen
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland
| | - Tiina Paunio
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland
- Department of Psychiatry, University of Helsinki, Helsinki, Finland
| | - Petri Auvinen
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Jenny M. Ekholm
- Pacific Biosciences, Menlo Park, CA, United States of America
- * E-mail:
| |
Collapse
|
10
|
Kessler AC, Maraia RJ. The nuclear and cytoplasmic activities of RNA polymerase III, and an evolving transcriptome for surveillance. Nucleic Acids Res 2021; 49:12017-12034. [PMID: 34850129 PMCID: PMC8643620 DOI: 10.1093/nar/gkab1145] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 10/26/2021] [Accepted: 11/02/2021] [Indexed: 12/23/2022] Open
Abstract
A 1969 report that described biochemical and activity properties of the three eukaryotic RNA polymerases revealed Pol III as highly distinguishable, even before its transcripts were identified. Now known to be the most complex, Pol III contains several stably-associated subunits referred to as built-in transcription factors (BITFs) that enable highly efficient RNA synthesis by a unique termination-associated recycling process. In vertebrates, subunit RPC7(α/β) can be of two forms, encoded by POLR3G or POLR3GL, with differential activity. Here we review promoter-dependent transcription by Pol III as an evolutionary perspective of eukaryotic tRNA expression. Pol III also provides nonconventional functions reportedly by promoter-independent transcription, one of which is RNA synthesis from DNA 3'-ends during repair. Another is synthesis of 5'ppp-RNA signaling molecules from cytoplasmic viral DNA in a pathway of interferon activation that is dysfunctional in immunocompromised patients with mutations in Pol III subunits. These unconventional functions are also reviewed, including evidence that link them to the BITF subunits. We also review data on a fraction of the human Pol III transcriptome that evolved to include vault RNAs and snaRs with activities related to differentiation, and in innate immune and tumor surveillance. The Pol III of higher eukaryotes does considerably more than housekeeping.
Collapse
Affiliation(s)
- Alan C Kessler
- Division of Intramural Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, 20892 USA
| | - Richard J Maraia
- Division of Intramural Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, 20892 USA
| |
Collapse
|
11
|
Morgan M, Kumar L, Li Y, Baptissart M. Post-transcriptional regulation in spermatogenesis: all RNA pathways lead to healthy sperm. Cell Mol Life Sci 2021; 78:8049-8071. [PMID: 34748024 DOI: 10.1007/s00018-021-04012-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 10/11/2021] [Accepted: 10/25/2021] [Indexed: 01/22/2023]
Abstract
Multiple RNA pathways are required to produce functional sperm. Here, we review RNA post-transcriptional regulation during spermatogenesis with particular emphasis on the role of 3' end modifications. From early studies in the 1970s, it became clear that spermiogenesis transcripts could be stored for days only to be translated at advanced stages of spermatid differentiation. The transition between the translationally repressed and active states was observed to correlate with the shortening of the transcripts' poly(A) tail, establishing a link between RNA 3' end metabolism and male germ cell differentiation. Since then, numerous RNA metabolic pathways have been implicated not only in the progression through spermatogenesis, but also in the maintenance of genomic integrity. Recent studies have characterized the elusive 3' biogenesis of Piwi-interacting RNAs (piRNAs), identified a critical role for messenger RNA (mRNA) 3' uridylation in meiotic progression, established the mechanisms that destabilize transcripts with long 3' untranslated regions (3'UTRs) in post-mitotic cells, and defined the physiological relevance of RNA exonucleases and deadenylases in male germ cells. In this review, we discuss RNA processing in the male germline in the light of the most recent findings. A brief recollection of different RNA-processing events will aid future studies exploring post-transcriptional regulation in spermatogenesis.
Collapse
Affiliation(s)
- Marcos Morgan
- Reproductive and Developmental Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, 27709, USA.
| | - Lokesh Kumar
- Reproductive and Developmental Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, 27709, USA
| | - Yin Li
- Reproductive and Developmental Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, 27709, USA
| | - Marine Baptissart
- Reproductive and Developmental Biology Laboratory, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, 27709, USA
| |
Collapse
|
12
|
Analysis of SINE Families B2, Dip, and Ves with Special Reference to Polyadenylation Signals and Transcription Terminators. Int J Mol Sci 2021; 22:ijms22189897. [PMID: 34576060 PMCID: PMC8466645 DOI: 10.3390/ijms22189897] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/05/2021] [Accepted: 09/06/2021] [Indexed: 01/09/2023] Open
Abstract
Short Interspersed Elements (SINEs) are eukaryotic non-autonomous retrotransposons transcribed by RNA polymerase III (pol III). The 3′-terminus of many mammalian SINEs has a polyadenylation signal (AATAAA), pol III transcription terminator, and A-rich tail. The RNAs of such SINEs can be polyadenylated, which is unique for pol III transcripts. Here, B2 (mice and related rodents), Dip (jerboas), and Ves (vespertilionid bats) SINE families were thoroughly studied. They were divided into subfamilies reliably distinguished by relatively long indels. The age of SINE subfamilies can be estimated, which allows us to reconstruct their evolution. The youngest and most active variants of SINE subfamilies were given special attention. The shortest pol III transcription terminators are TCTTT (B2), TATTT (Ves and Dip), and the rarer TTTT. The last nucleotide of the terminator is often not transcribed; accordingly, the truncated terminator of its descendant becomes nonfunctional. The incidence of complete transcription of the TCTTT terminator is twice higher compared to TTTT and thus functional terminators are more likely preserved in daughter SINE copies. Young copies have long poly(A) tails; however, they gradually shorten in host generations. Unexpectedly, the tail shortening below A10 increases the incidence of terminator elongation by Ts thus restoring its efficiency. This process can be critical for the maintenance of SINE activity in the genome.
Collapse
|
13
|
Ebert P, Audano PA, Zhu Q, Rodriguez-Martin B, Porubsky D, Bonder MJ, Sulovari A, Ebler J, Zhou W, Serra Mari R, Yilmaz F, Zhao X, Hsieh P, Lee J, Kumar S, Lin J, Rausch T, Chen Y, Ren J, Santamarina M, Höps W, Ashraf H, Chuang NT, Yang X, Munson KM, Lewis AP, Fairley S, Tallon LJ, Clarke WE, Basile AO, Byrska-Bishop M, Corvelo A, Evani US, Lu TY, Chaisson MJP, Chen J, Li C, Brand H, Wenger AM, Ghareghani M, Harvey WT, Raeder B, Hasenfeld P, Regier AA, Abel HJ, Hall IM, Flicek P, Stegle O, Gerstein MB, Tubio JMC, Mu Z, Li YI, Shi X, Hastie AR, Ye K, Chong Z, Sanders AD, Zody MC, Talkowski ME, Mills RE, Devine SE, Lee C, Korbel JO, Marschall T, Eichler EE. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science 2021; 372:eabf7117. [PMID: 33632895 PMCID: PMC8026704 DOI: 10.1126/science.abf7117] [Citation(s) in RCA: 289] [Impact Index Per Article: 96.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 02/09/2021] [Indexed: 12/14/2022]
Abstract
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.
Collapse
Affiliation(s)
- Peter Ebert
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Peter A Audano
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Bernardo Rodriguez-Martin
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Marc Jan Bonder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Jana Ebler
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Rebecca Serra Mari
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Xuefang Zhao
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Joyce Lee
- Bionano Genomics, San Diego, CA 92121, USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Yu Chen
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Jingwen Ren
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Martin Santamarina
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Hufsah Ashraf
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Nelson T Chuang
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Xiaofei Yang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Luke J Tallon
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | | | | | | | | | | | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Junjie Chen
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Harrison Brand
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Aaron M Wenger
- Pacific Biosciences of California, Menlo Park, CA 94025, USA
| | - Maryam Ghareghani
- Max Planck Institute for Informatics, Saarland Informatics Campus E1.4, 66123 Saarbrücken, Germany
- Saarbrücken Graduate School of Computer Science, Saarland University, Saarland Informatics Campus E1.3, 66123 Saarbrücken, Germany
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Benjamin Raeder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Allison A Regier
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Haley J Abel
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Ira M Hall
- Department of Genetics, Yale School of Medicine, 333 Cedar Street, New Haven, CT 06510, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oliver Stegle
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Mark B Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jose M C Tubio
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Zepeng Mu
- Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | | | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Zechen Chong
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Ashley D Sanders
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | | | - Michael E Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Scott E Devine
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA.
- Precision Medicine Center, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China
- Department of Graduate Studies-Life Sciences, Ewha Womans University, Ewhayeodae-gil, Seodaemun-gu, Seoul 120-750, South Korea
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tobias Marschall
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
14
|
Han G, Zhang N, Jiang H, Meng X, Qian K, Zheng Y, Xu J, Wang J. Diversity of short interspersed nuclear elements (SINEs) in lepidopteran insects and evidence of horizontal SINE transfer between baculovirus and lepidopteran hosts. BMC Genomics 2021; 22:226. [PMID: 33789582 PMCID: PMC8010984 DOI: 10.1186/s12864-021-07543-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 03/22/2021] [Indexed: 11/16/2022] Open
Abstract
Background Short interspersed nuclear elements (SINEs) belong to non-long terminal repeat (non-LTR) retrotransposons, which can mobilize dependent on the help of counterpart long interspersed nuclear elements (LINEs). Although 234 SINEs have been identified so far, only 23 are from insect species (SINEbase: http://sines.eimb.ru/). Results Here, five SINEs were identified from the genome of Plutella xylostella, among which PxSE1, PxSE2 and PxSE3 were tRNA-derived SINEs, PxSE4 and PxSE5 were 5S RNA-derived SINEs. A total of 18 related SINEs were further identified in 13 lepidopteran insects and a baculovirus. The 3′-tail of PxSE5 shares highly identity with that of LINE retrotransposon, PxLINE1. The analysis of relative age distribution profiles revealed that PxSE1 is a relatively young retrotransposon in the genome of P. xylostella and was generated by recent explosive amplification. Integration pattern analysis showed that SINEs in P. xylostella prefer to insert into or accumulate in introns and regions 5 kb downstream of genes. In particular, the PxSE1-like element, SlNPVSE1, in Spodoptera litura nucleopolyhedrovirus II genome is highly identical to SfSE1 in Spodoptera frugiperda, SlittSE1 in Spodoptera littoralis, and SlituSE1 in Spodoptera litura, suggesting the occurrence of horizontal transfer. Conclusions Lepidopteran insect genomes harbor a diversity of SINEs. The retrotransposition activity and copy number of these SINEs varies considerably between host lineages and SINE lineages. Host-parasite interactions facilitate the horizontal transfer of SINE between baculovirus and its lepidopteran hosts. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07543-z.
Collapse
Affiliation(s)
- Guangjie Han
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China.,Jiangsu Lixiahe District Institute of Agricultural Sciences, Yangzhou, 225008, China
| | - Nan Zhang
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China
| | - Heng Jiang
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China
| | - Xiangkun Meng
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China
| | - Kun Qian
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China
| | - Yang Zheng
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China
| | - Jian Xu
- Jiangsu Lixiahe District Institute of Agricultural Sciences, Yangzhou, 225008, China.
| | - Jianjun Wang
- College of Horticulture and Plant Protection, Yangzhou University, Yangzhou, 225009, China. .,Joint International Research Laboratory of Agriculture andAgri-Product Safety of the Ministry of Education, Yangzhou University, Yangzhou, 225009, China.
| |
Collapse
|
15
|
Analysis of HLA-G long-read genomic sequences in mother-offspring pairs with preeclampsia. Sci Rep 2020; 10:20027. [PMID: 33208885 PMCID: PMC7675977 DOI: 10.1038/s41598-020-77081-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 11/06/2020] [Indexed: 11/11/2022] Open
Abstract
Preeclampsia is a pregnancy-induced disorder that is characterized by hypertension and is a leading cause of perinatal and maternal–fetal morbidity and mortality. HLA-G is thought to play important roles in maternal–fetal immune tolerance, and the associations between HLA-G gene polymorphisms and the onset of pregnancy-related diseases have been explored extensively. Because contiguous genomic sequencing is difficult, the association between the HLA-G genotype and preeclampsia onset is controversial. In this study, genomic sequences of the HLA-G region (5.2 kb) from 31 pairs of mother–offspring genomic DNA samples (18 pairs from normal pregnancies/births and 13 from preeclampsia births) were obtained by single-molecule real-time sequencing using the PacBio RS II platform. The HLA-G alleles identified in our cohort matched seven known HLA-G alleles, but we also identified two new HLA-G alleles at the fourth-field resolution and compared them with nucleotide sequences from a public database that consisted of coding sequences that cover the 3.1-kb HLA-G gene span. Intriguingly, a potential association between preeclampsia onset and the poly T stretch within the downstream region of the HLA-G*01:01:01:01 allele was found. Our study suggests that long-read sequencing of HLA-G will provide clues for characterizing HLA-G variants that are involved in the pathophysiology of preeclampsia.
Collapse
|
16
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
17
|
Santagostino M, Piras FM, Cappelletti E, Del Giudice S, Semino O, Nergadze SG, Giulotto E. Insertion of Telomeric Repeats in the Human and Horse Genomes: An Evolutionary Perspective. Int J Mol Sci 2020; 21:ijms21082838. [PMID: 32325780 PMCID: PMC7215372 DOI: 10.3390/ijms21082838] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 04/15/2020] [Accepted: 04/16/2020] [Indexed: 01/06/2023] Open
Abstract
Interstitial telomeric sequences (ITSs) are short stretches of telomeric-like repeats (TTAGGG)n at nonterminal chromosomal sites. We previously demonstrated that, in the genomes of primates and rodents, ITSs were inserted during the repair of DNA double-strand breaks. These conclusions were derived from sequence comparisons of ITS-containing loci and ITS-less orthologous loci in different species. To our knowledge, insertion polymorphism of ITSs, i.e., the presence of an ITS-containing allele and an ITS-less allele in the same species, has not been described. In this work, we carried out a genome-wide analysis of 2504 human genomic sequences retrieved from the 1000 Genomes Project and a PCR-based analysis of 209 human DNA samples. In spite of the large number of individual genomes analyzed we did not find any evidence of insertion polymorphism in the human population. On the contrary, the analysis of ITS loci in the genome of a single horse individual, the reference genome, allowed us to identify five heterozygous ITS loci, suggesting that insertion polymorphism of ITSs is an important source of genetic variability in this species. Finally, following a comparative sequence analysis of horse ITSs and of their orthologous empty loci in other Perissodactyla, we propose models for the mechanism of ITS insertion during the evolution of this order.
Collapse
|
18
|
Shortt JA, Ruggiero RP, Cox C, Wacholder AC, Pollock DD. Finding and extending ancient simple sequence repeat-derived regions in the human genome. Mob DNA 2020; 11:11. [PMID: 32095164 PMCID: PMC7027126 DOI: 10.1186/s13100-020-00206-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 02/04/2020] [Indexed: 12/19/2022] Open
Abstract
Background Previously, 3% of the human genome has been annotated as simple sequence repeats (SSRs), similar to the proportion annotated as protein coding. The origin of much of the genome is not well annotated, however, and some of the unidentified regions are likely to be ancient SSR-derived regions not identified by current methods. The identification of these regions is complicated because SSRs appear to evolve through complex cycles of expansion and contraction, often interrupted by mutations that alter both the repeated motif and mutation rate. We applied an empirical, kmer-based, approach to identify genome regions that are likely derived from SSRs. Results The sequences flanking annotated SSRs are enriched for similar sequences and for SSRs with similar motifs, suggesting that the evolutionary remains of SSR activity abound in regions near obvious SSRs. Using our previously described P-clouds approach, we identified ‘SSR-clouds’, groups of similar kmers (or ‘oligos’) that are enriched near a training set of unbroken SSR loci, and then used the SSR-clouds to detect likely SSR-derived regions throughout the genome. Conclusions Our analysis indicates that the amount of likely SSR-derived sequence in the human genome is 6.77%, over twice as much as previous estimates, including millions of newly identified ancient SSR-derived loci. SSR-clouds identified poly-A sequences adjacent to transposable element termini in over 74% of the oldest class of Alu (roughly, AluJ), validating the sensitivity of the approach. Poly-A’s annotated by SSR-clouds also had a length distribution that was more consistent with their poly-A origins, with mean about 35 bp even in older Alus. This work demonstrates that the high sensitivity provided by SSR-Clouds improves the detection of SSR-derived regions and will enable deeper analysis of how decaying repeats contribute to genome structure.
Collapse
Affiliation(s)
- Jonathan A Shortt
- 1Colorado Center for Personalized Medicine, University of Colorado School of Medicine, Aurora, CO 80045 USA
| | - Robert P Ruggiero
- 2Department of Biology, Southeast Missouri State University, Cape Girardeau, MO 63701 USA
| | - Corey Cox
- 1Colorado Center for Personalized Medicine, University of Colorado School of Medicine, Aurora, CO 80045 USA
| | - Aaron C Wacholder
- 3Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213 USA
| | - David D Pollock
- 4Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045 USA
| |
Collapse
|
19
|
Meng H, Feng J, Bai T, Jian Z, Chen Y, Wu G. Genome-wide analysis of short interspersed nuclear elements provides insight into gene and genome evolution in citrus. DNA Res 2020; 27:5818487. [PMID: 32271875 PMCID: PMC7315354 DOI: 10.1093/dnares/dsaa004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 04/03/2020] [Indexed: 12/03/2022] Open
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous retrotransposons that are highly abundant, but not well annotated, in plant genomes. In this study, we identified 41,573 copies of SINEs in seven citrus genomes, including 11,275 full-length copies. The citrus SINEs were distributed among 12 families, with an average full-length rate of 0.27, and were dispersed throughout the chromosomes, preferentially in AT-rich areas. Approximately 18.4% of citrus SINEs were found in close proximity (≤1 kb upstream) to genes, indicating a significant enrichment of SINEs in promoter regions. Citrus SINEs promote gene and genome evolution by offering exons as well as splice sites and start and stop codons, creating novel genes and forming tandem and dispersed repeat structures. Comparative analysis of unique homologous SINE-containing loci (HSCLs) revealed chromosome rearrangements in sweet orange, pummelo, and mandarin, suggesting that unique HSCLs might be valuable for understanding chromosomal abnormalities. This study of SINEs provides us with new perspectives and new avenues by which to understand the evolution of citrus genes and genomes.
Collapse
Affiliation(s)
- Haijun Meng
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| | - Jiancan Feng
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| | - Tuanhui Bai
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| | - Zaihai Jian
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| | - Yanhui Chen
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| | - Guoliang Wu
- College of Horticulture, Henan Agricultural University, Zhengzhou 450002, China
| |
Collapse
|
20
|
Ray DA, Grimshaw JR, Halsey MK, Korstian JM, Osmanski AB, Sullivan KAM, Wolf KA, Reddy H, Foley N, Stevens RD, Knisbacher BA, Levy O, Counterman B, Edelman NB, Mallet J. Simultaneous TE Analysis of 19 Heliconiine Butterflies Yields Novel Insights into Rapid TE-Based Genome Diversification and Multiple SINE Births and Deaths. Genome Biol Evol 2019; 11:2162-2177. [PMID: 31214686 PMCID: PMC6685494 DOI: 10.1093/gbe/evz125] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/11/2019] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) play major roles in the evolution of genome structure and function. However, because of their repetitive nature, they are difficult to annotate and discovering the specific roles they may play in a lineage can be a daunting task. Heliconiine butterflies are models for the study of multiple evolutionary processes including phenotype evolution and hybridization. We attempted to determine how TEs may play a role in the diversification of genomes within this clade by performing a detailed examination of TE content and accumulation in 19 species whose genomes were recently sequenced. We found that TE content has diverged substantially and rapidly in the time since several subclades shared a common ancestor with each lineage harboring a unique TE repertoire. Several novel SINE lineages have been established that are restricted to a subset of species. Furthermore, the previously described SINE, Metulj, appears to have gone extinct in two subclades while expanding to significant numbers in others. This diversity in TE content and activity has the potential to impact how heliconiine butterflies continue to evolve and diverge.
Collapse
Affiliation(s)
- David A Ray
- Department of Biological Science, Texas Tech University
| | | | | | | | | | | | | | - Harsith Reddy
- Department of Biological Science, Texas Tech University
| | - Nicole Foley
- Department of Biological Science, Texas Tech University
- Department of Veterinary Integrative Biosciences, College of Veterinary Medicine, Texas A&M University, College Station, TX
| | | | - Binyamin A Knisbacher
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel
- Broad Institute of MIT and Harvard, Cambridge, MA
| | - Orr Levy
- Department of Physics, Bar-Ilan University, Ramat Gan, Israel
| | | | | | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University
| |
Collapse
|
21
|
Selective elimination of long INterspersed element-1 expressing tumour cells by targeted expression of the HSV-TK suicide gene. Oncotarget 2018; 8:38239-38250. [PMID: 28415677 PMCID: PMC5503529 DOI: 10.18632/oncotarget.16013] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2017] [Accepted: 03/02/2017] [Indexed: 12/31/2022] Open
Abstract
In gene therapy, effective and selective suicide gene expression is crucial. We exploited the endogenous Long INterspersed Element-1 (L1) machinery often reactivated in human cancers to integrate the Herpes Simplex Virus Thymidine Kinase (HSV-TK) suicide gene selectively into the genome of cancer cells. We developed a plasmid-based system directing HSV-TK expression only when reverse transcribed and integrated in the host genome via the endogenous L1 ORF1/2 proteins and an Alu element. Delivery of these new constructs into cells followed by Ganciclovir (GCV) treatment selectively induced mortality of L1 ORF1/2 protein expressing cancer cells, but had no effect on primary cells that do not express L1 ORF1/2. This novel strategy for selective targeting of tumour cells provides high tolerability as the HSV-TK gene cannot be expressed without reverse transcription and integration, and high selectivity as these processes take place only in cancer cells expressing high levels of functional L1 ORF1/2.
Collapse
|
22
|
Klein SJ, O'Neill RJ. Transposable elements: genome innovation, chromosome diversity, and centromere conflict. Chromosome Res 2018; 26:5-23. [PMID: 29332159 PMCID: PMC5857280 DOI: 10.1007/s10577-017-9569-5] [Citation(s) in RCA: 106] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/05/2017] [Accepted: 12/12/2017] [Indexed: 12/21/2022]
Abstract
Although it was nearly 70 years ago when transposable elements (TEs) were first discovered “jumping” from one genomic location to another, TEs are now recognized as contributors to genomic innovations as well as genome instability across a wide variety of species. In this review, we illustrate the ways in which active TEs, specifically retroelements, can create novel chromosome rearrangements and impact gene expression, leading to disease in some cases and species-specific diversity in others. We explore the ways in which eukaryotic genomes have evolved defense mechanisms to temper TE activity and the ways in which TEs continue to influence genome structure despite being rendered transpositionally inactive. Finally, we focus on the role of TEs in the establishment, maintenance, and stabilization of critical, yet rapidly evolving, chromosome features: eukaryotic centromeres. Across centromeres, specific types of TEs participate in genomic conflict, a balancing act wherein they are actively inserting into centromeric domains yet are harnessed for the recruitment of centromeric histones and potentially new centromere formation.
Collapse
Affiliation(s)
- Savannah J Klein
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA
| | - Rachel J O'Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA.
| |
Collapse
|
23
|
Kryatova MS, Steranka JP, Burns KH, Payer LM. Insertion and deletion polymorphisms of the ancient AluS family in the human genome. Mob DNA 2017; 8:6. [PMID: 28450901 PMCID: PMC5402677 DOI: 10.1186/s13100-017-0089-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 04/04/2017] [Indexed: 01/09/2023] Open
Abstract
Background Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. Results Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3′ intact with 3′ poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. Conclusions Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted. Electronic supplementary material The online version of this article (doi:10.1186/s13100-017-0089-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maria S Kryatova
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Jared P Steranka
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| |
Collapse
|
24
|
Conserved 3' UTR stem-loop structure in L1 and Alu transposons in human genome: possible role in retrotransposition. BMC Genomics 2016; 17:992. [PMID: 27914481 PMCID: PMC5135761 DOI: 10.1186/s12864-016-3344-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 11/25/2016] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND In the process of retrotransposition LINEs use their own machinery for copying and inserting themselves into new genomic locations, while SINEs are parasitic and require the machinery of LINEs. The exact mechanism of how a LINE-encoded reverse transcriptase (RT) recognizes its own and SINE RNA remains unclear. However it was shown for the stringent-type LINEs that recognition of a stem-loop at the 3'UTR by RT is essential for retrotransposition. For the relaxed-type LINEs it is believed that the poly-A tail is a common recognition element between LINE and SINE RNA. However polyadenylation is a property of any messenger RNA, and how the LINE RT recognizes transposon and non-transposon RNAs remains an open question. It is likely that RNA secondary structures play an important role in RNA recognition by LINE encoded proteins. RESULTS Here we selected a set of L1 and Alu elements from the human genome and investigated their sequences for the presence of position-specific stem-loop structures. We found highly conserved stem-loop positions at the 3'UTR. Comparative structural analyses of a human L1 3'UTR stem-loop showed a similarity to 3'UTR stem-loops of the stringent-type LINEs, which were experimentally shown to be recognized by LINE RT. The consensus stem-loop structure consists of 5-7 bp loop, 8-10 bp stem with a bulge at a distance of 4-6 bp from the loop. The results show that a stem loop with a bulge exists at the 3'-end of Alu. We also found conserved stem-loop positions at 5'UTR and at the end of ORF2 and discuss their possible role. CONCLUSIONS Here we presented an evidence for the presence of a highly conserved 3'UTR stem-loop structure in L1 and Alu retrotransposons in the human genome. Both stem-loops show structural similarity to the stem-loops of the stringent-type LINEs experimentally confirmed as essential for retrotransposition. Here we hypothesize that both L1 and Alu RNA are recognized by L1 RT via the 3'-end RNA stem-loop structure. Other conserved stem-loop positions in L1 suggest their possible functions in protein-RNA interactions but to date no experimental evidence has been reported.
Collapse
|
25
|
Seibt KM, Wenke T, Muders K, Truberg B, Schmidt T. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 86:268-285. [PMID: 26996788 DOI: 10.1111/tpj.13170] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 03/11/2016] [Accepted: 03/14/2016] [Indexed: 06/05/2023]
Abstract
Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | | | | | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
26
|
Floor SN, Doudna JA. Get in LINE: Competition for Newly Minted Retrotransposon Proteins at the Ribosome. Mol Cell 2016; 60:712-714. [PMID: 26638173 DOI: 10.1016/j.molcel.2015.11.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In this issue, Ahl et al. (2015) and Doucet et al. (2015) illuminate structural and functional features of substrates that promote integration of RNA molecules into the human genome by LINE retrotransposons, contributing to the ∼ 50% of the human genome that has been colonized by mobile genetic elements.
Collapse
Affiliation(s)
- Stephen N Floor
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jennifer A Doudna
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Initiative, University of California, Berkeley, Berkeley, CA 94720, USA; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| |
Collapse
|
27
|
Konkel MK, Ullmer B, Arceneaux EL, Sanampudi S, Brantley SA, Hubley R, Smit AFA, Batzer MA. Discovery of a new repeat family in the Callithrix jacchus genome. Genome Res 2016; 26:649-59. [PMID: 26916108 PMCID: PMC4864456 DOI: 10.1101/gr.199075.115] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 02/23/2016] [Indexed: 11/24/2022]
Abstract
We identified a novel repeat family, termed Platy-1, in the Callithrix jacchus (common marmoset) genome that arose around the time of the divergence of platyrrhines and catarrhines and established itself as a repeat family in New World monkeys (NWMs). A full-length Platy-1 element is ∼100 bp in length, making it the shortest known short interspersed element (SINE) in primates, and harbors features characteristic of non-LTR retrotransposons. We identified 2268 full-length Platy-1 elements across 62 subfamilies in the common marmoset genome. Our subfamily reconstruction and phylogenetic analyses support Platy-1 propagation throughout the evolution of NWMs in the lineage leading to C. jacchus Platy-1 appears to have reached its amplification peak in the common ancestor of current day marmosets and has since moderately declined. However, identification of more than 200 Platy-1 elements identical to their respective consensus sequence, and the presence of polymorphic elements within common marmoset populations, suggests ongoing retrotransposition activity. Platy-1, a SINE, appears to have originated from an Alu element, and hence is likely derived from 7SL RNA. Our analyses illustrate the birth of a new repeat family and its propagation dynamics in the lineage leading to the common marmoset over the last 40 million years.
Collapse
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Brygg Ullmer
- School of Electrical Engineering and Computer Science, Center for Computation and Technology, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Erika L Arceneaux
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Sreeja Sanampudi
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Sarah A Brantley
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Robert Hubley
- Institute for Systems Biology, Seattle, Washington 98109-5263, USA
| | - Arian F A Smit
- Institute for Systems Biology, Seattle, Washington 98109-5263, USA
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| |
Collapse
|
28
|
Bakshi A, Herke SW, Batzer MA, Kim J. DNA methylation variation of human-specific Alu repeats. Epigenetics 2016; 11:163-73. [PMID: 26890526 DOI: 10.1080/15592294.2015.1130518] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
DNA methylation is the major repression mechanism for human retrotransposons, such as the Alu family. Here, we have determined the methylation levels associated with 5238 loci belonging to 2 Alu subfamilies, AluYa5 and AluYb8, using high-throughput targeted repeat element bisulfite sequencing (HT-TREBS). The results indicate that ∼90% of loci are repressed by high methylation levels. Of the remaining loci, many of the hypomethylated elements are found near gene promoters and show high levels of DNA methylation variation. We have characterized this variation in the context of tumorigenesis and interindividual differences. Comparison of a primary breast tumor and its matched normal tissue revealed early DNA methylation changes in ∼1% of AluYb8 elements in response to tumorigenesis. Simultaneously, AluYa5/Yb8 elements proximal to promoters also showed differences in methylation of up to one order of magnitude, even between normal individuals. Overall, the current study demonstrates that early loss of methylation occurs during tumorigenesis in a subset of young Alu elements, suggesting their potential clinical relevance. However, approaches such as deep-bisulfite-sequencing of individual loci using HT-TREBS are required to distinguish clinically relevant loci from the background observed for AluYa5/Yb8 elements in general with regard to high levels of interindividual variation in DNA methylation.
Collapse
Affiliation(s)
- Arundhati Bakshi
- a Department of Biological Sciences , Louisiana State University , Baton Rouge , LA , USA
| | - Scott W Herke
- a Department of Biological Sciences , Louisiana State University , Baton Rouge , LA , USA
| | - Mark A Batzer
- a Department of Biological Sciences , Louisiana State University , Baton Rouge , LA , USA
| | - Joomyeong Kim
- a Department of Biological Sciences , Louisiana State University , Baton Rouge , LA , USA
| |
Collapse
|
29
|
Servant G, Deininger PL. Insertion of Retrotransposons at Chromosome Ends: Adaptive Response to Chromosome Maintenance. Front Genet 2016; 6:358. [PMID: 26779254 PMCID: PMC4700185 DOI: 10.3389/fgene.2015.00358] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 12/10/2015] [Indexed: 01/30/2023] Open
Abstract
The telomerase complex is a specialized reverse transcriptase (RT) that inserts tandem DNA arrays at the linear chromosome ends and contributes to the protection of the genetic information in eukaryotic genomes. Telomerases are phylogenetically related to retrotransposons, encoding also the RT activity required for the amplification of their sequences throughout the genome. Intriguingly the telomerase gene is lost from the Drosophila genome and tandem retrotransposons replace telomeric sequences at the chromosome extremities. This observation suggests the versatility of RT activity in counteracting the chromosome shortening associated with genome replication and that retrotransposons can provide this activity in case of a dysfunctional telomerase. In this review paper, we describe the major classes of retroelements present in eukaryotic genomes in order to point out the differences and similarities with the telomerase complex. In a second part, we discuss the insertion of retroelements at the ends of chromosomes as an adaptive response for dysfunctional telomeres.
Collapse
Affiliation(s)
| | - Prescott L. Deininger
- Tulane Cancer Center, Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LAUSA
| |
Collapse
|
30
|
Schwichtenberg K, Wenke T, Zakrzewski F, Seibt KM, Minoche A, Dohm JC, Weisshaar B, Himmelbauer H, Schmidt T. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 85:229-44. [PMID: 26676716 DOI: 10.1111/tpj.13103] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2015] [Revised: 11/23/2015] [Accepted: 11/26/2015] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs.
Collapse
Affiliation(s)
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Falk Zakrzewski
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - André Minoche
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
| | - Juliane C Dohm
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
| | - Bernd Weisshaar
- CeBiTec & Department of Biology, University of Bielefeld, 33615, Bielefeld, Germany
| | - Heinz Himmelbauer
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| |
Collapse
|
31
|
Ewing AD. Transposable element detection from whole genome sequence data. Mob DNA 2015; 6:24. [PMID: 26719777 PMCID: PMC4696183 DOI: 10.1186/s13100-015-0055-3] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 12/21/2015] [Indexed: 11/25/2022] Open
Abstract
The number of software tools available for detecting transposable element insertions from whole genome sequence data has been increasing steadily throughout the last ~5 years. Some of these methods have unique features suiting them for particular use cases, but in general they follow one or more of a common set of approaches. Here, detection and filtering approaches are reviewed in the light of transposable element biology and the current state of whole genome sequencing. We demonstrate that the current state-of-the-art methods still do not produce highly concordant results and provide resources to assist future development in transposable element detection methods.
Collapse
Affiliation(s)
- Adam D Ewing
- Mater Research Institute - University of Queensland, 37 Kent St Level 4, Woolloongabba, QLD 4102 Australia
| |
Collapse
|
32
|
Polyadenylation of RNA transcribed from mammalian SINEs by RNA polymerase III: Complex requirements for nucleotide sequences. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2015; 1859:355-65. [PMID: 26700565 DOI: 10.1016/j.bbagrm.2015.12.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Revised: 12/09/2015] [Accepted: 12/11/2015] [Indexed: 01/08/2023]
Abstract
It is generally accepted that only transcripts synthesized by RNA polymerase II (e.g., mRNA) were subject to AAUAAA-dependent polyadenylation. However, we previously showed that RNA transcribed by RNA polymerase III (pol III) from mouse B2 SINE could be polyadenylated in an AAUAAA-dependent manner. Many species of mammalian SINEs end with the pol III transcriptional terminator (TTTTT) and contain hexamers AATAAA in their A-rich tail. Such SINEs were united into Class T(+), whereas SINEs lacking the terminator and AATAAA sequences were classified as T(-). Here we studied the structural features of SINE pol III transcripts that are necessary for their polyadenylation. Eight and six SINE families from classes T(+) and T(-), respectively, were analyzed. The replacement of AATAAA with AACAAA in T(+) SINEs abolished the RNA polyadenylation. Interestingly, insertion of the polyadenylation signal (AATAAA) and pol III transcription terminator in T(-) SINEs did not result in polyadenylation. The detailed analysis of three T(+) SINEs (B2, DIP, and VES) revealed areas important for the polyadenylation of their pol III transcripts: the polyadenylation signal and terminator in A-rich tail, β region positioned immediately downstream of the box B of pol III promoter, and τ region located upstream of the tail. In DIP and VES (but not in B2), the τ region is a polypyrimidine motif which is also characteristic of many other T(+) SINEs. Most likely, SINEs of different mammals acquired these structural features independently as a result of parallel evolution.
Collapse
|
33
|
Konkel MK, Walker JA, Hotard AB, Ranck MC, Fontenot CC, Storer J, Stewart C, Marth GT, Batzer MA. Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project. Genome Biol Evol 2015; 7:2608-22. [PMID: 26319576 PMCID: PMC4607524 DOI: 10.1093/gbe/evv167] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2015] [Indexed: 12/17/2022] Open
Abstract
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
Collapse
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University
| | | | - Ashley B Hotard
- Department of Biological Sciences, Louisiana State University
| | - Megan C Ranck
- Department of Biological Sciences, Louisiana State University
| | | | - Jessica Storer
- Department of Biological Sciences, Louisiana State University Department of Molecular, Cellular and Developmental Biology, The Ohio State University
| | - Chip Stewart
- Department of Biology, Boston College Cancer Genome Computational Analysis, Cambridge, MA
| | - Gabor T Marth
- Department of Biology, Boston College Eccles Institute of Human Genetics, University of Utah
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University
| |
Collapse
|
34
|
Gallus S, Kumar V, Bertelsen MF, Janke A, Nilsson MA. A genome survey sequencing of the Java mouse deer (Tragulus javanicus) adds new aspects to the evolution of lineage specific retrotransposons in Ruminantia (Cetartiodactyla). Gene 2015; 571:271-8. [PMID: 26123917 DOI: 10.1016/j.gene.2015.06.064] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2015] [Revised: 06/24/2015] [Accepted: 06/25/2015] [Indexed: 10/23/2022]
Abstract
Ruminantia, the ruminating, hoofed mammals (cow, deer, giraffe and allies) are an unranked artiodactylan clade. Around 50-60 million years ago the BovB retrotransposon entered the ancestral ruminantian genome through horizontal gene transfer. A survey genome screen using 454-pyrosequencing of the Java mouse deer (Tragulus javanicus) and the lesser kudu (Tragelaphus imberbis) was done to investigate and to compare the landscape of transposable elements within Ruminantia. The family Tragulidae (mouse deer) is the only representative of Tragulina and phylogenetically important, because it represents the earliest divergence in Ruminantia. The data analyses show that, relative to other ruminantian species, the lesser kudu genome has seen an expansion of BovB Long INterspersed Elements (LINEs) and BovB related Short INterspersed Elements (SINEs) like BOVA2. In comparison the genome of Java mouse deer has fewer BovB elements than other ruminants, especially Bovinae, and has in addition a novel CHR-3 SINE most likely propagated by LINE-1. By contrast the other ruminants have low amounts of CHR SINEs but high numbers of actively propagating BovB-derived and BovB-propagated SINEs. The survey sequencing data suggest that the transposable element landscape in mouse deer (Tragulina) is unique among Ruminantia, suggesting a lineage specific evolutionary trajectory that does not involve BovB mediated retrotransposition. This shows that the genomic landscape of mobile genetic elements can rapidly change in any lineage.
Collapse
Affiliation(s)
- S Gallus
- Senckenberg Biodiversity and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, Senckenberganlage 25, D-60325 Frankfurt am Main, Germany
| | - V Kumar
- Senckenberg Biodiversity and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, Senckenberganlage 25, D-60325 Frankfurt am Main, Germany
| | - M F Bertelsen
- Center for Zoo and Wild Animal Health, Copenhagen Zoo, Roskildevej 38, DK-2000 Frederiksberg, Denmark
| | - A Janke
- Senckenberg Biodiversity and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, Senckenberganlage 25, D-60325 Frankfurt am Main, Germany; Goethe University Frankfurt Institute for Ecology, Evolution & Diversity Biologicum Max-von-Laue-Str.13, D-60439 Frankfurt am Main, Germany
| | - M A Nilsson
- Senckenberg Biodiversity and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, Senckenberganlage 25, D-60325 Frankfurt am Main, Germany.
| |
Collapse
|
35
|
Doucet AJ, Droc G, Siol O, Audoux J, Gilbert N. U6 snRNA Pseudogenes: Markers of Retrotransposition Dynamics in Mammals. Mol Biol Evol 2015; 32:1815-32. [PMID: 25761766 PMCID: PMC4476161 DOI: 10.1093/molbev/msv062] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Transposable elements comprise more than 45% of the human genome and long interspersed nuclear element 1 (LINE-1 or L1) is the only autonomous mobile element remaining active. Since its identification, it has been proposed that L1 contributes to the mobilization and amplification of other cellular RNAs and more recently, experimental demonstrations of this function has been described for many transcripts such as Alu, a nonautonomous mobile element, cellular mRNAs, or small noncoding RNAs. Detailed examination of the mobilization of various cellular RNAs revealed distinct pathways by which they could be recruited during retrotransposition; template choice or template switching. Here, by analyzing genomic structures and retrotransposition signatures associated with small nuclear RNA (snRNA) sequences, we identified distinct recruiting steps during the L1 retrotransposition cycle for the formation of snRNA-processed pseudogenes. Interestingly, some of the identified recruiting steps take place in the nucleus. Moreover, after comparison to other vertebrate genomes, we established that snRNA amplification by template switching is common to many LINE families from several LINE clades. Finally, we suggest that U6 snRNA copies can serve as markers of L1 retrotransposition dynamics in mammalian genomes.
Collapse
Affiliation(s)
- Aurélien J Doucet
- Institut de Génétique Humaine, CNRS, UPR 1142, Montpellier, France Institute for Research on Cancer and Aging, Nice (IRCAN), INSERM, U1081, CNRS UMR 7284, Nice, France
| | - Gaëtan Droc
- Institut de Génétique Humaine, CNRS, UPR 1142, Montpellier, France Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad), UMR AGAP, Montpellier, France
| | - Oliver Siol
- Institut de Génétique Humaine, CNRS, UPR 1142, Montpellier, France Institut de Génétique Humaine, CNRS, UPR 1142, Montpellier, France
| | - Jérôme Audoux
- Institute for Regenerative Medicine and Biotherapy, INSERM, U1183, Montpellier, France
| | - Nicolas Gilbert
- Institut de Génétique Humaine, CNRS, UPR 1142, Montpellier, France Institute for Regenerative Medicine and Biotherapy, INSERM, U1183, Montpellier, France
| |
Collapse
|
36
|
Noll A, Raabe CA, Churakov G, Brosius J, Schmitz J. Ancient traces of tailless retropseudogenes in therian genomes. Genome Biol Evol 2015; 7:889-900. [PMID: 25724209 PMCID: PMC5322556 DOI: 10.1093/gbe/evv040] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Transposable elements, once described by Barbara McClintock as controlling genetic units, not only occupy the largest part of our genome but are also a prominent moving force of genomic plasticity and innovation. They usually replicate and reintegrate into genomes silently, sometimes causing malfunctions or misregulations, but occasionally millions of years later, a few may evolve into new functional units. Retrotransposons make their way into the genome following reverse transcription of RNA molecules and chromosomal insertion. In therian mammals, long interspersed elements 1 (LINE1s) self-propagate but also coretropose many RNAs, including mRNAs and small RNAs that usually exhibit an oligo(A) tail. The revitalization of specific LINE1 elements in the mammalian lineage about 150 Ma parallels the rise of many other nonautonomous mobilized genomic elements. We previously identified and described hundreds of tRNA-derived retropseudogenes missing characteristic oligo(A) tails consequently termed tailless retropseudogenes. Additional analyses now revealed hundreds of thousands of tailless retropseudogenes derived from nearly all types of RNAs. We extracted 2,402 perfect tailless sequences (with discernible flanking target site duplications) originating from tRNAs, spliceosomal RNAs, 5S rRNAs, 7SK RNAs, mRNAs, and others. Interestingly, all are truncated at one or more defined positions that coincide with internal single-stranded regions. 5S ribosomal and U2 spliceosomal RNAs were analyzed in the context of mammalian phylogeny to discern the origin of the therian LINE1 retropositional system that evolved in our 150-Myr-old ancestor.
Collapse
Affiliation(s)
- Angela Noll
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| | - Carsten A Raabe
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany Institute of Evolution and Biodiversity, University of Münster, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany Institute of Evolutionary and Medical Genomics, Brandenburg Medical School, Neuruppin, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| |
Collapse
|
37
|
Lee J, Kim YJ, Mun S, Kim HS, Han K. Identification of human-specific AluS elements through comparative genomics. Gene 2014; 555:208-16. [PMID: 25447892 DOI: 10.1016/j.gene.2014.11.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Revised: 11/03/2014] [Accepted: 11/05/2014] [Indexed: 01/08/2023]
Abstract
Mobile elements are responsible for ~45% of the human genome. Among them is the Alu element, accounting for 10% of the human genome (>1.1million copies). Several studies of Alu elements have reported that they are frequently involved in human genetic diseases and genomic rearrangements. In this study, we investigated the AluS subfamily, which is a relatively old Alu subfamily and has the highest copy number in primate genomes. Previously, a set of 263 human-specific AluS insertions was identified in the human genome. To validate these, we compared each of the human-specific AluS loci with its pre-insertion site in other primate genomes, including chimpanzee, gorilla, and orangutan. We obtained 24 putative human-specific AluS candidates via the in silico analysis and manual inspection, and then tried to verify them using PCR amplification and DNA sequencing. Through the PCR product sequencing, we were able to detect two instances of near-parallel Alu insertions in nearby sites that led to computational false negatives. Finally, we computationally and experimentally verified 23 human-specific AluS elements. We reported three alternative Alu insertion events, which are accompanied by filler DNA and/or Alu retrotransposition mediated-deletion. Bisulfite sequencing was carried out to examine DNA methylation levels of human-specific AluS elements. The results showed that fixed AluS elements are hypermethylated compared with polymorphic elements, indicating a possible relation between DNA methylation and Alu fixation in the human genome.
Collapse
Affiliation(s)
- Jae Lee
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
| | - Yun-Ji Kim
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Seyoung Mun
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea
| | - Kyudong Han
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea.
| |
Collapse
|
38
|
|
39
|
Downs LM, Mellersh CS. An Intronic SINE insertion in FAM161A that causes exon-skipping is associated with progressive retinal atrophy in Tibetan Spaniels and Tibetan Terriers. PLoS One 2014; 9:e93990. [PMID: 24705771 PMCID: PMC3976383 DOI: 10.1371/journal.pone.0093990] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Accepted: 03/10/2014] [Indexed: 11/19/2022] Open
Abstract
Progressive retinal atrophy (PRA) in dogs is characterised by the degeneration of the photoreceptor cells of the retina, resulting in vision loss and eventually complete blindness. The condition affects more than 100 dog breeds and is known to be genetically heterogeneous between breeds. Around 19 mutations have now been identified that are associated with PRA in around 49 breeds, but for the majority of breeds the mutation(s) responsible have yet to be identified. Using genome-wide association with 22 Tibetan Spaniel PRA cases and 10 controls, we identified a novel PRA locus, PRA3, on CFA10 (praw = 2.01×10−5, pgenome = 0.014), where a 3.8 Mb region was homozygous within 12 cases. Using targeted next generation sequencing, a short interspersed nuclear element insertion was identified near a splice acceptor site in an intron of a provocative gene, FAM161A. Analysis of mRNA from an affected dog revealed that the SINE causes exon skipping, resulting in a frame shift, leading to a downstream premature termination codon and possibly a truncated protein product. This mutation segregates with the disease in 22 out of 35 cases tested (63%). Of the PRA controls, none are homozygous for the mutation, 15% carry the mutation and 85% are homozygous wildtype. This mutation was also identified in Tibetan Terriers, although our results indicate that PRA is genetically heterogeneous in both Tibetan Spaniels and Tibetan Terriers.
Collapse
Affiliation(s)
- Louise M. Downs
- Kennel Club Genetics Centre, Animal Health Trust, Newmarket, United Kingdom
| | - Cathryn S. Mellersh
- Kennel Club Genetics Centre, Animal Health Trust, Newmarket, United Kingdom
- * E-mail:
| |
Collapse
|
40
|
Shaw AD, Tiwari Y, Kaplan W, Heath A, Mitchell PB, Schofield PR, Fullerton JM. Characterisation of genetic variation in ST8SIA2 and its interaction region in NCAM1 in patients with bipolar disorder. PLoS One 2014; 9:e92556. [PMID: 24651862 PMCID: PMC3961385 DOI: 10.1371/journal.pone.0092556] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 02/24/2014] [Indexed: 12/30/2022] Open
Abstract
Alpha-2,8-sialyltransferase 2 (ST8SIA2) is an enzyme responsible for the transfer of polysialic acid (PSA) to glycoproteins, principally the neuronal cell adhesion molecule (NCAM1), and is involved in neuronal plasticity. Variants within ST8SIA2 have previously shown association with bipolar disorder, schizophrenia and autism. In addition, altered PSA-NCAM expression in brains of patients with schizophrenia or bipolar disorder indicates a functional dysregulation of glycosylation in mental illness. To explore the role of sequence variation affecting PSA-NCAM formation, we conducted a targeted re-sequencing study of a ∼100 kb region – including the entire ST8SIA2 gene and its region of interaction with NCAM1 – in 48 Caucasian cases with bipolar disorder using the Roche 454 platform. We identified over 400 DNA variants, including 47 putative novel variants not described in dbSNP. Validation of a subset of variants via Sequenom showed high reliability of Roche 454 genotype calls (97% genotype concordance, with 80% of novel variants independently verified). We did not observe major loss-of-function mutations that would affect PSA-NCAM formation, either by ablating ST8SIA2 function or by affecting the ability of NCAM1 to be glycosylated. However, we identified 13 SNPs in the UTRs of ST8SIA2, a synonymous coding SNP in exon 5 (rs2305561, P207P) and many additional non-coding variants that may influence splicing or regulation of ST8SIA2 expression. We calculated nucleotide diversity within ST8SIA2 on specific haplotypes, finding that the diversity on the specific “risk” and “protective” haplotypes was lower than other non-disease-associated haplotypes, suggesting that putative functional variation may have arisen on a spectrum of haplotypes. We have identified common and novel variants (rs11074064, rs722645, 15∶92961050) that exist on a spectrum of haplotypes, yet are plausible candidates for conferring the effect of risk and protective haplotypes via multiple enhancer elements. A Galaxy workflow/pipeline for sequence analysis used herein is available at: https://main.g2.bx.psu.edu/u/a-shaw-neura/p/next-generation-resources.
Collapse
Affiliation(s)
- Alex D Shaw
- Neuroscience Research Australia, Sydney, New South Wales, Australia; Schizophrenia Research Institute, Sydney, New South Wales, Australia
| | - Yash Tiwari
- Neuroscience Research Australia, Sydney, New South Wales, Australia; Schizophrenia Research Institute, Sydney, New South Wales, Australia; School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| | - Warren Kaplan
- Peter Wills Bioinformatic Centre, Garvan Institute, Sydney, New South Wales, Australia
| | - Anna Heath
- Neuroscience Research Australia, Sydney, New South Wales, Australia
| | - Philip B Mitchell
- School of Psychiatry, University of New South Wales, Sydney, New South Wales, Australia; Black Dog Institute, Sydney, New South Wales, Australia
| | - Peter R Schofield
- Neuroscience Research Australia, Sydney, New South Wales, Australia; Schizophrenia Research Institute, Sydney, New South Wales, Australia; School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| | - Janice M Fullerton
- Neuroscience Research Australia, Sydney, New South Wales, Australia; Schizophrenia Research Institute, Sydney, New South Wales, Australia; School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| |
Collapse
|
41
|
Ade C, Roy-Engel AM, Deininger PL. Alu elements: an intrinsic source of human genome instability. Curr Opin Virol 2013; 3:639-45. [PMID: 24080407 PMCID: PMC3982648 DOI: 10.1016/j.coviro.2013.09.002] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2013] [Accepted: 09/09/2013] [Indexed: 11/29/2022]
Abstract
Alu elements are ∼300bp sequences that have amplified via an RNA intermediate leading to the accumulation of over 1 million copies in the human genome. Although a few of the copies are active, Alu germline activity is the highest of all human retrotransposons and does significantly contribute to genetic disease and population diversity. There are two basic mechanisms by which Alu elements contribute to disease: through insertional mutagenesis and as a large source of repetitive sequences that contribute to nonallelic homologous recombination (NAHR) that cause genetic deletions and duplications.
Collapse
Affiliation(s)
- Catherine Ade
- Tulane University, Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane Cancer Center, Consortium Of Mobile Elements at Tulane)
| | - Astrid M. Roy-Engel
- Tulane University, Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane Cancer Center, Consortium Of Mobile Elements at Tulane)
| | - Prescott L. Deininger
- Tulane University, Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane Cancer Center, Consortium Of Mobile Elements at Tulane)
| |
Collapse
|
42
|
McLain AT, Carman GW, Fullerton ML, Beckstrom TO, Gensler W, Meyer TJ, Faulk C, Batzer MA. Analysis of western lowland gorilla (Gorilla gorilla gorilla) specific Alu repeats. Mob DNA 2013; 4:26. [PMID: 24262036 PMCID: PMC4177385 DOI: 10.1186/1759-8753-4-26] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 10/23/2013] [Indexed: 02/07/2023] Open
Abstract
Background Research into great ape genomes has revealed widely divergent activity levels over time for Alu elements. However, the diversity of this mobile element family in the genome of the western lowland gorilla has previously been uncharacterized. Alu elements are primate-specific short interspersed elements that have been used as phylogenetic and population genetic markers for more than two decades. Alu elements are present at high copy number in the genomes of all primates surveyed thus far. The AluY subfamily and its derivatives have been recognized as the evolutionarily youngest Alu subfamily in the Old World primate lineage. Results Here we use a combination of computational and wet-bench laboratory methods to assess and catalog AluY subfamily activity level and composition in the western lowland gorilla genome (gorGor3.1). A total of 1,075 independent AluY insertions were identified and computationally divided into 10 subfamilies, with the largest number of gorilla-specific elements assigned to the canonical AluY subfamily. Conclusions The retrotransposition activity level appears to be significantly lower than that seen in the human and chimpanzee lineages, while higher than that seen in orangutan genomes, indicative of differential Alu amplification in the western lowland gorilla lineage as compared to other Homininae.
Collapse
Affiliation(s)
- Adam T McLain
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA.
| | | | | | | | | | | | | | | |
Collapse
|
43
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
44
|
Murata H, Ota Y, Yamaguchi M, Yamada A, Katahata S, Otsuka Y, Babasaki K, Neda H. Mobile DNA distributions refine the phylogeny of "matsutake" mushrooms, Tricholoma sect. Caligata. MYCORRHIZA 2013; 23:447-461. [PMID: 23440576 DOI: 10.1007/s00572-013-0487-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2012] [Accepted: 02/07/2013] [Indexed: 06/01/2023]
Abstract
"Matsutake" mushrooms are formed by several species of Tricholoma sect. Caligata distributed across the northern hemisphere. A phylogenetic analysis of matsutake based on virtually neutral mutations in DNA sequences resolved robust relationships among Tricholoma anatolicum, Tricholoma bakamatsutake, Tricholoma magnivelare, Tricholoma matsutake, and Tricholoma sp. from Mexico (=Tricholoma sp. Mex). However, relationships among these matsutake and other species, such as Tricholoma caligatum and Tricholoma fulvocastaneum, were ambiguous. We, therefore, analyzed genomic copy numbers of σ marY1 , marY1, and marY2N retrotransposons by comparing them with the single-copy mobile DNA megB1 using real-time polymerase chain reaction (PCR) to clarify matsutake phylogeny. We also examined types of megB1-associated domains, composed of a number of poly (A) and poly (T) reminiscent of RNA-derived DNA elements among these species. Both datasets resolved two distinct groups, one composed of T. bakamatsutake, T. fulvocastaneum, and T. caligatum that could have diverged earlier and the other comprising T. magnivelare, Tricholoma sp. Mex, T. anatolicum, and T. matsutake that could have evolved later. In the first group, T. caligatum was the closest to the second group, followed by T. fulvocastaneum and T. bakamatsutake. Within the second group, T. magnivelare was clearly differentiated from the other species. The data suggest that matsutake underwent substantial evolution between the first group, mostly composed of Fagaceae symbionts, and the second group, comprised only of Pinaceae symbionts, but diverged little within each groups. Mobile DNA markers could be useful in resolving difficult phylogenies due to, for example, closely spaced speciation events.
Collapse
Affiliation(s)
- Hitoshi Murata
- Department of Applied Microbiology and Mushroom Sciences, Forestry and Forest Products Research Institute, Tsukuba, Ibaraki 305-8687, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Grandi FC, An W. Non-LTR retrotransposons and microsatellites: Partners in genomic variation. Mob Genet Elements 2013; 3:e25674. [PMID: 24195012 PMCID: PMC3812793 DOI: 10.4161/mge.25674] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Revised: 07/07/2013] [Accepted: 07/09/2013] [Indexed: 01/10/2023] Open
Abstract
The human genome is laden with both non-LTR (long-terminal repeat) retrotransposons and microsatellite repeats. Both types of sequences are able to, either actively or passively, mutagenize the genomes of human individuals and are therefore poised to dynamically alter the human genomic landscape across generations. Non-LTR retrotransposons, such as L1 and Alu, are a major source of new microsatellites, which are born both concurrently and subsequently to L1 and Alu integration into the genome. Likewise, the mutation dynamics of microsatellite repeats have a direct impact on the fitness of their non-LTR retrotransposon parent owing to microsatellite expansion and contraction. This review explores the interactions and dynamics between non-LTR retrotransposons and microsatellites in the context of genomic variation and evolution.
Collapse
Affiliation(s)
- Fiorella C Grandi
- School of Molecular Biosciences and Center for Reproductive Biology; Washington State University; Pullman, WA USA
| | | |
Collapse
|
46
|
Grandi FC, Rosser JM, An W. LINE-1-derived poly(A) microsatellites undergo rapid shortening and create somatic and germline mosaicism in mice. Mol Biol Evol 2012; 30:503-12. [PMID: 23125228 DOI: 10.1093/molbev/mss251] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Interspersed and tandem repeat sequences comprise the bulk of mammalian genomes. Interspersed repeats result from successive replication by transposable elements, such as Alu and long interspersed element type 1 (L1). Microsatellites are tandem repeats of 1-6 base pairs, among which poly(A) microsatellites are the most abundant in the human genome. The rise and fall of a microsatellite has been depicted as a life cycle. Previous studies have demonstrated that Alu and L1 insertions are a major source of A-rich microsatellites owing to the concurrent formation of a poly(A) DNA tract at the 3'-end of each insertion. The fate of such poly(A) tracts has been studied by surveying the length distribution of genomic resident Alu and L1 insertions. However, these cross-sectional studies provide no information about the tempo of mutation immediately after birth. In this study, de novo L1 insertions were created using a transgenic L1 mouse model and traced through generations to investigate the early life of poly(A) microsatellites. High frequencies of intra-individual and intergenerational shortening were observed for long poly(A) tracts, creating somatic and germline mosaicism at the insertion site, whereas little variation was observed for short poly(A) alleles. As poly(A) microsatellites are the major intrinsic signal for nucleosome positioning, their remarkable abundance and variability make them a significant source of epigenetic variation. Thus, the birth of poly(A) microsatellites from retrotransposons and the subsequent rapid and variable shortening represent a new way with which retrotransposons can modify the genetic and epigenetic architecture of our genome.
Collapse
Affiliation(s)
- Fiorella C Grandi
- School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, USA
| | | | | |
Collapse
|
47
|
Glunčić M, Paar V. Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm. Nucleic Acids Res 2012; 41:e17. [PMID: 22977183 PMCID: PMC3592446 DOI: 10.1093/nar/gks721] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes).
Collapse
Affiliation(s)
- Matko Glunčić
- Faculty of Science, University of Zagreb, Bijenička 32 and Croatian Academy of Sciences and Arts, Zrinski trg 11, 10000 Zagreb, Croatia.
| | | |
Collapse
|
48
|
Poly(A) binding protein C1 is essential for efficient L1 retrotransposition and affects L1 RNP formation. Mol Cell Biol 2012; 32:4323-36. [PMID: 22907758 DOI: 10.1128/mcb.06785-11] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Poly(A) binding proteins (PABPs) specifically bind the polyadenosine tail of mRNA and have been shown to be important for RNA polyadenylation, translation initiation, and mRNA stability. Using a modified L1 retrotransposition vector, we examined the effects of two PABPs (encoded by PABPN1 and PABPC1) on the retrotransposition activity of the L1 non-long-terminal-repeat (non-LTR) retrotransposon in both HeLa and HEK293T cells. We demonstrated that knockdown of these two genes by RNA interference (RNAi) effectively reduced L1 retrotransposition by 70 to 80% without significantly changing L1 transcription or translation or the status of the poly(A) tail. We identified that both poly(A) binding proteins were associated with the L1 ribonucleoprotein complex, presumably through L1 mRNA. Depletion of PABPC1 caused a defect in L1 RNP formation. Knockdown of the PABPC1 inhibitor PAIP2 increased L1 retrotransposition up to 2-fold. Low levels of exogenous overexpression of PABPN1 and PABPC1 increased L1 retrotransposition, whereas unregulated overexpression of these two proteins caused pleiotropic effects, such as hypersensitivity to puromycin and decreased L1 activity. Our data suggest that PABPC1 is essential for the formation of L1 RNA-protein complexes and may play a role in L1 RNP translocation in the host cell.
Collapse
|
49
|
Wagstaff BJ, Hedges DJ, Derbes RS, Campos Sanchez R, Chiaromonte F, Makova KD, Roy-Engel AM. Rescuing Alu: recovery of new inserts shows LINE-1 preserves Alu activity through A-tail expansion. PLoS Genet 2012; 8:e1002842. [PMID: 22912586 PMCID: PMC3415434 DOI: 10.1371/journal.pgen.1002842] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Accepted: 05/30/2012] [Indexed: 12/15/2022] Open
Abstract
Alu elements are trans-mobilized by the autonomous non-LTR retroelement, LINE-1 (L1). Alu-induced insertion mutagenesis contributes to about 0.1% human genetic disease and is responsible for the majority of the documented instances of human retroelement insertion-induced disease. Here we introduce a SINE recovery method that provides a complementary approach for comprehensive analysis of the impact and biological mechanisms of Alu retrotransposition. Using this approach, we recovered 226 de novo tagged Alu inserts in HeLa cells. Our analysis reveals that in human cells marked Alu inserts driven by either exogenously supplied full length L1 or ORF2 protein are indistinguishable. Four percent of de novo Alu inserts were associated with genomic deletions and rearrangements and lacked the hallmarks of retrotransposition. In contrast to L1 inserts, 5′ truncations of Alu inserts are rare, as most of the recovered inserts (96.5%) are full length. De novo Alus show a random pattern of insertion across chromosomes, but further characterization revealed an Alu insertion bias exists favoring insertion near other SINEs, highly conserved elements, with almost 60% landing within genes. De novo Alu inserts show no evidence of RNA editing. Priming for reverse transcription rarely occurred within the first 20 bp (most 5′) of the A-tail. The A-tails of recovered inserts show significant expansion, with many at least doubling in length. Sequence manipulation of the construct led to the demonstration that the A-tail expansion likely occurs during insertion due to slippage by the L1 ORF2 protein. We postulate that the A-tail expansion directly impacts Alu evolution by reintroducing new active source elements to counteract the natural loss of active Alus and minimizing Alu extinction. SINEs are mobile elements that are found ubiquitously throughout a large diversity of genomes from plants to mammals. The human SINE, Alu, is among the most successful mobile elements, with more than one million copies in the genome. Due to its high activity and ability to insert throughout the genome, Alu retrotransposition is responsible for the majority of diseases reported to be caused by mobile element activity. To further evaluate the genomic impact of SINEs, we recovered and characterized over 200 de novo Alu inserts under controlled conditions. Our data reinforce observations on the mutagenic potential of Alu, with newly retrotransposed Alu elements favoring insertion into genic and highly conserved elements. Alu-mediated deletions and rearrangements are infrequent and lack the typical hallmarks of TPRT retrotransposition, suggesting the use of an alternate method for resolving retrotransposition intermediates or an atypical insertion mechanism. Our data also provide novel insights into SINE retrotransposition biology. We found that slippage of L1 ORF2 protein during reverse transcription expands the A-tails of de novo insertions. We propose that the L1 ORF2 protein plays a major role in minimizing Alu extinction by reintroducing active Alu elements to counter the natural loss of Alu source elements.
Collapse
Affiliation(s)
- Bradley J. Wagstaff
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
| | - Dale J. Hedges
- Hussman Institute for Human Genomics, Dr. John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Rebecca S. Derbes
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
| | - Rebeca Campos Sanchez
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Francesca Chiaromonte
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Kateryna D. Makova
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Astrid M. Roy-Engel
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
- * E-mail:
| |
Collapse
|
50
|
Oler AJ, Traina-Dorge S, Derbes RS, Canella D, Cairns BR, Roy-Engel AM. Alu expression in human cell lines and their retrotranspositional potential. Mob DNA 2012; 3:11. [PMID: 22716230 PMCID: PMC3412727 DOI: 10.1186/1759-8753-3-11] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2012] [Accepted: 06/20/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The vast majority of the 1.1 million Alu elements are retrotranspositionally inactive, where only a few loci referred to as 'source elements' can generate new Alu insertions. The first step in identifying the active Alu sources is to determine the loci transcribed by RNA polymerase III (pol III). Previous genome-wide analyses from normal and transformed cell lines identified multiple Alu loci occupied by pol III factors, making them candidate source elements. FINDINGS Analysis of the data from these genome-wide studies determined that the majority of pol III-bound Alus belonged to the older subfamilies Alu S and Alu J, which varied between cell lines from 62.5% to 98.7% of the identified loci. The pol III-bound Alus were further scored for estimated retrotransposition potential (ERP) based on the absence or presence of selected sequence features associated with Alu retrotransposition capability. Our analyses indicate that most of the pol III-bound Alu loci candidates identified lack the sequence characteristics important for retrotransposition. CONCLUSIONS These data suggest that Alu expression likely varies by cell type, growth conditions and transformation state. This variation could extend to where the same cell lines in different laboratories present different Alu expression patterns. The vast majority of Alu loci potentially transcribed by RNA pol III lack important sequence features for retrotransposition and the majority of potentially active Alu loci in the genome (scored high ERP) belong to young Alu subfamilies. Our observations suggest that in an in vivo scenario, the contribution of Alu activity on somatic genetic damage may significantly vary between individuals and tissues.
Collapse
Affiliation(s)
- Andrew J Oler
- Department of Oncological Sciences, Huntsman Cancer Institute, and Howard Hughes Medical Institute, University of Utah School of Medicine, Salt Lake City, UT USA.,Bioinformatics and Computational Biosciences Branch, Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Stephen Traina-Dorge
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University, 1430 Tulane Ave, New Orleans, LA 70112, USA
| | - Rebecca S Derbes
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University, 1430 Tulane Ave, New Orleans, LA 70112, USA
| | - Donatella Canella
- Center for Integrative Genomics (CIG), Faculty of Biology and Medicine, University of Lausanne, Lausanne 1015, Switzerland
| | - Brad R Cairns
- Department of Oncological Sciences, Huntsman Cancer Institute, and Howard Hughes Medical Institute, University of Utah School of Medicine, Salt Lake City, UT USA
| | - Astrid M Roy-Engel
- Tulane Cancer Center SL-66, Dept. of Epidemiology, Tulane University, 1430 Tulane Ave, New Orleans, LA 70112, USA
| |
Collapse
|