1
|
Liao X, Zhu W, Zhou J, Li H, Xu X, Zhang B, Gao X. Repetitive DNA sequence detection and its role in the human genome. Commun Biol 2023; 6:954. [PMID: 37726397 PMCID: PMC10509279 DOI: 10.1038/s42003-023-05322-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Repetitive DNA sequences playing critical roles in driving evolution, inducing variation, and regulating gene expression. In this review, we summarized the definition, arrangement, and structural characteristics of repeats. Besides, we introduced diverse biological functions of repeats and reviewed existing methods for automatic repeat detection, classification, and masking. Finally, we analyzed the type, structure, and regulation of repeats in the human genome and their role in the induction of complex diseases. We believe that this review will facilitate a comprehensive understanding of repeats and provide guidance for repeat annotation and in-depth exploration of its association with human diseases.
Collapse
Affiliation(s)
- Xingyu Liao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Wufei Zhu
- Department of Endocrinology, Yichang Central People's Hospital, The First College of Clinical Medical Science, China Three Gorges University, 443000, Yichang, P.R. China
| | - Juexiao Zhou
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Haoyang Li
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xiaopeng Xu
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Bin Zhang
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia.
| |
Collapse
|
2
|
Dumas F, Perelman PL, Biltueva L, Roelke-Parker ME. Retrotransposon mapping in spider monkey genomes of the family Atelidae (Platyrrhini, Primates) shows a high level of LINE-1 amplification. JOURNAL OF BIOLOGICAL RESEARCH - BOLLETTINO DELLA SOCIETÀ ITALIANA DI BIOLOGIA SPERIMENTALE 2022. [DOI: 10.4081/jbr.2022.10725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
To investigate the distribution of LINE-1 repeat sequences, a LINE-1 probe was Fluorescence In Situ Hybridized (FISH) on the chromosomes of Ateles geoffroyi and Ateles fusciceps (Atelidae); a LINE-1 probe was also mapped on Cebuella pygmaea (Cebidae) and used as an outgroup for phylogenetic comparison. Ateles spider monkeys have a highly rearranged genome and are an ideal model for testing whether LINE-1 is involved in genome evolution. The LINE-1 probe has been mapped in the two Atelidae species for the first time, revealing a high accumulation of LINE-1 sequences along chromosomal arms, including telomeres, and a scarcity of LINE-1 signals at centromere positions. LINE-1 mapping in C. pygmaea (Cebidae) revealed signals at centromere positions and along chromosome arms, which was consistent with previous published data from other Cebidae species. In a broader sense, the results were analyzed in light of published data on whole-chromosomal human probes mapped in these genomes. This analysis allows us to speculate about the presence of LINE-1 sequences at the junction of human chromosomal syntenies, as well as a possible link between these sequences and chromosomal rearrangements.
Collapse
|
3
|
Lee H, Min JW, Mun S, Han K. Human Retrotransposons and Effective Computational Detection Methods for Next-Generation Sequencing Data. LIFE (BASEL, SWITZERLAND) 2022; 12:life12101583. [PMID: 36295018 PMCID: PMC9605557 DOI: 10.3390/life12101583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/03/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022]
Abstract
Transposable elements (TEs) are classified into two classes according to their mobilization mechanism. Compared to DNA transposons that move by the "cut and paste" mechanism, retrotransposons mobilize via the "copy and paste" method. They have been an essential research topic because some of the active elements, such as Long interspersed element 1 (LINE-1), Alu, and SVA elements, have contributed to the genetic diversity of primates beyond humans. In addition, they can cause genetic disorders by altering gene expression and generating structural variations (SVs). The development and rapid technological advances in next-generation sequencing (NGS) have led to new perspectives on detecting retrotransposon-mediated SVs, especially insertions. Moreover, various computational methods have been developed based on NGS data to precisely detect the insertions and deletions in the human genome. Therefore, this review discusses details about the recently studied and utilized NGS technologies and the effective computational approaches for discovering retrotransposons through it. The final part covers a diverse range of computational methods for detecting retrotransposon insertions with human NGS data. This review will give researchers insights into understanding the TEs and how to investigate them and find connections with research interests.
Collapse
Affiliation(s)
- Haeun Lee
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
| | - Jun Won Min
- Department of Surgery, Dankook University College of Medicine, Cheonan 31116, Korea
| | - Seyoung Mun
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- Correspondence: (S.M.); (K.H.)
| | - Kyudong Han
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- HuNbiome Co., Ltd., R&D Center, Seoul 08507, Korea
- Correspondence: (S.M.); (K.H.)
| |
Collapse
|
4
|
Genomic Organization of Microsatellites and LINE-1-like Retrotransposons: Evolutionary Implications for Ctenomys minutus (Rodentia: Ctenomyidae) Cytotypes. Animals (Basel) 2022; 12:ani12162091. [PMID: 36009681 PMCID: PMC9405301 DOI: 10.3390/ani12162091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/28/2022] [Accepted: 08/13/2022] [Indexed: 12/05/2022] Open
Abstract
Simple Summary In animals, several species contain substantial chromosomal and genomic variation among their populations, but as to what could have driven such diversification is still a puzzle for most cases. Here, we used molecular cytogenetic analysis to expose the main genomic elements involved in the population variation observed in the Neotropical underground rodents of the genus Ctenomys (Rodentia: Ctenomyidae), which harbor the most significant chromosomal variation among mammals (2n = 10 to 2n = 70). These data provide evidence for a correlation between repetitive genomic content and localization of evolutionary breakpoint regions (EBRs) and highlight their direct impact in promoting chromosomal rearrangements. Abstract The Neotropical underground rodents of the genus Ctenomys (Rodentia: Ctenomyidae) comprise about 65 species, which harbor the most significant chromosomal variation among mammals (2n = 10 to 2n = 70). Among them, C. minutus stands out with 45 different cytotypes already identified, among which, seven parental ones, named A to G, are parapatrically distributed in the coastal plains of Southern Brazil. Looking for possible causes that led to such extensive karyotype diversification, we performed chromosomal mapping of different repetitive DNAs, including microsatellites and long interspersed element-1 (LINE-1) retrotransposons in the seven parental cytotypes. Although microsatellites were found mainly in the centromeric and telomeric regions of the chromosomes, different patterns occur for each cytotype, thus revealing specific features. Likewise, the LINE-1-like retrotransposons also showed a differential distribution for each cytotype, which may be linked to stochastic loss of LINE-1 in some populations. Here, microsatellite motifs (A)30, (C)30, (CA)15, (CAC)10, (CAG)10, (CGG)10, (GA)15, and (GAG)10 could be mapped to fusion of chromosomes 20/17, fission and inversion in the short arm of chromosome 2, fusion of chromosomes 23/19, and different combinations of centric and tandem fusions of chromosomes 22/24/16. These data provide evidence for a correlation between repetitive genomic content and localization of evolutionary breakpoints and highlight their direct impact in promoting chromosomal rearrangements.
Collapse
|
5
|
Ceraulo S, Perelman PL, Dumas F. Massive LINE‐1 retrotransposon enrichment in tamarins of the Cebidae family (Platyrrhini, Primates) and its significance for genome evolution. J ZOOL SYST EVOL RES 2021. [DOI: 10.1111/jzs.12536] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Simona Ceraulo
- Department of “Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche (STEBICEF)” University of Palermo Palermo Italy
| | | | - Francesca Dumas
- Department of “Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche (STEBICEF)” University of Palermo Palermo Italy
| |
Collapse
|
6
|
Riba A, Fumagalli MR, Caselle M, Osella M. A Model-Driven Quantitative Analysis of Retrotransposon Distributions in the Human Genome. Genome Biol Evol 2021; 12:2045-2059. [PMID: 32986810 PMCID: PMC7750997 DOI: 10.1093/gbe/evaa201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2020] [Indexed: 12/21/2022] Open
Abstract
Retrotransposons, DNA sequences capable of creating copies of themselves, compose about half of the human genome and played a central role in the evolution of mammals. Their current position in the host genome is the result of the retrotranscription process and of the following host genome evolution. We apply a model from statistical physics to show that the genomic distribution of the two most populated classes of retrotransposons in human deviates from random placement, and that this deviation increases with time. The time dependence suggests a major role of the host genome dynamics in shaping the current retrotransposon distributions. Focusing on a neutral scenario, we show that a simple model based on random placement followed by genome expansion and sequence duplications can reproduce the empirical retrotransposon distributions, even though more complex and possibly selective mechanisms can have contributed. Besides the inherent interest in understanding the origin of current retrotransposon distributions, this work sets a general analytical framework to analyze quantitatively the effects of genome evolutionary dynamics on the distribution of genomic elements.
Collapse
Affiliation(s)
| | - Maria Rita Fumagalli
- Institute of Biophysics - CNR, National Research Council, Genova, Italy.,Department of Environmental Science and Policy, Center for Complexity and Biosystems, University of Milan, Milano, Italy
| | - Michele Caselle
- Department of Physics and INFN, University of Torino, Torino, Italy
| | - Matteo Osella
- Department of Physics and INFN, University of Torino, Torino, Italy
| |
Collapse
|
7
|
Lewis JJ, Cicconardi F, Martin SH, Reed RD, Danko CG, Montgomery SH. The Dryas iulia Genome Supports Multiple Gains of a W Chromosome from a B Chromosome in Butterflies. Genome Biol Evol 2021; 13:evab128. [PMID: 34117762 PMCID: PMC8290107 DOI: 10.1093/gbe/evab128] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/29/2021] [Indexed: 12/17/2022] Open
Abstract
In butterflies and moths, which exhibit highly variable sex determination mechanisms, the homogametic Z chromosome is deeply conserved and is featured in many genome assemblies. The evolution and origin of the female W sex chromosome, however, remains mostly unknown. Previous studies have proposed that a ZZ/Z0 sex determination system is ancestral to Lepidoptera, and that W chromosomes may originate from sex-linked B chromosomes. Here, we sequence and assemble the female Dryas iulia genome into 32 highly contiguous ordered and oriented chromosomes, including the Z and W sex chromosomes. We then use sex-specific Hi-C, ATAC-seq, PRO-seq, and whole-genome DNA sequence data sets to test if features of the D. iulia W chromosome are consistent with a hypothesized B chromosome origin. We show that the putative W chromosome displays female-associated DNA sequence, gene expression, and chromatin accessibility to confirm the sex-linked function of the W sequence. In contrast with expectations from studies of homologous sex chromosomes, highly repetitive DNA content on the W chromosome, the sole presence of domesticated repetitive elements in functional DNA, and lack of sequence homology with the Z chromosome or autosomes is most consistent with a B chromosome origin for the W, although it remains challenging to rule out extensive sequence divergence. Synteny analysis of the D. iulia W chromosome with other female lepidopteran genome assemblies shows no homology between W chromosomes and suggests multiple, independent origins of the W chromosome from a B chromosome likely occurred in butterflies.
Collapse
Affiliation(s)
- James J Lewis
- Baker Institute for Animal Health, Cornell University, Ithaca, New York, USA
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York, USA
| | - Francesco Cicconardi
- School of Biological Sciences, University of Bristol, United Kingdom
- Department of Zoology, University of Cambridge, United Kingdom
| | - Simon H Martin
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom
| | - Robert D Reed
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York, USA
| | - Charles G Danko
- Baker Institute for Animal Health, Cornell University, Ithaca, New York, USA
| | | |
Collapse
|
8
|
McDonald TL, Zhou W, Castro CP, Mumm C, Switzenberg JA, Mills RE, Boyle AP. Cas9 targeted enrichment of mobile elements using nanopore sequencing. Nat Commun 2021; 12:3586. [PMID: 34117247 PMCID: PMC8196195 DOI: 10.1038/s41467-021-23918-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 05/25/2021] [Indexed: 02/05/2023] Open
Abstract
Mobile element insertions (MEIs) are repetitive genomic sequences that contribute to genetic variation and can lead to genetic disorders. Targeted and whole-genome approaches using short-read sequencing have been developed to identify reference and non-reference MEIs; however, the read length hampers detection of these elements in complex genomic regions. Here, we pair Cas9-targeted nanopore sequencing with computational methodologies to capture active MEIs in human genomes. We demonstrate parallel enrichment for distinct classes of MEIs, averaging 44% of reads on-targeted signals and exhibiting a 13.4-54x enrichment over whole-genome approaches. We show an individual flow cell can recover most MEIs (97% L1Hs, 93% AluYb, 51% AluYa, 99% SVA_F, and 65% SVA_E). We identify seventeen non-reference MEIs in GM12878 overlooked by modern, long-read analysis pipelines, primarily in repetitive genomic regions. This work introduces the utility of nanopore sequencing for MEI enrichment and lays the foundation for rapid discovery of elusive, repetitive genetic elements.
Collapse
Affiliation(s)
- Torrin L McDonald
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Christopher P Castro
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Camille Mumm
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Jessica A Switzenberg
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Ryan E Mills
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | - Alan P Boyle
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
9
|
Zhu X, Zhou B, Pattni R, Gleason K, Tan C, Kalinowski A, Sloan S, Fiston-Lavier AS, Mariani J, Petrov D, Barres BA, Duncan L, Abyzov A, Vogel H, Moran JV, Vaccarino FM, Tamminga CA, Levinson DF, Urban AE. Machine learning reveals bilateral distribution of somatic L1 insertions in human neurons and glia. Nat Neurosci 2021; 24:186-196. [PMID: 33432196 PMCID: PMC8806165 DOI: 10.1038/s41593-020-00767-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Accepted: 11/21/2020] [Indexed: 02/06/2023]
Abstract
Retrotransposons can cause somatic genome variation in the human nervous system, which is hypothesized to have relevance to brain development and neuropsychiatric disease. However, the detection of individual somatic mobile element insertions presents a difficult signal-to-noise problem. Using a machine-learning method (RetroSom) and deep whole-genome sequencing, we analyzed L1 and Alu retrotransposition in sorted neurons and glia from human brains. We characterized two brain-specific L1 insertions in neurons and glia from a donor with schizophrenia. There was anatomical distribution of the L1 insertions in neurons and glia across both hemispheres, indicating retrotransposition occurred during early embryogenesis. Both insertions were within the introns of genes (CNNM2 and FRMD4A) inside genomic loci associated with neuropsychiatric disorders. Proof-of-principle experiments revealed these L1 insertions significantly reduced gene expression. These results demonstrate that RetroSom has broad applications for studies of brain development and may provide insight into the possible pathological effects of somatic retrotransposition.
Collapse
Affiliation(s)
- Xiaowei Zhu
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
- Department of Genetics, Stanford University, Palo Alto, CA, USA
| | - Bo Zhou
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
- Department of Genetics, Stanford University, Palo Alto, CA, USA
| | - Reenal Pattni
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
- Department of Genetics, Stanford University, Palo Alto, CA, USA
| | - Kelly Gleason
- Division of Translational Research in Schizophrenia, Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Chunfeng Tan
- Division of Translational Research in Schizophrenia, Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Agnieszka Kalinowski
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
| | - Steven Sloan
- Department of Human Genetics, Emory University, Atlanta, GA, USA
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS-UM-IRD-EPHE), Université de Montpellier, Montpellier, France
| | | | - Dmitri Petrov
- Department of Biology, Stanford University, Palo Alto, CA, USA
| | - Ben A Barres
- Department of Neurobiology, Stanford University, Palo Alto, CA, USA
| | - Laramie Duncan
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
| | - Alexej Abyzov
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Hannes Vogel
- Department of Pathology, Stanford University, Palo Alto, CA, USA
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Flora M Vaccarino
- Child Study Center, Yale University, New Haven, CT, USA
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
| | - Carol A Tamminga
- Division of Translational Research in Schizophrenia, Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Douglas F Levinson
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA
| | - Alexander E Urban
- Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA, USA.
- Department of Genetics, Stanford University, Palo Alto, CA, USA.
| |
Collapse
|
10
|
Solassol J, Larrieux M, Leclerc J, Ducros V, Corsini C, Chiésa J, Pujol P, Rey JM. Alu element insertion in the MLH1 exon 6 coding sequence as a mutation predisposing to Lynch syndrome. Hum Mutat 2020; 40:716-720. [PMID: 30815977 DOI: 10.1002/humu.23725] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/05/2019] [Accepted: 02/12/2019] [Indexed: 12/17/2022]
Abstract
Lynch syndrome (LS) is the most frequent cause of hereditary colorectal cancer. A subset of patients with a history of LS shows no causal germline pathogenic alteration and are identified as having Lynch-like syndrome (LLS). Alu retrotransposons are the most abundant mobile DNA sequences in the human genome and have been associated with numerous human cancers by either disrupting coding regions or altering epigenetic modifications or splicing signals. We report a family first classified as having LLS by Sanger sequencing analysis. Next-generation sequencing (NGS) analysis identified an AluY5a insertion in MLH1 exon 6 that led to exon skipping. This splicing alteration inducing a pathogenic frameshift was found in patients who developed colorectal adenocarcinomas. Retroelement insertion might thus be an important but underestimated mechanism of cancer genetics that could be systematically tested in patients with a phenotype suggesting LS to accurately assess family risk and surveillance approaches.
Collapse
Affiliation(s)
- Jérôme Solassol
- Department of Pathology and Oncobiology, Montpellier University Hospital, Montpellier, France.,IRCM Inserm, Montpellier University, Montpellier, France
| | - Marion Larrieux
- Department of Pathology and Oncobiology, Montpellier University Hospital, Montpellier, France
| | - Julie Leclerc
- Department of Biochemistry and Molecular Biology, and Inserm UMR-S, Lille University Hospital, JPA Research Center, Lille University, Lille, France
| | - Vincent Ducros
- Department of Pathology and Oncobiology, Montpellier University Hospital, Montpellier, France
| | - Carole Corsini
- Department of Genetics, Montpellier University Hospital, Montpellier, France
| | - Jean Chiésa
- Department of Cytogenetics, Nimes University Hospital, Nîmes, France
| | - Pascal Pujol
- Department of Genetics, Montpellier University Hospital, Montpellier, France.,Montpellier University, Montpellier, France
| | - Jean-Marc Rey
- Department of Pathology and Oncobiology, Montpellier University Hospital, Montpellier, France
| |
Collapse
|
11
|
Zhou W, Emery SB, Flasch DA, Wang Y, Kwan KY, Kidd JM, Moran JV, Mills RE. Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology. Nucleic Acids Res 2020; 48:1146-1163. [PMID: 31853540 PMCID: PMC7026601 DOI: 10.1093/nar/gkz1173] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 11/14/2019] [Accepted: 12/05/2019] [Indexed: 11/13/2022] Open
Abstract
Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.
Collapse
Affiliation(s)
- Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Sarah B Emery
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Diane A Flasch
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Yifan Wang
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Kenneth Y Kwan
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Molecular and Behavioral Neuroscience Institute, University of Michigan Medical School, 109 Zina Pitcher Place, Ann Arbor, MI 48109, USA
| | - Jeffrey M Kidd
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Department of Internal Medicine, University of Michigan, 1500 East Medical Center Drive, Ann Arbor, MI 48109, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| |
Collapse
|
12
|
Somatic LINE-1 retrotransposition in cortical neurons and non-brain tissues of Rett patients and healthy individuals. PLoS Genet 2019; 15:e1008043. [PMID: 30973874 PMCID: PMC6478352 DOI: 10.1371/journal.pgen.1008043] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 04/23/2019] [Accepted: 02/21/2019] [Indexed: 11/19/2022] Open
Abstract
Mounting evidence supports that LINE-1 (L1) retrotransposition can occur postzygotically in healthy and diseased human tissues, contributing to genomic mosaicism in the brain and other somatic tissues of an individual. However, the genomic distribution of somatic human-specific LINE-1 (L1Hs) insertions and their potential impact on carrier cells remain unclear. Here, using a PCR-based targeted bulk sequencing approach, we profiled 9,181 somatic insertions from 20 postmortem tissues from five Rett patients and their matched healthy controls. We identified and validated somatic L1Hs insertions in both cortical neurons and non-brain tissues. In Rett patients, somatic insertions were significantly depleted in exons—mainly contributed by long genes—than healthy controls, implying that cells carrying MECP2 mutations might be defenseless against a second exonic L1Hs insertion. We observed a significant increase of somatic L1Hs insertions in the brain compared with non-brain tissues from the same individual. Compared to germline insertions, somatic insertions were less sense-depleted to transcripts, indicating that they underwent weaker selective pressure on the orientation of insertion. Our observations demonstrate that somatic L1Hs insertions contribute to genomic diversity and MeCP2 dysfunction alters their genomic patterns in Rett patients. Human-specific LINE-1 (L1Hs) is the most active autonomous retrotransposon family in the human genome. Mounting evidence supports that L1Hs retrotransposition occurs postzygotically in the human brain cells, contributing to neuronal genomic diversity, but the extent of L1Hs-driven mosaicism in the brain is debated. In this study, we profiled genome-wide L1Hs insertions among 20 postmortem tissues from Rett patients and matched controls. We identified and validated somatic L1Hs insertions in both cortical neurons and non-brain tissues, with a higher jumping activity in the brain. We further found that MeCP2 dysfunction might alter the genomic pattern of somatic L1Hs in Rett patients.
Collapse
|
13
|
Shin W, Mun S, Kim J, Lee W, Park DG, Choi S, Lee TY, Cha S, Han K. Novel Discovery of LINE-1 in a Korean Individual by a Target Enrichment Method. Mol Cells 2019; 42:87-95. [PMID: 30699287 PMCID: PMC6354063 DOI: 10.14348/molcells.2018.0351] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 10/10/2018] [Accepted: 10/26/2018] [Indexed: 11/27/2022] Open
Abstract
Long interspersed element-1 (LINE-1 or L1) is an autonomous retrotransposon, which is capable of inserting into a new region of genome. Previous studies have reported that these elements lead to genomic variations and altered functions by affecting gene expression and genetic networks. Mounting evidence strongly indicates that genetic diseases or various cancers can occur as a result of retrotransposition events that involve L1s. Therefore, the development of methodologies to study the structural variations and interpersonal insertion polymorphisms by L1 element-associated changes in an individual genome is invaluable. In this study, we applied a systematic approach to identify human-specific L1s (i.e., L1Hs) through the bioinformatics analysis of high-throughput next-generation sequencing data. We identified 525 candidates that could be inferred to carry non-reference L1Hs in a Korean individual genome (KPGP9). Among them, we randomly selected 40 candidates and validated that approximately 92.5% of non-reference L1Hs were inserted into a KPGP9 genome. In addition, unlike conventional methods, our relatively simple and expedited approach was highly reproducible in confirming the L1 insertions. Taken together, our findings strongly support that the identification of non-reference L1Hs by our novel target enrichment method demonstrates its future application to genomic variation studies on the risk of cancer and genetic disorders.
Collapse
Affiliation(s)
- Wonseok Shin
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116,
Korea
| | - Seyoung Mun
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116,
Korea
| | - Junse Kim
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116,
Korea
| | - Wooseok Lee
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116,
Korea
| | - Dong-Guk Park
- Department of Surgery, Dankook University College of Medicine, Cheonan 31116,
Korea
| | - Seungkyu Choi
- Department of Pathology, Dankook University College of Medicine, Cheonan 31116,
Korea
| | - Tae Yoon Lee
- Department of Technology Education and Department of Biomedical Engineering, Chungnam National University, Daejeon 34134,
Korea
| | - Seunghee Cha
- Department of Oral and Maxillofacial Diagnostic Sciences, University of Florida College of Dentistry, Gainesville, FL 32610,
USA
| | - Kyudong Han
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116,
Korea
| |
Collapse
|
14
|
Richardson SR, Faulkner GJ. Heritable L1 Retrotransposition Events During Development: Understanding Their Origins: Examination of heritable, endogenous L1 retrotransposition in mice opens up exciting new questions and research directions. Bioessays 2018; 40:e1700189. [PMID: 29709066 PMCID: PMC6681178 DOI: 10.1002/bies.201700189] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 03/04/2018] [Indexed: 01/08/2023]
Abstract
The retrotransposon Long Interspersed Element 1 (LINE-1 or L1) has played a major role in shaping the sequence composition of the mammalian genome. In our recent publication, "Heritable L1 retrotransposition in the mouse primordial germline and early embryo," we systematically assessed the rate and developmental timing of de novo, heritable endogenous L1 insertions in mice. Such heritable retrotransposition events allow L1 to exert an ongoing influence upon genome evolution. Here, we place our findings in the context of earlier studies, and highlight how our results corroborate, and depart from, previous research based on human patient samples and transgenic mouse models harboring engineered L1 reporter genes. In parallel, we outline outstanding questions regarding the stage-specificity, regulation, and functional impact of embryonic and germline L1 retrotransposition, and propose avenues for future research in this field.
Collapse
Affiliation(s)
- Sandra R. Richardson
- Mater Research Institute–University of QueenslandWoolloongabbaQueensland 4102Australia
| | - Geoffrey J. Faulkner
- Mater Research Institute–University of QueenslandWoolloongabbaQueensland 4102Australia
- Queensland Brain InstituteUniversity of QueenslandBrisbaneQueensland 4072Australia
- School of Biomedical SciencesUniversity of QueenslandBrisbaneQueensland 4072Australia
| |
Collapse
|
15
|
Yu Q, Zhang W, Zhang X, Zeng Y, Wang Y, Wang Y, Xu L, Huang X, Li N, Zhou X, Lu J, Guo X, Li G, Hou Y, Liu S, Li B. Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection. Gigascience 2018; 6:1-11. [PMID: 28938719 PMCID: PMC5603766 DOI: 10.1093/gigascience/gix066] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 07/20/2017] [Indexed: 12/20/2022] Open
Abstract
Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms.
Collapse
Affiliation(s)
- Qichao Yu
- BGI Education Center, UCAS: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Wei Zhang
- BGI Education Center, UCAS: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Xiaolong Zhang
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Yongli Zeng
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Yeming Wang
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Yanhui Wang
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Liqin Xu
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Xiaoyun Huang
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Nannan Li
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Xinlan Zhou
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Jie Lu
- BGI College: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Xiaosen Guo
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Guibo Li
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,Department of Biology, University of Copenhagen: Nørregade 10, Copenhagen 1165, Denmark
| | - Yong Hou
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,Department of Biology, University of Copenhagen: Nørregade 10, Copenhagen 1165, Denmark
| | - Shiping Liu
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,School of Biology and Biological Engineering, SCUT: Postdoctoral Apartment Building, South China University of Technology, Wushan RD., TianHe District, Guangzhou, 510640, China
| | - Bo Li
- BGI-Shenzhen: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.,BGI-Forensics: Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| |
Collapse
|
16
|
de Sotero-Caio CG, Cabral-de-Mello DC, Calixto MDS, Valente GT, Martins C, Loreto V, de Souza MJ, Santos N. Centromeric enrichment of LINE-1 retrotransposons and its significance for the chromosome evolution of Phyllostomid bats. Chromosome Res 2017; 25:313-325. [PMID: 28916913 DOI: 10.1007/s10577-017-9565-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2017] [Revised: 08/24/2017] [Accepted: 08/28/2017] [Indexed: 10/18/2022]
Abstract
Despite their ubiquitous incidence, little is known about the chromosomal distribution of long interspersed elements (LINEs) in mammalian genomes. Phyllostomid bats, characterized by lineages with distinct trends of chromosomal evolution coupled with remarkable ecological and taxonomic diversity, represent good models to understand how these repetitive sequences contribute to the evolution of genome architecture and its link to lineage diversification. To test the hypothesis that LINE-1 sequences were important modifiers of bat genome architecture, we characterized the distribution of LINE-1-derived sequences on genomes of 13 phyllostomid species within a phylogenetic framework. We found massive accumulation of LINE-1 elements in the centromeres of most species: a rare phenomenon on mammalian genomes. We hypothesize that expansion of these elements has occurred early in the radiation of phyllostomids and recurred episodically. LINE-1 expansions on centromeric heterochromatin probably spurred chromosomal change before the radiation of phyllostomids into the extant 11 subfamilies and contributed to the high degree of karyotypic variation observed among different lineages. Understanding centromere architecture in a variety of taxa promises to explain how lineage-specific changes on centromere structure can contribute to karyotypic diversity while not disrupting functional constraints for proper cell division.
Collapse
Affiliation(s)
- Cibele Gomes de Sotero-Caio
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil. .,Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA.
| | - Diogo Cavalcanti Cabral-de-Mello
- Departamento de Biologia, Grupo de Estudos em Citogenômica e Evolução Animal, UNESP-Universidade Estadual Paulista, Instituto de Biociências, Rio Claro, SP, Brazil
| | - Merilane da Silva Calixto
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil.,Centro de Saúde e Tecnologia, Unidade Acadêmica de Ciências Biológicas, UFCG-Universidade Federal de Campina Grande, Patos, PB, Brazil
| | - Guilherme Targino Valente
- Departamento de Bioprocessos e Biotecnologia da Faculdade de Ciências Agronômicas, UNESP-Universidade Estadual Paulista, Botucatu, SP, Brazil
| | - Cesar Martins
- Departamento de Morfologia, Laboratório Genômica Integrativa, UNESP-Universidade Estadual Paulista, Botucatu, SP, Brazil
| | - Vilma Loreto
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| | - Maria José de Souza
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| | - Neide Santos
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| |
Collapse
|
17
|
Differential chromosomal organization between Saguinus midas and Saguinus bicolor with accumulation of differences the repetitive sequence DNA. Genetica 2017. [PMID: 28634866 DOI: 10.1007/s10709-017-9971-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Saguinus is the largest and most complex genus of the subfamily Callitrichinae, with 23 species distributed from the south of Central America to the north of South America with Saguinus midas having the largest geographical distribution while Saguinus bicolor has a very restricted one, affected by the population expansion in the state of Amazonas. Considering the phylogenetic proximity of the two species along with evidence on the existence of hybrids between them, as well as cytogenetic studies on Saguinus describing a conserved karyotypic macrostructure, we carried out a physical mapping of DNA repeated sequences in the mitotic chromosome of both species, since these sequences are less susceptible to evolutionary pressure and possibly perform an important function in speciation. Both species presented 2n = 46 chromosomes; in S. midas, chromosome Y is the smallest. Multiple ribosomal sites occur in both species, but chromosome pairs three and four may be regarded as markers that differ the species when subjected to G banding and distribution of retroelement LINE 1, suggesting that it may be cytogenetic marker in which it can contribute to identification of first generation hybrids in contact zone. Saguinus bicolor also presented differences in the LINE 1 distribution pattern for sexual chromosome X in individuals from different urban fragments, probably due to geographical isolation. In this context, cytogenetic analyses reveal a differential genomic organization pattern between species S. midas and S. bicolor, in addition to indicating that individuals from different urban fragments have been accumulating differences because of the isolation between them.
Collapse
|
18
|
Adrion JR, Song MJ, Schrider DR, Hahn MW, Schaack S. Genome-Wide Estimates of Transposable Element Insertion and Deletion Rates in Drosophila Melanogaster. Genome Biol Evol 2017; 9:1329-1340. [PMID: 28338986 PMCID: PMC5447328 DOI: 10.1093/gbe/evx050] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/08/2017] [Indexed: 02/06/2023] Open
Abstract
Knowing the rate at which transposable elements (TEs) insert and delete is critical for understanding their role in genome evolution. We estimated spontaneous rates of insertion and deletion for all known, active TE superfamilies present in a set of Drosophila melanogaster mutation-accumulation (MA) lines using whole genome sequence data. Our results demonstrate that TE insertions far outpace TE deletions in D. melanogaster. We found a significant effect of background genotype on TE activity, with higher rates of insertions in one MA line. We also found significant rate heterogeneity between the chromosomes, with both insertion and deletion rates elevated on the X relative to the autosomes. Further, we identified significant associations between TE activity and chromatin state, and tested for associations between TE activity and other features of the local genomic environment such as TE content, exon content, GC content, and recombination rate. Our results provide the most detailed assessment of TE mobility in any organism to date, and provide a useful benchmark for both addressing theoretical predictions of TE dynamics and for exploring large-scale patterns of TE movement in D. melanogaster and other species.
Collapse
Affiliation(s)
| | - Michael J. Song
- Department of Integrative Biology, University of California, Berkeley, CA
| | - Daniel R. Schrider
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ
| | - Matthew W. Hahn
- Department of Biology, Indiana University, Bloomington, IN
- School of Informatics and Computing, Indiana University, Bloomington, IN
| | | |
Collapse
|
19
|
Vieira-da-Silva A, Adega F, Guedes-Pinto H, Chaves R. LINE-1 distribution in six rodent genomes follow a species-specific pattern. J Genet 2016; 95:21-33. [PMID: 27019429 DOI: 10.1007/s12041-015-0595-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
L1 distribution in mammal's genomes is yet a huge riddle. However, these repetitive sequences were already found in all chromosomic regions, and in general, they seem to be nonrandomly distributed in the genome. It also seems that after insertion and when they are not deleterious, they are always involved in dynamic processes occurring on that particular chromosomic region. Furthermore, it seems that large-scale genome rearrangements and L1 activity and accumulation are somehow interconnected. In the present study, we analysed L1 genomic distribution in Tatera gambiana (Muridae, Gerbillinae), Acomys sp. (Muridae, Deomyinae), Cricetomys sp. (Nesomyidae, Cricetomyinae), Microtus arvalis (Cricetidae, Arvicolinae), Phodopus roborovskii and P. sungorus (Cricetidae, Cricetinae). All the species studied here seems to exhibit a species-specific pattern.Possible mechanisms, and processes involved in L1 distribution and preferential accumulation in certain regions are di scussed.
Collapse
Affiliation(s)
- A Vieira-da-Silva
- Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), University of Trάs-os-Montes and Alto Douro (UTAD), 5001, 801 Vila Real,
| | | | | | | |
Collapse
|
20
|
Abstract
Transposable elements have had a profound impact on the structure and function of mammalian genomes. The retrotransposon Long INterspersed Element-1 (LINE-1 or L1), by virtue of its replicative mobilization mechanism, comprises ∼17% of the human genome. Although the vast majority of human LINE-1 sequences are inactive molecular fossils, an estimated 80-100 copies per individual retain the ability to mobilize by a process termed retrotransposition. Indeed, LINE-1 is the only active, autonomous retrotransposon in humans and its retrotransposition continues to generate both intra-individual and inter-individual genetic diversity. Here, we briefly review the types of transposable elements that reside in mammalian genomes. We will focus our discussion on LINE-1 retrotransposons and the non-autonomous Short INterspersed Elements (SINEs) that rely on the proteins encoded by LINE-1 for their mobilization. We review cases where LINE-1-mediated retrotransposition events have resulted in genetic disease and discuss how the characterization of these mutagenic insertions led to the identification of retrotransposition-competent LINE-1s in the human and mouse genomes. We then discuss how the integration of molecular genetic, biochemical, and modern genomic technologies have yielded insight into the mechanism of LINE-1 retrotransposition, the impact of LINE-1-mediated retrotransposition events on mammalian genomes, and the host cellular mechanisms that protect the genome from unabated LINE-1-mediated retrotransposition events. Throughout this review, we highlight unanswered questions in LINE-1 biology that provide exciting opportunities for future research. Clearly, much has been learned about LINE-1 and SINE biology since the publication of Mobile DNA II thirteen years ago. Future studies should continue to yield exciting discoveries about how these retrotransposons contribute to genetic diversity in mammalian genomes.
Collapse
|
21
|
Nazaryan-Petersen L, Bertelsen B, Bak M, Jønson L, Tommerup N, Hancks DC, Tümer Z. Germline Chromothripsis Driven by L1-Mediated Retrotransposition and Alu/Alu Homologous Recombination. Hum Mutat 2016; 37:385-95. [PMID: 26929209 DOI: 10.1002/humu.22953] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Accepted: 01/03/2016] [Indexed: 12/20/2022]
Abstract
Chromothripsis (CTH) is a phenomenon where multiple localized double-stranded DNA breaks result in complex genomic rearrangements. Although the DNA-repair mechanisms involved in CTH have been described, the mechanisms driving the localized "shattering" process remain unclear. High-throughput sequence analysis of a familial germline CTH revealed an inserted SVAE retrotransposon associated with a 110-kb deletion displaying hallmarks of L1-mediated retrotransposition. Our analysis suggests that the SVAE insertion did not occur prior to or after, but concurrent with the CTH event. We also observed L1-endonuclease potential target sites in other breakpoints. In addition, we found four Alu elements flanking the 110-kb deletion and associated with an inversion. We suggest that chromatin looping mediated by homologous Alu elements may have brought distal DNA regions into close proximity facilitating DNA cleavage by catalytically active L1-endonuclease. Our data provide the first evidence that active and inactive human retrotransposons can serve as endogenous mutagens driving CTH in the germline.
Collapse
Affiliation(s)
- Lusine Nazaryan-Petersen
- Applied Human Molecular Genetics, Kennedy Center, Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, Glostrup, 2600, Denmark.,Department of Cellular and Molecular Medicine (ICMM), Faculty of Health Science, University of Copenhagen, Copenhagen, N. 2200, Denmark
| | - Birgitte Bertelsen
- Applied Human Molecular Genetics, Kennedy Center, Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, Glostrup, 2600, Denmark
| | - Mads Bak
- Department of Cellular and Molecular Medicine, Faculty of Health Science, University of Copenhagen, Copenhagen, N. 2200, Denmark
| | - Lars Jønson
- Center for Genomic Medicine, Copenhagen University Hospital, Rigshospitalet, Copenhagen, O. 2100, Denmark
| | - Niels Tommerup
- Department of Cellular and Molecular Medicine, Faculty of Health Science, University of Copenhagen, Copenhagen, N. 2200, Denmark
| | - Dustin C Hancks
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah, 84112
| | - Zeynep Tümer
- Applied Human Molecular Genetics, Kennedy Center, Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, Glostrup, 2600, Denmark
| |
Collapse
|
22
|
Dhivya S, Premkumar K. Nomadic genetic elements contribute to oncogenic translocations: Implications in carcinogenesis. Crit Rev Oncol Hematol 2015; 98:81-93. [PMID: 26548742 DOI: 10.1016/j.critrevonc.2015.10.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Revised: 10/05/2015] [Accepted: 10/27/2015] [Indexed: 12/22/2022] Open
Abstract
Chromosomal translocations as molecular signatures have been reported in various malignancies but, the mechanism behind which is largely unknown. Swapping of chromosomal fragments occurs by induction of double strand breaks (DSBs), most of which were initially assumed de novo. However, decoding of human genome proved that transposable elements (TE) might have profound influence on genome integrity. TEs are highly conserved mobile genetic elements that generate DSBs, subsequently resulting in large chromosomal rearrangements. Previously TE insertions were thought to be harmless, but recently gains attention due to the origin of spectrum of post-insertional genomic alterations and subsequent transcriptional alterations leading to development of deleterious effects mainly carcinogenesis. Though the existing knowledge on the cancer-associated TE dynamics is very primitive, exploration of underlying mechanism promises better therapeutic strategies for cancer. Thus, this review focuses on the prevalence of TE in the genome, associated genomic instability upon transposition activation and impact on tumorigenesis.
Collapse
Affiliation(s)
- Sridaran Dhivya
- Cancer Genetics and Nanomedicine Laboratory, Department of Biomedical Science, School of Basic Medical Sciences, Bharathidasan University, Tiruchirappalli 620 024, Tamil Nadu, India
| | - Kumpati Premkumar
- Cancer Genetics and Nanomedicine Laboratory, Department of Biomedical Science, School of Basic Medical Sciences, Bharathidasan University, Tiruchirappalli 620 024, Tamil Nadu, India.
| |
Collapse
|
23
|
Linker S, Hedges D. Linear decay of retrotransposon antisense bias across genes is contingent upon tissue specificity. PLoS One 2013; 8:e79402. [PMID: 24244495 PMCID: PMC3828378 DOI: 10.1371/journal.pone.0079402] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 09/28/2013] [Indexed: 12/23/2022] Open
Abstract
Retrotransposons comprise approximately half of the human genome and contribute to chromatin structure, regulatory motifs, and protein-coding sequences. Since retrotransposon insertions can disrupt functional genetic elements as well as introduce new sequence motifs to a region, they have the potential to affect the function of genes that harbour insertions as well as those nearby. Partly as a result of these effects, the distribution of retrotransposons across the genome is non-uniform and there are observed imbalances in the orientation of insertions with respect to the transcriptional direction of the containing gene. Although some of the factors underlying the observed distributions are understood, much of the variability remains unexplained. Detailed characterization of retrotransposon density in genes could help inform predictions of the functional consequence of de novo as well as polymorphic insertions. In order to characterize the relationship between genes and inserted elements, we have examined the distribution of retrotransposons and their internal motifs within tissue-specific and housekeeping genes. We have identified that the previously established retrotransposon antisense bias decays at a linear rate across genes, resulting in an equal density of sense and antisense retrotransposons near the 3'-UTR. In addition, the decay of antisense bias across genes is less pronounced among tissue-specific genes. Our results provide support for the scenario in which this linear decay in antisense bias is established by natural selection shortly after retrotransposon integration, and that total antisense bias observed is above and beyond any bias introduced by the integration process itself. Finally, we provide an example of a retrotransposon acting as an eQTL on a coincident gene, highlighting one of several possible avenues through which insertions may modulate gene function.
Collapse
Affiliation(s)
- Sara Linker
- Hussman Institute for Human Genomics, Dr John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Dale Hedges
- Division of Human Genetics, Department of Internal Medicine, The Ohio State University, Columbus, Ohio, United States of America
| |
Collapse
|
24
|
Grandi FC, An W. Non-LTR retrotransposons and microsatellites: Partners in genomic variation. Mob Genet Elements 2013; 3:e25674. [PMID: 24195012 PMCID: PMC3812793 DOI: 10.4161/mge.25674] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Revised: 07/07/2013] [Accepted: 07/09/2013] [Indexed: 01/10/2023] Open
Abstract
The human genome is laden with both non-LTR (long-terminal repeat) retrotransposons and microsatellite repeats. Both types of sequences are able to, either actively or passively, mutagenize the genomes of human individuals and are therefore poised to dynamically alter the human genomic landscape across generations. Non-LTR retrotransposons, such as L1 and Alu, are a major source of new microsatellites, which are born both concurrently and subsequently to L1 and Alu integration into the genome. Likewise, the mutation dynamics of microsatellite repeats have a direct impact on the fitness of their non-LTR retrotransposon parent owing to microsatellite expansion and contraction. This review explores the interactions and dynamics between non-LTR retrotransposons and microsatellites in the context of genomic variation and evolution.
Collapse
Affiliation(s)
- Fiorella C Grandi
- School of Molecular Biosciences and Center for Reproductive Biology; Washington State University; Pullman, WA USA
| | | |
Collapse
|
25
|
Controlled insertional mutagenesis using a LINE-1 (ORFeus) gene-trap mouse model. Proc Natl Acad Sci U S A 2013; 110:E2706-13. [PMID: 23818630 DOI: 10.1073/pnas.1302504110] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
A codon-optimized mouse LINE-1 element, ORFeus, exhibits dramatically higher retrotransposition frequencies compared with its native long interspersed element 1 counterpart. To establish a retrotransposon-mediated mouse model with regulatable and potent mutagenic capabilities, we generated a tetracycline (tet)-regulated ORFeus element harboring a gene-trap cassette. Here, we show that mice expressing tet-ORFeus broadly exhibit robust retrotransposition in somatic tissues when treated with doxycycline. Consistent with a significant mutagenic burden, we observed a reduced number of double transgenic animals when treated with high-level doxycycline during embryogenesis. Transgene induction in skin resulted in a white spotting phenotype due to somatic ORFeus-mediated mutations that likely disrupt melanocyte development. The data suggest a high level of transposition in melanocyte precursors and consequent mutation of genes important for melanoblast proliferation, differentiation, or migration. These findings reveal the utility of a retrotransposon-based mutagenesis system as an alternative to existing DNA transposon systems. Moreover, breeding these mice to different tet-transactivator/reversible tet-transactivator lines supports broad functionality of tet-ORFeus because of the potential for dose-dependent, tissue-specific, and temporal-specific mutagenesis.
Collapse
|
26
|
Macfarlane CM, Collier P, Rahbari R, Beck CR, Wagstaff JF, Igoe S, Moran JV, Badge RM. Transduction-specific ATLAS reveals a cohort of highly active L1 retrotransposons in human populations. Hum Mutat 2013; 34:974-85. [PMID: 23553801 DOI: 10.1002/humu.22327] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Accepted: 03/15/2013] [Indexed: 11/09/2022]
Abstract
Long INterspersed Element-1 (LINE-1 or L1) retrotransposons are the only autonomously active transposable elements in the human genome. The average human genome contains ∼80-100 active L1s, but only a subset of these L1s are highly active or 'hot'. Human L1s are closely related in sequence, making it difficult to decipher progenitor/offspring relationships using traditional phylogenetic methods. However, L1 mRNAs can sometimes bypass their own polyadenylation signal and instead utilize fortuitous polyadenylation signals in 3' flanking genomic DNA. Retrotransposition of the resultant mRNAs then results in lineage specific sequence "tags" (i.e., 3' transductions) that mark the descendants of active L1 progenitors. Here, we developed a method (Transduction-Specific Amplification Typing of L1 Active Subfamilies or TS-ATLAS) that exploits L1 3' transductions to identify active L1 lineages in a genome-wide context. TS-ATLAS enabled the characterization of a putative active progenitor of one L1 lineage that includes the disease causing L1 insertion L1RP , and the identification of new retrotransposition events within two other "hot" L1 lineages. Intriguingly, the analysis of the newly discovered transduction lineage members suggests that L1 polyadenylation, even within a lineage, is highly stochastic. Thus, TS-ATLAS provides a new tool to explore the dynamics of L1 lineage evolution and retrotransposon biology.
Collapse
|
27
|
Witherspoon DJ, Zhang Y, Xing J, Watkins WS, Ha H, Batzer MA, Jorde LB. Mobile element scanning (ME-Scan) identifies thousands of novel Alu insertions in diverse human populations. Genome Res 2013; 23:1170-81. [PMID: 23599355 PMCID: PMC3698510 DOI: 10.1101/gr.148973.112] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Alu retrotransposons are the most numerous and active mobile elements in humans, causing genetic disease and creating genomic diversity. Mobile element scanning (ME-Scan) enables comprehensive and affordable identification of mobile element insertions (MEI) using targeted high-throughput sequencing of multiplexed MEI junction libraries. In a single experiment, ME-Scan identifies nearly all AluYb8 and AluYb9 elements, with high sensitivity for both rare and common insertions, in 169 individuals of diverse ancestry. ME-Scan detects heterozygous insertions in single individuals with 91% sensitivity. Insertion presence or absence states determined by ME-Scan are 95% concordant with those determined by locus-specific PCR assays. By sampling diverse populations from Africa, South Asia, and Europe, we are able to identify 5799 Alu insertions, including 2524 novel ones, some of which occur in exons. Sub-Saharan populations and a Pygmy group in particular carry numerous intermediate-frequency Alu insertions that are absent in non-African groups. There is a significant dearth of exon-interrupting insertions among common Alu polymorphisms, but the density of singleton Alu insertions is constant across exonic and nonexonic regions. In one case, a validated novel singleton Alu interrupts a protein-coding exon of FAM187B. This implies that exonic Alu insertions are generally deleterious and thus eliminated by natural selection, but not so quickly that they cannot be observed as extremely rare variants.
Collapse
Affiliation(s)
- David J Witherspoon
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah 84112, USA.
| | | | | | | | | | | | | |
Collapse
|
28
|
Xing J, Witherspoon DJ, Jorde LB. Mobile element biology: new possibilities with high-throughput sequencing. Trends Genet 2013; 29:280-9. [PMID: 23312846 DOI: 10.1016/j.tig.2012.12.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Revised: 11/20/2012] [Accepted: 12/11/2012] [Indexed: 12/29/2022]
Abstract
Mobile elements comprise more than half of the human genome, but until recently their large-scale detection was time consuming and challenging. With the development of new high-throughput sequencing (HTS) technologies, the complete spectrum of mobile element variation in humans can now be identified and analyzed. Thousands of new mobile element insertions (MEIs) have been discovered, yielding new insights into mobile element biology, evolution, and genomic variation. Here, we review several high-throughput methods, with an emphasis on techniques that specifically target MEIs in humans. We highlight recent applications of these methods in evolutionary studies and in the analysis of somatic alterations in human normal and tumor tissues.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Genetics, Human Genetic Institute of New Jersey, Rutgers, State University of New Jersey, Piscataway, NJ 08854, USA
| | | | | |
Collapse
|
29
|
Grandi FC, Rosser JM, An W. LINE-1-derived poly(A) microsatellites undergo rapid shortening and create somatic and germline mosaicism in mice. Mol Biol Evol 2012; 30:503-12. [PMID: 23125228 DOI: 10.1093/molbev/mss251] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Interspersed and tandem repeat sequences comprise the bulk of mammalian genomes. Interspersed repeats result from successive replication by transposable elements, such as Alu and long interspersed element type 1 (L1). Microsatellites are tandem repeats of 1-6 base pairs, among which poly(A) microsatellites are the most abundant in the human genome. The rise and fall of a microsatellite has been depicted as a life cycle. Previous studies have demonstrated that Alu and L1 insertions are a major source of A-rich microsatellites owing to the concurrent formation of a poly(A) DNA tract at the 3'-end of each insertion. The fate of such poly(A) tracts has been studied by surveying the length distribution of genomic resident Alu and L1 insertions. However, these cross-sectional studies provide no information about the tempo of mutation immediately after birth. In this study, de novo L1 insertions were created using a transgenic L1 mouse model and traced through generations to investigate the early life of poly(A) microsatellites. High frequencies of intra-individual and intergenerational shortening were observed for long poly(A) tracts, creating somatic and germline mosaicism at the insertion site, whereas little variation was observed for short poly(A) alleles. As poly(A) microsatellites are the major intrinsic signal for nucleosome positioning, their remarkable abundance and variability make them a significant source of epigenetic variation. Thus, the birth of poly(A) microsatellites from retrotransposons and the subsequent rapid and variable shortening represent a new way with which retrotransposons can modify the genetic and epigenetic architecture of our genome.
Collapse
Affiliation(s)
- Fiorella C Grandi
- School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, USA
| | | | | |
Collapse
|
30
|
Wagstaff BJ, Hedges DJ, Derbes RS, Campos Sanchez R, Chiaromonte F, Makova KD, Roy-Engel AM. Rescuing Alu: recovery of new inserts shows LINE-1 preserves Alu activity through A-tail expansion. PLoS Genet 2012; 8:e1002842. [PMID: 22912586 PMCID: PMC3415434 DOI: 10.1371/journal.pgen.1002842] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Accepted: 05/30/2012] [Indexed: 12/15/2022] Open
Abstract
Alu elements are trans-mobilized by the autonomous non-LTR retroelement, LINE-1 (L1). Alu-induced insertion mutagenesis contributes to about 0.1% human genetic disease and is responsible for the majority of the documented instances of human retroelement insertion-induced disease. Here we introduce a SINE recovery method that provides a complementary approach for comprehensive analysis of the impact and biological mechanisms of Alu retrotransposition. Using this approach, we recovered 226 de novo tagged Alu inserts in HeLa cells. Our analysis reveals that in human cells marked Alu inserts driven by either exogenously supplied full length L1 or ORF2 protein are indistinguishable. Four percent of de novo Alu inserts were associated with genomic deletions and rearrangements and lacked the hallmarks of retrotransposition. In contrast to L1 inserts, 5′ truncations of Alu inserts are rare, as most of the recovered inserts (96.5%) are full length. De novo Alus show a random pattern of insertion across chromosomes, but further characterization revealed an Alu insertion bias exists favoring insertion near other SINEs, highly conserved elements, with almost 60% landing within genes. De novo Alu inserts show no evidence of RNA editing. Priming for reverse transcription rarely occurred within the first 20 bp (most 5′) of the A-tail. The A-tails of recovered inserts show significant expansion, with many at least doubling in length. Sequence manipulation of the construct led to the demonstration that the A-tail expansion likely occurs during insertion due to slippage by the L1 ORF2 protein. We postulate that the A-tail expansion directly impacts Alu evolution by reintroducing new active source elements to counteract the natural loss of active Alus and minimizing Alu extinction. SINEs are mobile elements that are found ubiquitously throughout a large diversity of genomes from plants to mammals. The human SINE, Alu, is among the most successful mobile elements, with more than one million copies in the genome. Due to its high activity and ability to insert throughout the genome, Alu retrotransposition is responsible for the majority of diseases reported to be caused by mobile element activity. To further evaluate the genomic impact of SINEs, we recovered and characterized over 200 de novo Alu inserts under controlled conditions. Our data reinforce observations on the mutagenic potential of Alu, with newly retrotransposed Alu elements favoring insertion into genic and highly conserved elements. Alu-mediated deletions and rearrangements are infrequent and lack the typical hallmarks of TPRT retrotransposition, suggesting the use of an alternate method for resolving retrotransposition intermediates or an atypical insertion mechanism. Our data also provide novel insights into SINE retrotransposition biology. We found that slippage of L1 ORF2 protein during reverse transcription expands the A-tails of de novo insertions. We propose that the L1 ORF2 protein plays a major role in minimizing Alu extinction by reintroducing active Alu elements to counter the natural loss of Alu source elements.
Collapse
Affiliation(s)
- Bradley J. Wagstaff
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
| | - Dale J. Hedges
- Hussman Institute for Human Genomics, Dr. John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Rebecca S. Derbes
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
| | - Rebeca Campos Sanchez
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Francesca Chiaromonte
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Kateryna D. Makova
- Department of Biology, Center for Medical Genomics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Astrid M. Roy-Engel
- Tulane Cancer Center, Department of Epidemiology, Tulane University, New Orleans, Louisiana, United States of America
- * E-mail:
| |
Collapse
|
31
|
Hancks DC, Kazazian HH. Active human retrotransposons: variation and disease. Curr Opin Genet Dev 2012; 22:191-203. [PMID: 22406018 DOI: 10.1016/j.gde.2012.02.006] [Citation(s) in RCA: 449] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2011] [Revised: 01/18/2012] [Accepted: 02/06/2012] [Indexed: 12/12/2022]
Abstract
Mobile DNAs, also known as transposons or 'jumping genes', are widespread in nature and comprise an estimated 45% of the human genome. Transposons are divided into two general classes based on their transposition intermediate (DNA or RNA). Only one subclass, the non-LTR retrotransposons, which includes the Long INterspersed Element-1 (LINE-1 or L1), is currently active in humans as indicated by 96 disease-causing insertions. The autonomous LINE-1 is capable of retrotransposing not only a copy of its own RNA in cis but also other RNAs (Alu, SINE-VNTR-Alu (SVA), U6) in trans to new genomic locations through an element encoded reverse transcriptase. L1 can also retrotranspose cellular mRNAs, resulting in processed pseudogene formation. Here, we highlight recent reports that update our understanding of human L1 retrotransposition and their role in disease. Finally we discuss studies that provide insights into the past and current activity of these retrotransposons, and shed light on not just when, but where, retrotransposition occurs and its part in genetic variation.
Collapse
Affiliation(s)
- Dustin C Hancks
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, United States
| | | |
Collapse
|
32
|
Kines KJ, Belancio VP. Expressing genes do not forget their LINEs: transposable elements and gene expression. FRONT BIOSCI-LANDMRK 2012; 17:1329-44. [PMID: 22201807 DOI: 10.2741/3990] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Historically the accumulated mass of mammalian transposable elements (TEs), particularly those located within gene boundaries, was viewed as a genetic burden potentially detrimental to the genomic landscape. This notion has been strengthened by the discovery that transposable sequences can alter the architecture of the transcriptome, not only through insertion, but also long after the integration process is completed. Insertions previously considered harmless are now known to impact the expression of host genes via modification of the transcript quality or quantity, transcriptional interference, or by the control of pathways that affect the mRNA life-cycle. Conversely, several examples of the evolutionary advantageous impact of TEs on the host gene structure that diversified the cellular transcriptome are reported. TE-induced changes in gene expression can be tissue- or disease-specific, raising the possibility that the impact of TE sequences may vary during development, among normal cell types, and between normal and disease-affected tissues. The understanding of the rules and abundance of TE-interference with gene expression is in its infancy, and its contribution to human disease and/or evolution remains largely unexplored.
Collapse
Affiliation(s)
- Kristine J Kines
- Department of Structural and Cellular Biology, Tulane University School of Medicine, Tulane University Cancer Center and Tulane Center for Aging
| | | |
Collapse
|
33
|
Raiz J, Damert A, Chira S, Held U, Klawitter S, Hamdorf M, Löwer J, Strätling WH, Löwer R, Schumann GG. The non-autonomous retrotransposon SVA is trans-mobilized by the human LINE-1 protein machinery. Nucleic Acids Res 2011; 40:1666-83. [PMID: 22053090 PMCID: PMC3287187 DOI: 10.1093/nar/gkr863] [Citation(s) in RCA: 168] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
SINE-VNTR-Alu (SVA) elements are non-autonomous, hominid-specific non-LTR retrotransposons and distinguished by their organization as composite mobile elements. They represent the evolutionarily youngest, currently active family of human non-LTR retrotransposons, and sporadically generate disease-causing insertions. Since preexisting, genomic SVA sequences are characterized by structural hallmarks of Long Interspersed Elements 1 (LINE-1, L1)-mediated retrotransposition, it has been hypothesized for several years that SVA elements are mobilized by the L1 protein machinery in trans. To test this hypothesis, we developed an SVA retrotransposition reporter assay in cell culture using three different human-specific SVA reporter elements. We demonstrate that SVA elements are mobilized in HeLa cells only in the presence of both L1-encoded proteins, ORF1p and ORF2p. SVA trans-mobilization rates exceeded pseudogene formation frequencies by 12- to 300-fold in HeLa-HA cells, indicating that SVA elements represent a preferred substrate for L1 proteins. Acquisition of an AluSp element increased the trans-mobilization frequency of the SVA reporter element by ~25-fold. Deletion of (CCCTCT)n repeats and Alu-like region of a canonical SVA reporter element caused significant attenuation of the SVA trans-mobilization rate. SVA de novo insertions were predominantly full-length, occurred preferentially in G+C-rich regions, and displayed all features of L1-mediated retrotransposition which are also observed in preexisting genomic SVA insertions.
Collapse
Affiliation(s)
- Julija Raiz
- Section PR2/Retroelements, Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Strasse 51-59, D-63225 Langen, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Beck CR, Garcia-Perez JL, Badge RM, Moran JV. LINE-1 elements in structural variation and disease. Annu Rev Genomics Hum Genet 2011; 12:187-215. [PMID: 21801021 DOI: 10.1146/annurev-genom-082509-141802] [Citation(s) in RCA: 394] [Impact Index Per Article: 28.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The completion of the human genome reference sequence ushered in a new era for the study and discovery of human transposable elements. It now is undeniable that transposable elements, historically dismissed as junk DNA, have had an instrumental role in sculpting the structure and function of our genomes. In particular, long interspersed element-1 (LINE-1 or L1) and short interspersed elements (SINEs) continue to affect our genome, and their movement can lead to sporadic cases of disease. Here, we briefly review the types of transposable elements present in the human genome and their mechanisms of mobility. We next highlight how advances in DNA sequencing and genomic technologies have enabled the discovery of novel retrotransposons in individual genomes. Finally, we discuss how L1-mediated retrotransposition events impact human genomes.
Collapse
Affiliation(s)
- Christine R Beck
- Department of Human Genetics, University of MIchigan Medical School, Ann Arbor, Michigan 48109-5618, USA.
| | | | | | | |
Collapse
|
35
|
Recent amplification of the kangaroo endogenous retrovirus, KERV, limited to the centromere. J Virol 2011; 85:4761-71. [PMID: 21389136 DOI: 10.1128/jvi.01604-10] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Mammalian retrotransposons, transposable elements that are processed through an RNA intermediate, are categorized as short interspersed elements (SINEs), long interspersed elements (LINEs), and long terminal repeat (LTR) retroelements, which include endogenous retroviruses. The ability of transposable elements to autonomously amplify led to their initial characterization as selfish or junk DNA; however, it is now known that they may acquire specific cellular functions in a genome and are implicated in host defense mechanisms as well as in genome evolution. Interactions between classes of transposable elements may exert a markedly different and potentially more significant effect on a genome than interactions between members of a single class of transposable elements. We examined the genomic structure and evolution of the kangaroo endogenous retrovirus (KERV) in the marsupial genus Macropus. The complete proviral structure of the kangaroo endogenous retrovirus, phylogenetic relationship among relative retroviruses, and expression of this virus in both Macropus rufogriseus and M. eugenii are presented for the first time. In addition, we show the relative copy number and distribution of the kangaroo endogenous retrovirus in the Macropus genus. Our data indicate that amplification of the kangaroo endogenous retrovirus occurred in a lineage-specific fashion, is restricted to the centromeres, and is not correlated with LINE depletion. Finally, analysis of KERV long terminal repeat sequences using massively parallel sequencing indicates that the recent amplification in M. rufogriseus is likely due to duplications and concerted evolution rather than a high number of independent insertion events.
Collapse
|
36
|
O'Donnell KA, Burns KH. Mobilizing diversity: transposable element insertions in genetic variation and disease. Mob DNA 2010; 1:21. [PMID: 20813032 PMCID: PMC2941744 DOI: 10.1186/1759-8753-1-21] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2010] [Accepted: 09/02/2010] [Indexed: 02/06/2023] Open
Abstract
Transposable elements (TEs) comprise a large fraction of mammalian genomes. A number of these elements are actively jumping in our genomes today. As a consequence, these insertions provide a source of genetic variation and, in rare cases, these events cause mutations that lead to disease. Yet, the extent to which these elements impact their host genomes is not completely understood. This review will summarize our current understanding of the mechanisms underlying transposon regulation and the contribution of TE insertions to genetic diversity in the germline and in somatic cells. Finally, traditional methods and emerging technologies for identifying transposon insertions will be considered.
Collapse
Affiliation(s)
- Kathryn A O'Donnell
- Department of Molecular Biology and Genetics, The Johns Hopkins University School of Medicine, Baltimore, MD, USA.
| | | |
Collapse
|
37
|
LINE-1 retrotransposition activity in human genomes. Cell 2010; 141:1159-70. [PMID: 20602998 DOI: 10.1016/j.cell.2010.05.021] [Citation(s) in RCA: 438] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Revised: 03/23/2010] [Accepted: 05/13/2010] [Indexed: 01/23/2023]
Abstract
Highly active (i.e., "hot") long interspersed element-1 (LINE-1 or L1) sequences comprise the bulk of retrotransposition activity in the human genome; however, the abundance of hot L1s in the human population remains largely unexplored. Here, we used a fosmid-based, paired-end DNA sequencing strategy to identify 68 full-length L1s that are differentially present among individuals but are absent from the human genome reference sequence. The majority of these L1s were highly active in a cultured cell retrotransposition assay. Genotyping 26 elements revealed that two L1s are only found in Africa and that two more are absent from the H952 subset of the Human Genome Diversity Panel. Therefore, these results suggest that hot L1s are more abundant in the human population than previously appreciated, and that ongoing L1 retrotransposition continues to be a major source of interindividual genetic variation.
Collapse
|
38
|
Witherspoon DJ, Xing J, Zhang Y, Watkins WS, Batzer MA, Jorde LB. Mobile element scanning (ME-Scan) by targeted high-throughput sequencing. BMC Genomics 2010; 11:410. [PMID: 20591181 PMCID: PMC2996938 DOI: 10.1186/1471-2164-11-410] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2010] [Accepted: 06/30/2010] [Indexed: 11/10/2022] Open
Abstract
Background Mobile elements (MEs) are diverse, common and dynamic inhabitants of nearly all genomes. ME transposition generates a steady stream of polymorphic genetic markers, deleterious and adaptive mutations, and substrates for further genomic rearrangements. Research on the impacts, population dynamics, and evolution of MEs is constrained by the difficulty of ascertaining rare polymorphic ME insertions that occur against a large background of pre-existing fixed elements and then genotyping them in many individuals. Results Here we present a novel method for identifying nearly all insertions of a ME subfamily in the whole genomes of multiple individuals and simultaneously genotyping (for presence or absence) those insertions that are variable in the population. We use ME-specific primers to construct DNA libraries that contain the junctions of all ME insertions of the subfamily, with their flanking genomic sequences, from many individuals. Individual-specific "index" sequences are designed into the oligonucleotide adapters used to construct the individual libraries. These libraries are then pooled and sequenced using a ME-specific sequencing primer. Mobile element insertion loci of the target subfamily are uniquely identified by their junction sequence, and all insertion junctions are linked to their individual libraries by the corresponding index sequence. To test this method's feasibility, we apply it to the human AluYb8 and AluYb9 subfamilies. In four individuals, we identified a total of 2,758 AluYb8 and AluYb9 insertions, including nearly all those that are present in the reference genome, as well as 487 that are not. Index counts show the sequenced products from each sample reflect the intended proportions to within 1%. At a sequencing depth of 355,000 paired reads per sample, the sensitivity and specificity of ME-Scan are both approximately 95%. Conclusions Mobile Element Scanning (ME-Scan) is an efficient method for quickly genotyping mobile element insertions with very high sensitivity and specificity. In light of recent improvements to high-throughput sequencing technology, it should be possible to employ ME-Scan to genotype insertions of almost any mobile element family in many individuals from any species.
Collapse
Affiliation(s)
- David J Witherspoon
- Dept. of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, Utah 84112, USA.
| | | | | | | | | | | |
Collapse
|
39
|
Iskow RC, McCabe MT, Mills RE, Torene S, Pittard WS, Neuwald AF, Van Meir EG, Vertino PM, Devine SE. Natural mutagenesis of human genomes by endogenous retrotransposons. Cell 2010; 141:1253-61. [PMID: 20603005 PMCID: PMC2943760 DOI: 10.1016/j.cell.2010.05.020] [Citation(s) in RCA: 427] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Revised: 04/01/2010] [Accepted: 05/12/2010] [Indexed: 11/22/2022]
Abstract
Two abundant classes of mobile elements, namely Alu and L1 elements, continue to generate new retrotransposon insertions in human genomes. Estimates suggest that these elements have generated millions of new germline insertions in individual human genomes worldwide. Unfortunately, current technologies are not capable of detecting most of these young insertions, and the true extent of germline mutagenesis by endogenous human retrotransposons has been difficult to examine. Here, we describe technologies for detecting these young retrotransposon insertions and demonstrate that such insertions indeed are abundant in human populations. We also found that new somatic L1 insertions occur at high frequencies in human lung cancer genomes. Genome-wide analysis suggests that altered DNA methylation may be responsible for the high levels of L1 mobilization observed in these tumors. Our data indicate that transposon-mediated mutagenesis is extensive in human genomes and is likely to have a major impact on human biology and diseases.
Collapse
Affiliation(s)
- Rebecca C. Iskow
- Genetics and Molecular Biology Graduate Program, Emory University, Atlanta, GA 30322
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322
| | - Michael T. McCabe
- Department of Radiation Oncology, Emory University School of Medicine, Atlanta, GA 30322
- Winship Cancer Institute, Emory University, Atlanta, GA 30322
| | - Ryan E. Mills
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322
| | - Spencer Torene
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322
| | | | - Andrew F. Neuwald
- Institute for Genome SciencesBiology, University of Maryland School of Medicine, Baltimore, MD 20201
- Department of Biochemistry and Molecular, University of Maryland School of Medicine, Baltimore, MD 20201
| | - Erwin G. Van Meir
- Department of Neurosurgery, Emory University School of Medicine, Atlanta, GA 30322
- Department of Hematology and Medical Oncology, Emory University School of Medicine, Atlanta, GA 30322
- Winship Cancer Institute, Emory University, Atlanta, GA 30322
| | - Paula M. Vertino
- Genetics and Molecular Biology Graduate Program, Emory University, Atlanta, GA 30322
- Department of Radiation Oncology, Emory University School of Medicine, Atlanta, GA 30322
- Winship Cancer Institute, Emory University, Atlanta, GA 30322
| | - Scott E. Devine
- Genetics and Molecular Biology Graduate Program, Emory University, Atlanta, GA 30322
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322
- Winship Cancer Institute, Emory University, Atlanta, GA 30322
- Institute for Genome SciencesBiology, University of Maryland School of Medicine, Baltimore, MD 20201
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 20201
- Marlene and Stewart Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 20201
| |
Collapse
|
40
|
Ewing AD, Kazazian HH. High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes. Genome Res 2010; 20:1262-70. [PMID: 20488934 DOI: 10.1101/gr.106419.110] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Using high-throughput sequencing, we devised a technique to determine the insertion sites of virtually all members of the human-specific L1 retrotransposon family in any human genome. Using diagnostic nucleotides, we were able to locate the approximately 800 L1Hs copies corresponding specifically to the pre-Ta, Ta-0, and Ta-1 L1Hs subfamilies, with over 90% of sequenced reads corresponding to human-specific elements. We find that any two individual genomes differ at an average of 285 sites with respect to L1 insertion presence or absence. In total, we assayed 25 individuals, 15 of which are unrelated, at 1139 sites, including 772 shared with the reference genome and 367 nonreference L1 insertions. We show that L1Hs profiles recapitulate genetic ancestry, and determine the chromosomal distribution of these elements. Using these data, we estimate that the rate of L1 retrotransposition in humans is between 1/95 and 1/270 births, and the number of dimorphic L1 elements in the human population with gene frequencies greater than 0.05 is between 3000 and 10,000.
Collapse
Affiliation(s)
- Adam D Ewing
- University of Pennsylvania Department of Genetics, Philadelphia, Pennsylvania 19104, USA
| | | |
Collapse
|
41
|
Konkel MK, Batzer MA. A mobile threat to genome stability: The impact of non-LTR retrotransposons upon the human genome. Semin Cancer Biol 2010; 20:211-21. [PMID: 20307669 DOI: 10.1016/j.semcancer.2010.03.001] [Citation(s) in RCA: 130] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Revised: 03/04/2010] [Accepted: 03/16/2010] [Indexed: 02/06/2023]
Abstract
It is now commonly agreed that the human genome is not the stable entity originally presumed. Deletions, duplications, inversions, and insertions are common, and contribute significantly to genomic structural variations (SVs). Their collective impact generates much of the inter-individual genomic diversity observed among humans. Not only do these variations change the structure of the genome; they may also have functional implications, e.g. altered gene expression. Some SVs have been identified as the cause of genetic disorders, including cancer predisposition. Cancer cells are notorious for their genomic instability, and often show genomic rearrangements at the microscopic and submicroscopic level to which transposable elements (TEs) contribute. Here, we review the role of TEs in genome instability, with particular focus on non-LTR retrotransposons. Currently, three non-LTR retrotransposon families - long interspersed element 1 (L1), SVA (short interspersed element (SINE-R), variable number of tandem repeats (VNTR), and Alu), and Alu (a SINE) elements - mobilize in the human genome, and cause genomic instability through both insertion- and post-insertion-based mutagenesis. Due to the abundance and high sequence identity of TEs, they frequently mislead the homologous recombination repair pathway into non-allelic homologous recombination, causing deletions, duplications, and inversions. While less comprehensively studied, non-LTR retrotransposon insertions and TE-mediated rearrangements are probably more common in cancer cells than in healthy tissue. This may be at least partially attributed to the commonly seen global hypomethylation as well as general epigenetic dysfunction of cancer cells. Where possible, we provide examples that impact cancer predisposition and/or development.
Collapse
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, LA 70803, USA
| | | |
Collapse
|
42
|
Alu and b1 repeats have been selectively retained in the upstream and intronic regions of genes of specific functional classes. PLoS Comput Biol 2009; 5:e1000610. [PMID: 20019790 PMCID: PMC2784220 DOI: 10.1371/journal.pcbi.1000610] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2009] [Accepted: 11/13/2009] [Indexed: 11/20/2022] Open
Abstract
Alu and B1 repeats are mobile elements that originated in an initial duplication of the 7SL RNA gene prior to the primate-rodent split about 80 million years ago and currently account for a substantial fraction of the human and mouse genome, respectively. Following the primate-rodent split, Alu and B1 elements spread independently in each of the two genomes in a seemingly random manner, and, according to the prevailing hypothesis, negative selection shaped their final distribution in each genome by forcing the selective loss of certain Alu and B1 copies. In this paper, contrary to the prevailing hypothesis, we present evidence that Alu and B1 elements have been selectively retained in the upstream and intronic regions of genes belonging to specific functional classes. At the same time, we found no evidence for selective loss of these elements in any functional class. A subset of the functional links we discovered corresponds to functions where Alu involvement has actually been experimentally validated, whereas the majority of the functional links we report are novel. Finally, the unexpected finding that Alu and B1 elements show similar biases in their distribution across functional classes, despite having spread independently in their respective genomes, further supports our claim that the extant instances of Alu and B1 elements are the result of positive selection. Despite their fundamental role in cell regulation, genes account for less than 1% of the human genome. Recent studies have shown that non-genic regions of our DNA may also play an important functional role in human cells. In this paper, we study Alu and B elements, a specific class of such non-genic elements that account for ∼10% of the human genome and ∼7% of the mouse genome respectively. We show that, contrary to the prevailing hypothesis, Alu and B elements have been preferentially retained in the proximity of genes that perform specific functions in the cell. In contrast, we found no evidence for selective loss of these elements in any functional class. Several of the functional classes that we have linked to Alu and B elements are central to the proper working of the cell, and their disruption has previously been shown to lead to the onset of disease. Interestingly, the DNA sequences of Alu and B elements differ substantially between human and mouse, thus hinting at the existence of a potentially large number of non-conserved regulatory elements.
Collapse
|
43
|
Rangwala SH, Zhang L, Kazazian HH. Many LINE1 elements contribute to the transcriptome of human somatic cells. Genome Biol 2009; 10:R100. [PMID: 19772661 PMCID: PMC2768975 DOI: 10.1186/gb-2009-10-9-r100] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Revised: 08/21/2009] [Accepted: 09/22/2009] [Indexed: 11/29/2022] Open
Abstract
Over 600 LINE 1 elements are shown to be transcribed in humans; 400 of these are full-length elements in the reference genome. Background While LINE1 (L1) retroelements comprise nearly 20% of the human genome, the majority are thought to have been rendered transcriptionally inactive, due to either mutation or epigenetic suppression. How many L1 elements 'escape' these forms of repression and contribute to the transcriptome of human somatic cells? We have cloned out expressed sequence tags corresponding to the 5' and 3' flanks of L1 elements in order to characterize the population of elements that are being actively transcribed. We also examined expression of a select number of elements in different individuals. Results We isolated expressed sequence tags from human lymphoblastoid cell lines corresponding to 692 distinct L1 element sites, including 410 full-length elements. Four of the expression tagged sites corresponding to full-length elements from the human specific L1Hs subfamily were examined in European-American individuals and found to be differentially expressed in different family members. Conclusions A large number of different L1 element sites are expressed in human somatic tissues, and this expression varies among different individuals. Paradoxically, few elements were tagged at high frequency, indicating that the majority of expressed L1s are transcribed at low levels. Based on our preliminary expression studies of a limited number of elements in a single family, we predict a significant degree of inter-individual transcript-level polymorphism in this class of sequence.
Collapse
Affiliation(s)
- Sanjida H Rangwala
- Department of Genetics, University of Pennsylvania School of Medicine, Hamilton Walk, Philadelphia, Pennsylvania 19104, USA.
| | | | | |
Collapse
|
44
|
Hasnaoui M, Doucet AJ, Meziane O, Gilbert N. Ancient repeat sequence derived from U6 snRNA in primate genomes. Gene 2009; 448:139-44. [PMID: 19647053 DOI: 10.1016/j.gene.2009.07.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Revised: 07/15/2009] [Accepted: 07/15/2009] [Indexed: 02/06/2023]
Abstract
LINE-1 (L1) is the most represented sequence of the human genome (17% of the total genomic mass). Moreover, it has been proposed for many years and demonstrated more recently that L1 has contributed to the mobilization of pseudogenes, small non-coding RNAs, such as tRNAs or snRNAs, and SINEs. In fact, it is estimated that L1 is responsible for at least 30% of our genome. The mobilization of non-L1 RNAs can occur in different ways and at different steps of the retrotransposition cycle. Here, by looking at U6 snRNA sequences mobilized by L1, we have observed an ancient repeat sequence derived from U6, present in all primate genomes. We were able to trace its origin in Euarchota genomes, most likely during the divergence of the four orders; Scandentia, Dermoptera, Plesiadapiform (extinct) and Primates.
Collapse
Affiliation(s)
- Manel Hasnaoui
- Institut de Génétique Humaine, Centre National de la Recherche Scientifique, 141 Rue de la Cardonille, 34396 Montpellier Cedex 5, France
| | | | | | | |
Collapse
|
45
|
Srikanta D, Sen SK, Conlin EM, Batzer MA. Internal priming: an opportunistic pathway for L1 and Alu retrotransposition in hominins. Gene 2009; 448:233-41. [PMID: 19501635 DOI: 10.1016/j.gene.2009.05.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2009] [Revised: 05/26/2009] [Accepted: 05/28/2009] [Indexed: 01/24/2023]
Abstract
Retrotransposons, specifically Alu and L1 elements, have been especially successful in their expansion throughout primate genomes. While most of these elements integrate through an endonuclease-mediated process termed target primed reverse transcription, a minority integrate using alternative methods. Here we present evidence for one such mechanism, which we term internal priming and demonstrate that loci integrating through this mechanism are qualitatively different from "classical" insertions. Previous examples of this mechanism are limited to cell culture assays, which show that reverse transcription can initiate upstream of the 3' poly-A tail during retrotransposon integration. To detect whether this mechanism occurs in vivo as well as in cell culture, we have analyzed the human genome for internal priming events using recently integrated L1 and Alu elements. Our examination of the human genome resulted in the recovery of twenty events involving internal priming insertions, which are structurally distinct from both classical TPRT-mediated insertions and non-classical insertions. We suggest two possible mechanisms by which these internal priming loci are created and provide evidence supporting a role in staggered DNA double-strand break repair. Also, we demonstrate that the internal priming process is associated with inter-chromosomal duplications and the insertion of filler DNA.
Collapse
Affiliation(s)
- Deepa Srikanta
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803, USA
| | | | | | | |
Collapse
|
46
|
|
47
|
Srikanta D, Sen SK, Huang CT, Conlin EM, Rhodes RM, Batzer MA. An alternative pathway for Alu retrotransposition suggests a role in DNA double-strand break repair. Genomics 2008; 93:205-12. [PMID: 18951971 DOI: 10.1016/j.ygeno.2008.09.016] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2008] [Revised: 09/15/2008] [Accepted: 09/26/2008] [Indexed: 11/18/2022]
Abstract
The Alu family is a highly successful group of non-LTR retrotransposons ubiquitously found in primate genomes. Similar to the L1 retrotransposon family, Alu elements integrate primarily through an endonuclease-dependent mechanism termed target site-primed reverse transcription (TPRT). Recent studies have suggested that, in addition to TPRT, L1 elements occasionally utilize an alternative endonuclease-independent pathway for genomic integration. To determine whether an analogous mechanism exists for Alu elements, we have analyzed three publicly available primate genomes (human, chimpanzee and rhesus macaque) for endonuclease-independent recently integrated or lineage specific Alu insertions. We recovered twenty-three examples of such insertions and show that these insertions are recognizably different from classical TPRT-mediated Alu element integration. We suggest a role for this process in DNA double-strand break repair and present evidence to suggest its association with intra-chromosomal translocations, in-vitro RNA recombination (IVRR), and synthesis-dependent strand annealing (SDSA).
Collapse
Affiliation(s)
- Deepa Srikanta
- Department of Biological Sciences, Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, LA 70803, USA
| | | | | | | | | | | |
Collapse
|
48
|
Belancio VP, Hedges DJ, Deininger P. Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health. Genome Res 2008; 18:343-58. [PMID: 18256243 DOI: 10.1101/gr.5558208] [Citation(s) in RCA: 226] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transposable elements (TEs) have shared an exceptionally long coexistence with their host organisms and have come to occupy a significant fraction of eukaryotic genomes. The bulk of the expansion occurring within mammalian genomes has arisen from the activity of type I retrotransposons, which amplify in a "copy-and-paste" fashion through an RNA intermediate. For better or worse, the sequences of these retrotransposons are now wedded to the genomes of their mammalian hosts. Although there are several reported instances of the positive contribution of mobile elements to their host genomes, these discoveries have occurred alongside growing evidence of the role of TEs in human disease and genetic instability. Here we examine, with a particular emphasis on human retrotransposon activity, several newly discovered aspects of mammalian retrotransposon biology. We consider their potential impact on host biology as well as their ultimate implications for the nature of the TE-host relationship.
Collapse
Affiliation(s)
- Victoria P Belancio
- Tulane Cancer Center and Department of Epidemiology, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA
| | | | | |
Collapse
|
49
|
Suetsugu Y, Minami H, Shimomura M, Sasanuma SI, Narukawa J, Mita K, Yamamoto K. End-sequencing and characterization of silkworm (Bombyx mori) bacterial artificial chromosome libraries. BMC Genomics 2007; 8:314. [PMID: 17822570 PMCID: PMC2014780 DOI: 10.1186/1471-2164-8-314] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2007] [Accepted: 09/07/2007] [Indexed: 11/24/2022] Open
Abstract
Background We performed large-scale bacterial artificial chromosome (BAC) end-sequencing of two BAC libraries (an EcoRI- and a BamHI-digested library) and conducted an in silico analysis to characterize the obtained sequence data, to make them a useful resource for genomic research on the silkworm (Bombyx mori). Results More than 94000 BAC end sequences (BESs), comprising more than 55 Mbp and covering about 10.4% of the silkworm genome, were sequenced. Repeat-sequence analysis with known repeat sequences indicated that the long interspersed nuclear elements (LINEs) were abundant in BamHI BESs, whereas DNA-type elements were abundant in EcoRI BESs. Repeat-sequence analysis revealed that the abundance of LINEs might be due to a GC bias of the restriction sites and that the GC content of silkworm LINEs was higher than that of mammalian LINEs. In a BLAST-based sequence analysis of the BESs against two available whole-genome shotgun sequence data sets, more than 70% of the BESs had a BLAST hit with an identity of ≥ 99%. About 14% of EcoRI BESs and about 8% of BamHI BESs were paired-end clones with unique sequences at both ends. Cluster analysis of the BESs clarified the proportion of BESs containing protein-coding regions. Conclusion As a result of this characterization, the identified BESs will be a valuable resource for genomic research on Bombyx mori, for example, as a base for construction of a BAC-based physical map. The use of multiple complementary BAC libraries constructed with different restriction enzymes also makes the BESs a more valuable genomic resource. The GenBank accession numbers of the obtained end sequences are DE283657–DE378560.
Collapse
Affiliation(s)
- Yoshitaka Suetsugu
- National Institute of Agrobiological Sciences, 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
| | - Hiroshi Minami
- Mitsubishi Space Software Co. Ltd., 1-6-1 Takezono, Tsukuba, Ibaraki 305-0032, Japan
| | - Michihiko Shimomura
- Mitsubishi Space Software Co. Ltd., 1-6-1 Takezono, Tsukuba, Ibaraki 305-0032, Japan
| | - Shun-ichi Sasanuma
- National Institute of Agrobiological Sciences, 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
| | - Junko Narukawa
- National Institute of Agrobiological Sciences, 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
| | - Kazuei Mita
- National Institute of Agrobiological Sciences, 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
| | - Kimiko Yamamoto
- National Institute of Agrobiological Sciences, 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
| |
Collapse
|
50
|
Chen JM, Masson E, Macek M, Raguénès O, Piskackova T, Fercot B, Fila L, Cooper DN, Audrézet MP, Férec C. Detection of two Alu insertions in the CFTR gene. J Cyst Fibros 2007; 7:37-43. [PMID: 17531547 DOI: 10.1016/j.jcf.2007.04.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2007] [Revised: 04/12/2007] [Accepted: 04/13/2007] [Indexed: 01/08/2023]
Abstract
BACKGROUND LINE-1 (long interspersed element-1) or L1-mediated retrotransposition is a potent force in human genome evolution and an occasional cause of human genetic disease. Since the first report of two de novo L1 insertions in the F8 gene causing hemophilia A, more than 50 L1-mediated retrotranspositional events have been identified as causing human genetic disease. However, a significant bias has generally militated against the detection of these pathological events at autosomal loci. Based upon this and other observations, we surmised that some previously unresolved cystic fibrosis chromosomes might carry hitherto undetected L1-mediated retrotranspositional insertions at the CFTR locus. This study represents an attempt to identify such mutational events. METHODS 100 previously unresolved cystic fibrosis chromosomes were carefully reanalyzed using quantitative high-performance liquid chromatography (QHPLC). RESULTS Two simple Alu insertions were identified in the CFTR gene, within exons 16 and 17b respectively. CONCLUSIONS Our findings have not only revealed a previously unknown mutational mechanism responsible for cystic fibrosis but also represent an important addition to the already diverse spectrum of known CFTR gene mutations. Experience with the CFTR gene suggests that pathological L1-mediated retrotranspositional events may also have been overlooked at other gene loci and should always be considered in cases that appear to be refractory to analysis.
Collapse
|