51
|
Kvikstad EM, Piazza P, Taylor JC, Lunter G. A high throughput screen for active human transposable elements. BMC Genomics 2018; 19:115. [PMID: 29390960 PMCID: PMC5796560 DOI: 10.1186/s12864-018-4485-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 01/16/2018] [Indexed: 11/30/2022] Open
Abstract
Background Transposable elements (TEs) are mobile genetic sequences that randomly propagate within their host’s genome. This mobility has the potential to affect gene transcription and cause disease. However, TEs are technically challenging to identify, which complicates efforts to assess the impact of TE insertions on disease. Here we present a targeted sequencing protocol and computational pipeline to identify polymorphic and novel TE insertions using next-generation sequencing: TE-NGS. The method simultaneously targets the three subfamilies that are responsible for the majority of recent TE activity (L1HS, AluYa5/8, and AluYb8/9) thereby obviating the need for multiple experiments and reducing the amount of input material required. Results Here we describe the laboratory protocol and detection algorithm, and a benchmark experiment for the reference genome NA12878. We demonstrate a substantial enrichment for on-target fragments, and high sensitivity and precision to both reference and NA12878-specific insertions. We report 17 previously unreported loci for this individual which are supported by orthogonal long-read evidence, and we identify 1470 polymorphic and novel TEs in 12 additional samples that were previously undocumented in databases of insertion polymorphisms. Conclusions We anticipate that future applications of TE-NGS alongside exome sequencing of patients with sporadic disease will reduce the number of unresolved cases, and improve estimates of the contribution of TEs to human genetic disease. Electronic supplementary material The online version of this article (10.1186/s12864-018-4485-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Erika M Kvikstad
- Wellcome Trust Centre for Human Genetics, Oxford, UK. .,National Institute for Health Research Comprehensive Biomedical Research Centre, Oxford, UK.
| | - Paolo Piazza
- Wellcome Trust Centre for Human Genetics, Oxford, UK.,Department of Medicine, Imperial College London, London, UK
| | - Jenny C Taylor
- Wellcome Trust Centre for Human Genetics, Oxford, UK.,National Institute for Health Research Comprehensive Biomedical Research Centre, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, Oxford, UK
| |
Collapse
|
52
|
Mita P, Wudzinska A, Sun X, Andrade J, Nayak S, Kahler DJ, Badri S, LaCava J, Ueberheide B, Yun CY, Fenyö D, Boeke JD. LINE-1 protein localization and functional dynamics during the cell cycle. eLife 2018; 7:30058. [PMID: 29309036 PMCID: PMC5821460 DOI: 10.7554/elife.30058] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2017] [Accepted: 01/04/2018] [Indexed: 01/12/2023] Open
Abstract
LINE-1/L1 retrotransposon sequences comprise 17% of the human genome. Among the many classes of mobile genetic elements, L1 is the only autonomous retrotransposon that still drives human genomic plasticity today. Through its co-evolution with the human genome, L1 has intertwined itself with host cell biology. However, a clear understanding of L1’s lifecycle and the processes involved in restricting its insertion and intragenomic spread remains elusive. Here we identify modes of L1 proteins’ entrance into the nucleus, a necessary step for L1 proliferation. Using functional, biochemical, and imaging approaches, we also show a clear cell cycle bias for L1 retrotransposition that peaks during the S phase. Our observations provide a basis for novel interpretations about the nature of nuclear and cytoplasmic L1 ribonucleoproteins (RNPs) and the potential role of DNA replication in L1 retrotransposition. Only two percent of our genetic material or genome are occupied by genes, while between 60-70 percent are made up of hundreds of thousands of copies of very similar DNA sequences. These repetitive sequences evolved from genetic elements called transposons. Transposons are often referred to as ‘jumping genes’, as they can randomly move within the genome and thereby create dangerous mutations that may lead to cancer or other genetic diseases. LINE-1 is the only remaining active transposon in humans, and it expands by copying and pasting itself to new locations via a process called 'retrotransposition'. To do so, it is first transcribed into RNA – the molecules that help to make proteins – and then converted back into identical DNA sequences. Previous research has shown that LINE-1 can form complexes with a series of proteins, including the two encoded by LINE-1 RNA itself: ORF1p and ORF2p. The LINE-1 complexes can enter the nucleus of the cell and insert a new copy of LINE-1 into the genome. However, until now it was not known how they do this. To investigate this further, Mita et al. used human cancer cells grown in the lab and tracked LINE-1 during the different stages of the cell cycle. The results showed that LINE-1 enters the nucleus as the cell starts to divide and the membrane of the nucleus breaks down. The LINE-1 complexes are then retained in the nucleus while the membrane of the nucleus reforms. Later, as the cell duplicates its genetic material, LINE-1 starts to copy and paste itself. Mita et al., together with another group of researchers, also found that during this process, only LINE-1 RNA and ORF2p were found in the nucleus. This shows that the cell cycle dictates both where the LINE-1 complexes gather and when LINE-1 is active. A next step will be to further investigate how the ‘copy and paste’ mechanisms of LINE-1 and the two LINE-1 proteins are regulated during the cell cycle. In future, this may help to identify LINE-1’s role in processes like aging or in diseases such as cancer.
Collapse
Affiliation(s)
- Paolo Mita
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States
| | - Aleksandra Wudzinska
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States
| | - Xiaoji Sun
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States
| | - Joshua Andrade
- Proteomics laboratory, NYU Langone Health, New York, United States
| | - Shruti Nayak
- Proteomics laboratory, NYU Langone Health, New York, United States
| | - David J Kahler
- High Throughput Biology (HTB) Laboratory, NYU Langone Health, New York, United States
| | - Sana Badri
- Department of Pathology, NYU Langone Health, New York, United States
| | - John LaCava
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States.,Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, United States
| | - Beatrix Ueberheide
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States.,Proteomics laboratory, NYU Langone Health, New York, United States
| | - Chi Y Yun
- High Throughput Biology (HTB) Laboratory, NYU Langone Health, New York, United States
| | - David Fenyö
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States
| | - Jef D Boeke
- Institute of Systems Genetics (ISG), Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, United States
| |
Collapse
|
53
|
Dai Q, Shen Y, Wang Y, Wang X, Francisco JC, Luo Z, Lin C. Striking a balance: regulation of transposable elements by Zfp281 and Mll2 in mouse embryonic stem cells. Nucleic Acids Res 2017; 45:12301-12310. [PMID: 29036642 PMCID: PMC5716208 DOI: 10.1093/nar/gkx841] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Accepted: 09/12/2017] [Indexed: 01/22/2023] Open
Abstract
Transposable elements (TEs) compose about 40% of the murine genome. Retrotransposition of active TEs such as LINE-1 (L1) tremendously impacts genetic diversification and genome stability. Therefore, transcription and transposition activities of retrotransposons are tightly controlled. Here, we show that the Krüppel-like zinc finger protein Zfp281 directly binds and suppresses a subset of retrotransposons, including the active young L1 repeat elements, in mouse embryonic stem (ES) cells. In addition, we find that Zfp281-regulated L1s are highly enriched for 5-hydroxymethylcytosine (5hmC) and H3K4me3. The COMPASS-like H3K4 methyltransferase Mll2 is the major H3K4me3 methylase at the Zfp281-regulated L1s and required for their proper expression. Our studies also reveal that Zfp281 functions partially through recruiting the L1 regulators DNA hydroxymethylase Tet1 and Sin3A, and restricting Mll2 at these active L1s, leading to their balanced expression. In summary, our data indicate an instrumental role of Zfp281 in suppressing the young active L1s in mouse ES cells.
Collapse
Affiliation(s)
- Qian Dai
- Institute of Life Sciences, the Key Laboratory of Developmental Genes and Human Disease, Southeast University, Nanjing 210096, China
| | - Yang Shen
- Bioinformatics Group, A*STAR Genome Institute of Singapore, 60 Biopolis Street, Singapore 138672, Singapore
| | - Yan Wang
- Institute of Life Sciences, the Key Laboratory of Developmental Genes and Human Disease, Southeast University, Nanjing 210096, China
| | - Xin Wang
- Institute of Life Sciences, the Key Laboratory of Developmental Genes and Human Disease, Southeast University, Nanjing 210096, China
| | - Joel Celio Francisco
- Transcriptional Control in Development and Disease Laboratory, A*STAR Institute of Molecular and Cell Biology, 61 Biopolis Drive Proteos, Singapore 138673, Singapore
| | - Zhuojuan Luo
- Institute of Life Sciences, the Key Laboratory of Developmental Genes and Human Disease, Southeast University, Nanjing 210096, China
| | - Chengqi Lin
- Institute of Life Sciences, the Key Laboratory of Developmental Genes and Human Disease, Southeast University, Nanjing 210096, China.,Transcriptional Control in Development and Disease Laboratory, A*STAR Institute of Molecular and Cell Biology, 61 Biopolis Drive Proteos, Singapore 138673, Singapore
| |
Collapse
|
54
|
Gainetdinov I, Skvortsova Y, Kondratieva S, Funikov S, Azhikina T. Two modes of targeting transposable elements by piRNA pathway in human testis. RNA (NEW YORK, N.Y.) 2017; 23:1614-1625. [PMID: 28842508 PMCID: PMC5648030 DOI: 10.1261/rna.060939.117] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Accepted: 08/23/2017] [Indexed: 06/07/2023]
Abstract
PIWI proteins and their partner small RNAs, termed piRNAs, are known to control transposable elements (TEs) in the germline. Here, we provide evidence that in humans this control is exerted in two different modes. On the one hand, production of piRNAs specifically targeting evolutionarily youngest TEs (L1HS, L1PA2-L1PA6, LTR12C, SVA) is present both at prenatal and postnatal stages of spermatogenesis and is performed without involvement of piRNA clusters. On the other hand, at postnatal stages, piRNAs deriving from pachytene clusters target "older" TEs and thus complement cluster-independent piRNA production to achieve relevant targeting of virtually all TEs expressed in postnatal testis. We also find that converging transcription of antisense-oriented genes contributes to the origin of genic postnatal prepachytene clusters. Finally, while a fraction of pachytene piRNAs was previously shown to arise from long intergenic noncoding RNAs (lincRNAs, i.e., pachytene piRNA cluster primary transcripts), we ascertain that these are a specific set of lincRNAs that both possess distinguishing epigenetic features and are expressed exclusively in testis.
Collapse
Affiliation(s)
- Ildar Gainetdinov
- Department of Genomics and Postgenomic Technologies, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | - Yulia Skvortsova
- Department of Genomics and Postgenomic Technologies, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | - Sofia Kondratieva
- Department of Genomics and Postgenomic Technologies, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | - Sergey Funikov
- Department of Structural, Functional and Evolutionary Genomics, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
| | - Tatyana Azhikina
- Department of Genomics and Postgenomic Technologies, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| |
Collapse
|
55
|
Gardner EJ, Lam VK, Harris DN, Chuang NT, Scott EC, Pittard WS, Mills RE, Devine SE. The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology. Genome Res 2017; 27:1916-1929. [PMID: 28855259 PMCID: PMC5668948 DOI: 10.1101/gr.218032.116] [Citation(s) in RCA: 208] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Accepted: 08/07/2017] [Indexed: 01/22/2023]
Abstract
Mobile element insertions (MEIs) represent ∼25% of all structural variants in human genomes. Moreover, when they disrupt genes, MEIs can influence human traits and diseases. Therefore, MEIs should be fully discovered along with other forms of genetic variation in whole genome sequencing (WGS) projects involving population genetics, human diseases, and clinical genomics. Here, we describe the Mobile Element Locator Tool (MELT), which was developed as part of the 1000 Genomes Project to perform MEI discovery on a population scale. Using both Illumina WGS data and simulations, we demonstrate that MELT outperforms existing MEI discovery tools in terms of speed, scalability, specificity, and sensitivity, while also detecting a broader spectrum of MEI-associated features. Several run modes were developed to perform MEI discovery on local and cloud systems. In addition to using MELT to discover MEIs in modern humans as part of the 1000 Genomes Project, we also used it to discover MEIs in chimpanzees and ancient (Neanderthal and Denisovan) hominids. We detected diverse patterns of MEI stratification across these populations that likely were caused by (1) diverse rates of MEI production from source elements, (2) diverse patterns of MEI inheritance, and (3) the introgression of ancient MEIs into modern human genomes. Overall, our study provides the most comprehensive map of MEIs to date spanning chimpanzees, ancient hominids, and modern humans and reveals new aspects of MEI biology in these lineages. We also demonstrate that MELT is a robust platform for MEI discovery and analysis in a variety of experimental settings.
Collapse
Affiliation(s)
- Eugene J Gardner
- Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Vincent K Lam
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Daniel N Harris
- Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Nelson T Chuang
- Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Division of Gastroenterology, Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Emma C Scott
- Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - W Stephen Pittard
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, Georgia 30322, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA.,Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | | | - Scott E Devine
- Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| |
Collapse
|
56
|
Abstract
Transposable elements give rise to interspersed repeats, sequences that comprise most of our genomes. These mobile DNAs have been historically underappreciated - both because they have been presumed to be unimportant, and because their high copy number and variability pose unique technical challenges. Neither impediment now seems steadfast. Interest in the human mobilome has never been greater, and methods enabling its study are maturing at a fast pace. This Review describes the activity of transposable elements in human cancers, particularly long interspersed element-1 (LINE-1). LINE-1 sequences are self-propagating, protein-coding retrotransposons, and their activity results in somatically acquired insertions in cancer genomes. Altered expression of transposable elements and animation of genomic LINE-1 sequences appear to be hallmarks of cancer, and can be responsible for driving mutations in tumorigenesis.
Collapse
Affiliation(s)
- Kathleen H Burns
- Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| |
Collapse
|
57
|
Transposable elements in cancer. NATURE REVIEWS. CANCER 2017. [PMID: 28642606 DOI: 10.1038/nrc.2017.35+[doi]] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Transposable elements give rise to interspersed repeats, sequences that comprise most of our genomes. These mobile DNAs have been historically underappreciated - both because they have been presumed to be unimportant, and because their high copy number and variability pose unique technical challenges. Neither impediment now seems steadfast. Interest in the human mobilome has never been greater, and methods enabling its study are maturing at a fast pace. This Review describes the activity of transposable elements in human cancers, particularly long interspersed element-1 (LINE-1). LINE-1 sequences are self-propagating, protein-coding retrotransposons, and their activity results in somatically acquired insertions in cancer genomes. Altered expression of transposable elements and animation of genomic LINE-1 sequences appear to be hallmarks of cancer, and can be responsible for driving mutations in tumorigenesis.
Collapse
|
58
|
Kim SH, Kong Y, Bae YA. Recurrent emergence of structural variants of LTR retrotransposon CsRn1 evolving novel expression strategy and their selective expansion in a carcinogenic liver fluke, Clonorchis sinensis. Mol Biochem Parasitol 2017; 214:14-26. [PMID: 28322871 DOI: 10.1016/j.molbiopara.2017.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Revised: 03/13/2017] [Accepted: 03/13/2017] [Indexed: 11/19/2022]
Abstract
Autonomous retrotransposons, in which replication and transcription are coupled, encode the essential gag and pol genes as a fusion or separate overlapping form(s) that are expressed in single transcripts regulated by a common upstream promoter. The element-specific expression strategies have driven development of relevant translational recoding mechanisms including ribosomal frameshifting to satisfy the protein stoichiometry critical for the assembly of infectious virus-like particles. Retrotransposons with different recoding strategies exhibit a mosaic distribution pattern across the diverse families of reverse transcribing elements, even though their respective distributions are substantially skewed towards certain family groups. However, only a few investigations to date have focused on the emergence of retrotransposons evolving novel expression strategy and causal genetic drivers of the structural variants. In this study, the bulk of genomic and transcribed sequences of a Ty3/gypsy-like CsRn1 retrotransposon in Clonorchis sinensis were analyzed for the comprehensive examination of its expression strategy. Our results demonstrated that structural variants with single open reading frame (ORF) have recurrently emerged from precedential CsRn1 copies encoding overlapping gag-pol ORFs by a single-nucleotide insertion in an upstream region of gag stop codon. In the parasite genome, some of the newly evolved variants appeared to undergo proliferative burst as active master lineages together with their ancestral copies. The genetic event was similarly observed in Opisthorchis viverrini, the closest neighbor of C. sinensis, whereas the resulting structural variants might have failed to overcome purifying selection and comprised minor remnant copies in the Opisthorchis genome.
Collapse
Affiliation(s)
- Seon-Hee Kim
- Department of Microbiology, College of Medicine, Gachon University, 191 Hambakmoe-ro, Yeonsu-gu, Incheon 21936, Republic of Korea
| | - Yoon Kong
- Department of Molecular Parasitology, Sungkyunkwan University School of Medicine, 2066 Seobu-ro, Jangan-gu, Suwon 16419, Republic of Korea
| | - Young-An Bae
- Department of Microbiology, College of Medicine, Gachon University, 191 Hambakmoe-ro, Yeonsu-gu, Incheon 21936, Republic of Korea.
| |
Collapse
|
59
|
Human transposon insertion profiling: Analysis, visualization and identification of somatic LINE-1 insertions in ovarian cancer. Proc Natl Acad Sci U S A 2017; 114:E733-E740. [PMID: 28096347 DOI: 10.1073/pnas.1619797114] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Mammalian genomes are replete with interspersed repeats reflecting the activity of transposable elements. These mobile DNAs are self-propagating, and their continued transposition is a source of both heritable structural variation as well as somatic mutation in human genomes. Tailored approaches to map these sequences are useful to identify insertion alleles. Here, we describe in detail a strategy to amplify and sequence long interspersed element-1 (LINE-1, L1) retrotransposon insertions selectively in the human genome, transposon insertion profiling by next-generation sequencing (TIPseq). We also report the development of a machine-learning-based computational pipeline, TIPseqHunter, to identify insertion sites with high precision and reliability. We demonstrate the utility of this approach to detect somatic retrotransposition events in high-grade ovarian serous carcinoma.
Collapse
|
60
|
Sahakyan AB, Balasubramanian S. Single genome retrieval of context-dependent variability in mutation rates for human germline. BMC Genomics 2017; 18:81. [PMID: 28086752 PMCID: PMC5237266 DOI: 10.1186/s12864-016-3440-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Accepted: 12/19/2016] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Accurate knowledge of the core components of substitution rates is of vital importance to understand genome evolution and dynamics. By performing a single-genome and direct analysis of 39,894 retrotransposon remnants, we reveal sequence context-dependent germline nucleotide substitution rates for the human genome. RESULTS The rates are characterised through rate constants in a time-domain, and are made available through a dedicated program (Trek) and a stand-alone database. Due to the nature of the method design and the imposed stringency criteria, we expect our rate constants to be good estimates for the rates of spontaneous mutations. Benefiting from such data, we study the short-range nucleotide (up to 7-mer) organisation and the germline basal substitution propensity (BSP) profile of the human genome; characterise novel, CpG-independent, substitution prone and resistant motifs; confirm a decreased tendency of moieties with low BSP to undergo somatic mutations in a number of cancer types; and, produce a Trek-based estimate of the overall mutation rate in human. CONCLUSIONS The extended set of rate constants we report may enrich our resources and help advance our understanding of genome dynamics and evolution, with possible implications for the role of spontaneous mutations in the emergence of pathological genotypes and neutral evolution of proteomes.
Collapse
Affiliation(s)
- Aleksandr B Sahakyan
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| | - Shankar Balasubramanian
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK.
- School of Clinical Medicine, University of Cambridge, Cambridge, CB2 0SP, UK.
| |
Collapse
|
61
|
Rishishwar L, Wang L, Clayton EA, Mariño-Ramírez L, McDonald JF, Jordan IK. Population and clinical genetics of human transposable elements in the (post) genomic era. Mob Genet Elements 2017; 7:1-20. [PMID: 28228978 PMCID: PMC5305044 DOI: 10.1080/2159256x.2017.1280116] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 01/03/2017] [Accepted: 01/04/2017] [Indexed: 10/26/2022] Open
Abstract
Recent technological developments-in genomics, bioinformatics and high-throughput experimental techniques-are providing opportunities to study ongoing human transposable element (TE) activity at an unprecedented level of detail. It is now possible to characterize genome-wide collections of TE insertion sites for multiple human individuals, within and between populations, and for a variety of tissue types. Comparison of TE insertion site profiles between individuals captures the germline activity of TEs and reveals insertion site variants that segregate as polymorphisms among human populations, whereas comparison among tissue types ascertains somatic TE activity that generates cellular heterogeneity. In this review, we provide an overview of these new technologies and explore their implications for population and clinical genetic studies of human TEs. We cover both recent published results on human TE insertion activity as well as the prospects for future TE studies related to human evolution and health.
Collapse
Affiliation(s)
- Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia; Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Lu Wang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia
| | - Evan A Clayton
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; Ovarian Cancer Institute, Atlanta, GA, USA
| | - Leonardo Mariño-Ramírez
- PanAmerican Bioinformatics Institute, Cali, Colombia; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - John F McDonald
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; Ovarian Cancer Institute, Atlanta, GA, USA
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia; Applied Bioinformatics Laboratory, Atlanta, GA, USA
| |
Collapse
|
62
|
Ivancevic AM, Kortschak RD, Bertozzi T, Adelson DL. LINEs between Species: Evolutionary Dynamics of LINE-1 Retrotransposons across the Eukaryotic Tree of Life. Genome Biol Evol 2016; 8:3301-3322. [PMID: 27702814 PMCID: PMC5203782 DOI: 10.1093/gbe/evw243] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
LINE-1 (L1) retrotransposons are dynamic elements. They have the potential to cause great genomic change because of their ability to ‘jump’ around the genome and amplify themselves, resulting in the duplication and rearrangement of regulatory DNA. Active L1, in particular, are often thought of as tightly constrained, homologous and ubiquitous elements with well-characterized domain organization. For the past 30 years, model organisms have been used to define L1s as 6–8 kb sequences containing a 5′-UTR, two open reading frames working harmoniously in cis, and a 3′-UTR with a polyA tail. In this study, we demonstrate the remarkable and overlooked diversity of L1s via a comprehensive phylogenetic analysis of elements from over 500 species from widely divergent branches of the tree of life. The rapid and recent growth of L1 elements in mammalian species is juxtaposed against the diverse lineages found in other metazoans and plants. In fact, some of these previously unexplored mammalian species (e.g. snub-nosed monkey, minke whale) exhibit L1 retrotranspositional ‘hyperactivity’ far surpassing that of human or mouse. In contrast, non-mammalian L1s have become so varied that the current classification system seems to inadequately capture their structural characteristics. Our findings illustrate how both long-term inherited evolutionary patterns and random bursts of activity in individual species can significantly alter genomes, highlighting the importance of L1 dynamics in eukaryotes.
Collapse
Affiliation(s)
- Atma M Ivancevic
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| | - R Daniel Kortschak
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| | - Terry Bertozzi
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia.,Evolutionary Biology Unit, South Australian Museum, Adelaide, South Australia, Australia
| | - David L Adelson
- School of Biological Sciences, University of Adelaide, Adelaide, South Australia, Australia
| |
Collapse
|
63
|
Wang X, Liu X. Close ecological relationship among species facilitated horizontal transfer of retrotransposons. BMC Evol Biol 2016; 16:201. [PMID: 27717306 PMCID: PMC5055719 DOI: 10.1186/s12862-016-0767-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 09/27/2016] [Indexed: 01/01/2023] Open
Abstract
Background Horizontal transfer (HT) of genetic materials is increasingly being found in both animals and plants and mainly concerns transposable elements (TEs). Many crustaceans have big genome sizes and are thus likely to harbor high TE contents. Their habitat might offer them ample opportunities to exchange genetic materials with organisms that are ecologically close but taxonomically distant to them. Results In this study, we analyzed the transcriptome of Pacific white shrimp (Litopenaeus vannamei), an important economic crustacean, to explore traces of HT events. From a collection of newly assembled transcripts, we identified 395 high reliable TE transcripts, most of which were retrotransposon transcripts. One hundred fifty-seven of those transcripts showed highest similarity to sequences from non-arthropod organisms, including ray-finned fishes, mollusks and putative parasites. In total, 16 already known L. vannamei TE families are likely to be involved in horizontal transfer events. Phylogenetic analyses of 10 L. vannamei TE families and their homologues (protein sequences) revealed that L. vannamei TE families were generally more close to sequences from aquatic species. Furthermore, TEs from other aquatic species also tend to group together, although they are often distantly related in taxonomy. Sequences from parasites and microorganisms were also widely present, indicating their possible important roles in HT events. Expression profile analyses of transcripts in two NCBI BioProjects revealed that transcripts involved in HT events are likely to play important roles in antiviral immunity. More specifically, those transcripts might act as inhibitors of antiviral immunity. Conclusions Close ecological relationship, especially predation, might greatly facilitate HT events among aquatic species. This could be achieved through exchange of parasites and microorganisms, or through direct DNA flow. The occurrence of HT events may be largely incidental, but the effects could be beneficial for recipients. Electronic supplementary material The online version of this article (doi:10.1186/s12862-016-0767-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xianzong Wang
- Shaanxi Key Laboratory of Molecular Biology for Agriculture, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, Shaanxi, People's Republic of China
| | - Xiaolin Liu
- Shaanxi Key Laboratory of Molecular Biology for Agriculture, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, Shaanxi, People's Republic of China.
| |
Collapse
|
64
|
Doucet-O'Hare TT, Sharma R, Rodić N, Anders RA, Burns KH, Kazazian HH. Somatically Acquired LINE-1 Insertions in Normal Esophagus Undergo Clonal Expansion in Esophageal Squamous Cell Carcinoma. Hum Mutat 2016; 37:942-54. [PMID: 27319353 DOI: 10.1002/humu.23027] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Accepted: 06/01/2016] [Indexed: 12/16/2022]
Abstract
Squamous cell carcinoma of the esophagus (SCC) is the most common form of esophageal cancer in the world and is typically diagnosed at an advanced stage when successful treatment is challenging. Understanding the mutational profile of this cancer may identify new treatment strategies. Because somatic retrotransposition has been shown in tumors of the gastrointestinal system, we focused on LINE-1 (L1) mobilization as a source of genetic instability in this cancer. We hypothesized that retrotransposition is ongoing in SCC patients. The expression of L1 encoded proteins is necessary for retrotransposition to occur; therefore, we evaluated the expression of L1 open reading frame 1 protein (ORF1p). Using immunohistochemistry, we detected ORF1p expression in all four SCC cases evaluated. Using L1-seq, we identified and validated 74 somatic insertions in eight tumors of the nine evaluated. Of these, 12 insertions appeared to be somatic, not genetically inherited, and sub-clonal (i.e., present in less than one copy per genome equivalent) in the adjacent normal esophagus (NE), while clonal in the tumor. Our results indicate that L1 retrotransposition is active in SCC of the esophagus and that insertion events are present in histologically NE that expands clonally in the subsequent tumor.
Collapse
Affiliation(s)
- Tara T Doucet-O'Hare
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland.,Predoctoral Training Program in Human Genetics, McKusick-Nathans Institute of Genetic Medicine, Baltimore, Maryland.,National Institutes of Health: National Institute of Neurological Disorders and Stroke, Bethesda, Maryland
| | - Reema Sharma
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Nemanja Rodić
- Dermatology Department, Yale School of Medicine, New Haven, Connecticut
| | - Robert A Anders
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Kathleen H Burns
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland.,Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Haig H Kazazian
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
| |
Collapse
|
65
|
Scott EC, Gardner EJ, Masood A, Chuang NT, Vertino PM, Devine SE. A hot L1 retrotransposon evades somatic repression and initiates human colorectal cancer. Genome Res 2016; 26:745-55. [PMID: 27197217 PMCID: PMC4889970 DOI: 10.1101/gr.201814.115] [Citation(s) in RCA: 175] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 04/19/2016] [Indexed: 01/16/2023]
Abstract
Although human LINE-1 (L1) elements are actively mobilized in many cancers, a role for somatic L1 retrotransposition in tumor initiation has not been conclusively demonstrated. Here, we identify a novel somatic L1 insertion in the APC tumor suppressor gene that provided us with a unique opportunity to determine whether such insertions can actually initiate colorectal cancer (CRC), and if so, how this might occur. Our data support a model whereby a hot L1 source element on Chromosome 17 of the patient's genome evaded somatic repression in normal colon tissues and thereby initiated CRC by mutating the APC gene. This insertion worked together with a point mutation in the second APC allele to initiate tumorigenesis through the classic two-hit CRC pathway. We also show that L1 source profiles vary considerably depending on the ancestry of an individual, and that population-specific hot L1 elements represent a novel form of cancer risk.
Collapse
Affiliation(s)
- Emma C Scott
- Graduate Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Eugene J Gardner
- Graduate Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Ashiq Masood
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Nelson T Chuang
- Graduate Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Division of Gastroenterology, Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| | - Paula M Vertino
- Department of Radiation Oncology, Emory University School of Medicine, Atlanta, Georgia 30322, USA; Winship Cancer Institute, Emory University, Atlanta, Georgia 30322, USA
| | - Scott E Devine
- Graduate Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA; Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
| |
Collapse
|
66
|
Upregulated LINE-1 Activity in the Fanconi Anemia Cancer Susceptibility Syndrome Leads to Spontaneous Pro-inflammatory Cytokine Production. EBioMedicine 2016; 8:184-194. [PMID: 27428429 PMCID: PMC4919473 DOI: 10.1016/j.ebiom.2016.05.005] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2016] [Revised: 04/08/2016] [Accepted: 05/05/2016] [Indexed: 11/30/2022] Open
Abstract
Fanconi Anemia (FA) is a genetic disorder characterized by elevated cancer susceptibility and pro-inflammatory cytokine production. Using SLX4FANCP deficiency as a working model, we questioned the trigger for chronic inflammation in FA. We found that absence of SLX4 caused cytoplasmic DNA accumulation, including sequences deriving from active Long INterspersed Element-1 (LINE-1), triggering the cGAS-STING pathway to elicit interferon (IFN) expression. In agreement, absence of SLX4 leads to upregulated LINE-1 retrotransposition. Importantly, similar results were obtained with the FANCD2 upstream activator of SLX4. Furthermore, treatment of FA cells with the Tenofovir reverse transcriptase inhibitor (RTi), that prevents endogenous retrotransposition, decreased both accumulation of cytoplasmic DNA and pro-inflammatory signaling. Collectively, our data suggest a contribution of endogenous RT activities to the generation of immunogenic cytoplasmic nucleic acids responsible for inflammation in FA. The additional observation that RTi decreased pro-inflammatory cytokine production induced by DNA replication stress-inducing drugs further demonstrates the contribution of endogenous RTs to sustaining chronic inflammation. Altogether, our data open perspectives in the prevention of adverse effects of chronic inflammation in tumorigenesis. Cytoplasmic DNA, comprising LINE-1-derived sequences, elicits IFN expression via the cGAS-STING pathway in SLX4-deficiency. Members of the Fanconi Anemia DNA repair pathway negatively regulate LINE-1 retrotransposition. Endogenous reverse transcriptase activities contribute to spontaneous and chemotherapy-induced inflammation.
Chronic inflammation favors tumorigenesis, negatively influencing patient prognosis. Yet, the underlying molecular mechanisms are poorly understood. Here, we show that increased endogenous retroelement-associated reverse transcriptase activity contributes to generate immunogenic cytoplasmic nucleic acids susceptible of triggering a pro-inflammatory response in the Fanconi Anemia (FA) cancer susceptibility syndrome. In addition, treatment of FA cells or of cells exposed to replication stress inducing drugs, with a reverse transcriptase inhibitor, decreases pro-inflammatory signals. Altogether our data suggest the involvement of endogenous reverse transcriptase activities in sustaining pervasive chronic inflammation, opening therapeutic perspectives for preventing its impact on tumorigenesis.
Collapse
|
67
|
Philippe C, Vargas-Landin DB, Doucet AJ, van Essen D, Vera-Otarola J, Kuciak M, Corbin A, Nigumann P, Cristofari G. Activation of individual L1 retrotransposon instances is restricted to cell-type dependent permissive loci. eLife 2016; 5. [PMID: 27016617 PMCID: PMC4866827 DOI: 10.7554/elife.13926] [Citation(s) in RCA: 105] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 03/25/2016] [Indexed: 12/26/2022] Open
Abstract
LINE-1 (L1) retrotransposons represent approximately one sixth of the human genome, but only the human-specific L1HS-Ta subfamily acts as an endogenous mutagen in modern humans, reshaping both somatic and germline genomes. Due to their high levels of sequence identity and the existence of many polymorphic insertions absent from the reference genome, the transcriptional activation of individual genomic L1HS-Ta copies remains poorly understood. Here we comprehensively mapped fixed and polymorphic L1HS-Ta copies in 12 commonly-used somatic cell lines, and identified transcriptional and epigenetic signatures allowing the unambiguous identification of active L1HS-Ta copies in their genomic context. Strikingly, only a very restricted subset of L1HS-Ta loci - some being polymorphic among individuals - significantly contributes to the bulk of L1 expression, and these loci are differentially regulated among distinct cell lines. Thus, our data support a local model of L1 transcriptional activation in somatic cells, governed by individual-, locus-, and cell-type-specific determinants. DOI:http://dx.doi.org/10.7554/eLife.13926.001 Retrotransposons, also known as jumping genes, have invaded the genomes of most living organisms. These fragments of DNA have the ability to move or copy themselves from one location of a chromosome to another. Depending on where they insert themselves, retrotransposons can modify the sequence of nearby genes, which can alter or even abolish their activity. Although these genetic modifications have contributed significantly to the evolution of different species, retrotransposons can also have detrimental effects; for example, by causing new cases of genetic diseases. Adult human cells have a number of mechanisms that work to keep the activity of retrotransposons at a very low level. However, in many types of cancers retrotransposons escape these defense mechanisms and ‘jump’ actively. This is thought to contribute to the development and spread of cancerous tumors. To understand how jumping genes are mobilized, a fundamental question must be answered: is the high jumping gene activity observed in some cell types a result of activating many copies of the retrotransposons, or only a few of them? This question has been difficult to address because there are more than one hundred copies of retrotransposons that could potentially move in humans, many of which have not even been referenced in the human genome map. Furthermore, each copy is almost identical to another one, making it difficult to discriminate between them. Philippe et al. have now developed an approach that can map where individual retrotransposons are located in the genome of normal and cancerous cells and measure how active these jumping genes are. This revealed that only a very restricted number of them are active in any given cell type. Moreover, different subsets of jumping genes are active in different cell types, and their locations in the genome often do not overlap. Thus, whether jumping genes are activated depends on the cell type and their position in the genome. These results are in contrast to the prevalent view that retrotransposons are activated in a more widespread manner across the genome, at least in cancerous cells. Overall, Philippe et al.’s findings pave the way towards characterizing the chromosome regions in which retrotransposons are frequently activated and understanding how they contribute to cancer and other diseases. DOI:http://dx.doi.org/10.7554/eLife.13926.002
Collapse
Affiliation(s)
- Claude Philippe
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France.,FHU OncoAge, University of Nice-Sophia Antipolis, Nice, France
| | - Dulce B Vargas-Landin
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France.,Ecole Normale Supérieure, Paris, France
| | - Aurélien J Doucet
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France.,FHU OncoAge, University of Nice-Sophia Antipolis, Nice, France
| | - Dominic van Essen
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France
| | - Jorge Vera-Otarola
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France
| | - Monika Kuciak
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France.,Ecole Normale Supérieure de Lyon, Lyon, France
| | | | - Pilvi Nigumann
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France
| | - Gaël Cristofari
- INSERM U1081, CNRS UMR 7284, Institute for Research on Cancer and Aging of Nice, Nice, France.,Faculty of Medicine, University of Nice-Sophia Antipolis, Nice, France.,FHU OncoAge, University of Nice-Sophia Antipolis, Nice, France
| |
Collapse
|
68
|
Abstract
Transposable elements have had a profound impact on the structure and function of mammalian genomes. The retrotransposon Long INterspersed Element-1 (LINE-1 or L1), by virtue of its replicative mobilization mechanism, comprises ∼17% of the human genome. Although the vast majority of human LINE-1 sequences are inactive molecular fossils, an estimated 80-100 copies per individual retain the ability to mobilize by a process termed retrotransposition. Indeed, LINE-1 is the only active, autonomous retrotransposon in humans and its retrotransposition continues to generate both intra-individual and inter-individual genetic diversity. Here, we briefly review the types of transposable elements that reside in mammalian genomes. We will focus our discussion on LINE-1 retrotransposons and the non-autonomous Short INterspersed Elements (SINEs) that rely on the proteins encoded by LINE-1 for their mobilization. We review cases where LINE-1-mediated retrotransposition events have resulted in genetic disease and discuss how the characterization of these mutagenic insertions led to the identification of retrotransposition-competent LINE-1s in the human and mouse genomes. We then discuss how the integration of molecular genetic, biochemical, and modern genomic technologies have yielded insight into the mechanism of LINE-1 retrotransposition, the impact of LINE-1-mediated retrotransposition events on mammalian genomes, and the host cellular mechanisms that protect the genome from unabated LINE-1-mediated retrotransposition events. Throughout this review, we highlight unanswered questions in LINE-1 biology that provide exciting opportunities for future research. Clearly, much has been learned about LINE-1 and SINE biology since the publication of Mobile DNA II thirteen years ago. Future studies should continue to yield exciting discoveries about how these retrotransposons contribute to genetic diversity in mammalian genomes.
Collapse
|
69
|
Identification of transposable element-mediated deletions in 27 Korean individuals based on whole genome sequencing data. Genes Genomics 2016. [DOI: 10.1007/s13258-015-0370-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
70
|
Rahbari R, Badge RM. Combining Amplification Typing of L1 Active Subfamilies (ATLAS) with High-Throughput Sequencing. Methods Mol Biol 2016; 1400:95-106. [PMID: 26895048 DOI: 10.1007/978-1-4939-3372-3_6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
With the advent of new generations of high-throughput sequencing technologies, the catalog of human genome variants created by retrotransposon activity is expanding rapidly. However, despite these advances in describing L1 diversity and the fact that L1 must retrotranspose in the germline or prior to germline partitioning to be evolutionarily successful, direct assessment of de novo L1 retrotransposition in the germline or early embryogenesis has not been achieved for endogenous L1 elements. A direct study of de novo L1 retrotransposition into susceptible loci within sperm DNA (Freeman et al., Hum Mutat 32(8):978-988, 2011) suggested that the rate of L1 retrotransposition in the germline is much lower than previously estimated (<1 in 400 individuals versus 1 in 9 individuals (Kazazian, Nat Genet 22(2):130, 1999). Based on these revised estimates of the L1 retrotransposition rate, we modified the ATLAS L1 display technique (Badge et al., Am J Hum Genet 72(4):823-838, 2003) to investigate de novo L1 retrotransposition in human genomes. In this chapter, we describe how we combined a high-coverage ATLAS variant with high-throughput sequencing, achieving 11-25× sequence depth per single amplicon, to study L1 retrotransposition in whole genome amplified (WGA) DNAs.
Collapse
Affiliation(s)
- Raheleh Rahbari
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Richard M Badge
- Department of Genetics, University of Leicester, Leicester, UK
| |
Collapse
|
71
|
Naufer MN, Callahan KE, Cook PR, Perez-Gonzalez CE, Williams MC, Furano AV. L1 retrotransposition requires rapid ORF1p oligomerization, a novel coiled coil-dependent property conserved despite extensive remodeling. Nucleic Acids Res 2015; 44:281-93. [PMID: 26673717 PMCID: PMC4705668 DOI: 10.1093/nar/gkv1342] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 11/17/2015] [Indexed: 12/16/2022] Open
Abstract
Detailed mechanistic understanding of L1 retrotransposition is sparse, particularly with respect to ORF1p, a coiled coil-mediated homotrimeric nucleic acid chaperone that can form tightly packed oligomers on nucleic acids. Although the coiled coil motif is highly conserved, it is uniquely susceptible to evolutionary change. Here we studied three ORF1 proteins: a modern human one (111p), its resuscitated primate ancestor (555p) and a mosaic modern protein (151p) wherein 9 of the 30 coiled coil substitutions retain their ancestral state. While 111p and 555p equally supported retrotransposition, 151p was inactive. Nonetheless, they were fully active in bulk assays of nucleic acid interactions including chaperone activity. However, single molecule assays showed that 151p trimers form stably bound oligomers on ssDNA at <1/10th the rate of the active proteins, revealing that oligomerization rate is a novel critical parameter of ORF1p activity in retrotransposition conserved for at least the last 25 Myr of primate evolution.
Collapse
Affiliation(s)
- M Nabuan Naufer
- Northeastern University, Department of Physics, Boston, MA 02115, USA
| | - Kathryn E Callahan
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health, Bethesda, MD 20892, USA
| | - Pamela R Cook
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health, Bethesda, MD 20892, USA
| | - Cesar E Perez-Gonzalez
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health, Bethesda, MD 20892, USA
| | - Mark C Williams
- Northeastern University, Department of Physics, Boston, MA 02115, USA
| | - Anthony V Furano
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
72
|
Furano AV, Cook PR. The challenge of ORF1p phosphorylation: Effects on L1 activity and its host. Mob Genet Elements 2015; 6:e1119927. [PMID: 27066302 DOI: 10.1080/2159256x.2015.1119927] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 11/04/2015] [Accepted: 11/06/2015] [Indexed: 01/09/2023] Open
Abstract
L1 non-LTR retrotransposons are autonomously replicating genetic elements that profoundly affected their mammalian hosts having generated upwards of 40% or more of their genomes. Although deleterious, they remain active in most mammalian species, and thus the nature and consequences of the interaction between L1 and its host remain major issues for mammalian biology. We recently showed that L1 activity requires phosphorylation of one of its 2 encoded proteins, ORF1p, a nucleic acid chaperone and the major component of the L1RNP retrotransposition intermediate. Reversible protein phosphorylation, which is effected by interacting cascades of protein kinases, phosphatases, and ancillary proteins, is a mainstay in the regulation and coordination of many basic biological processes. Therefore, demonstrating phosphorylation-dependence of L1 activity substantially enlarged our knowledge of the scope of L1 / host interaction. However, developing a mechanistic understanding of what this means for L1 or its host is a formidable challenge, which we discuss here.
Collapse
Affiliation(s)
- Anthony V Furano
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health , Bethesda, MD, USA
| | - Pamela R Cook
- Laboratory of Cellular and Molecular Biology, NIDDK, National Institutes of Health , Bethesda, MD, USA
| |
Collapse
|
73
|
Kagawa T, Oka A, Kobayashi Y, Hiasa Y, Kitamura T, Sakugawa H, Adachi Y, Anzai K, Tsuruya K, Arase Y, Hirose S, Shiraishi K, Shiina T, Sato T, Wang T, Tanaka M, Hayashi H, Kawabe N, Robinson PN, Zemojtel T, Mine T. Recessive inheritance of population-specific intronic LINE-1 insertion causes a rotor syndrome phenotype. Hum Mutat 2015; 36:327-32. [PMID: 25546334 DOI: 10.1002/humu.22745] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 12/15/2014] [Indexed: 11/06/2022]
Abstract
Sequences of long-interspersed elements (LINE-1, L1) make up ∼17% of the human genome. De novo insertions of retrotransposition-active L1s can result in genetic diseases. It has been recently shown that the homozygous inactivation of two adjacent genes SLCO1B1 and SLCO1B3 encoding organic anion transporting polypeptides OATP1B1 and OATP1B3 causes a benign recessive disease presenting with conjugated hyperbilirubinemia, Rotor syndrome. Here, we examined SLCO1B1 and SLCO1B3 genes in six Japanese diagnosed with Rotor syndrome on the basis of laboratory data and laparoscopy. All six Japanese patients were homozygous for the c.1738C>T nonsense mutation in SLCO1B1 and homozygous for the insertion of a ∼6.1-kbp L1 retrotransposon in intron 5 of SLCO1B3, which altogether make up a Japanese-specific haplotype. RNA analysis revealed that the L1 insertion induced deleterious splicing resulting in SLCO1B3 transcripts lacking exon 5 or exons 5-7 and containing premature stop codons. The expression of OATP1B1 and OATP1B3 proteins was not detected in liver tissues. This is the first documented case of a population-specific polymorphic intronic L1 transposon insertion contributing to molecular etiology of recessive genetic disease. Since L1 activity in human genomes is currently seen as a major source of individual genetic variation, further investigations are warranted to determine whether this phenomenon results in other autosomal-recessive diseases.
Collapse
Affiliation(s)
- Tatehiro Kagawa
- Division of Gastroenterology, Department of Internal Medicine, Tokai University School of Medicine, Isehara, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
74
|
Smith RWA, Monroe C, Bolnick DA. Detection of Cytosine methylation in ancient DNA from five native american populations using bisulfite sequencing. PLoS One 2015; 10:e0125344. [PMID: 26016479 PMCID: PMC4445908 DOI: 10.1371/journal.pone.0125344] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 03/14/2015] [Indexed: 12/18/2022] Open
Abstract
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
Collapse
Affiliation(s)
- Rick W A Smith
- Department of Anthropology, University of Texas at Austin, Austin, Texas, United States of America
| | - Cara Monroe
- Department of Anthropology, Washington State University, Pullman, Washington, United States of America; Department of Anthropology, University of California Santa Barbara, Santa Barbara, California, United States of America
| | - Deborah A Bolnick
- Department of Anthropology, University of Texas at Austin, Austin, Texas, United States of America; Population Research Center, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
75
|
Abstract
Although members of the L1 (LINE-1) clade of non-LTR retrotransposons can be deleterious, the L1 clade has remained active in most mammals for ∼100 million years and generated almost 40% of the human genome. The details of L1-host interaction are largely unknown, however. Here we report that L1 activity requires phosphorylation of the protein encoded by the L1 ORF1 (ORF1p). Critical phospho-acceptor residues (two serines and two threonines) reside in four conserved proline-directed protein kinase (PDPK) target sites. The PDPK family includes mitogen-activated protein kinases and cyclin-dependent kinases. Mutation of any PDPK phospho-acceptor inhibits L1 retrotransposition. The phosphomimetic aspartic acid can restore activity at the two serine sites, but not at either threonine site, where it is strongly inhibitory. ORF1p also contains conserved PDPK docking sites, which promote specific interaction of PDPKs with their targets. As expected, mutations in these sites also inhibit L1 activity. PDPK mutations in ORF1p that inactivate L1 have no significant effect on the ability of ORF1p to anneal RNA in vitro, an important biochemical property of the protein. We show that phosphorylated PDPK sites in ORF1p are required for an interaction with the peptidyl prolyl isomerase 1 (Pin1), a critical component of PDPK-mediated regulation. Pin1 acts via isomerization of proline side chains at phosphorylated PDPK motifs, thereby affecting substrate conformation and activity. Our demonstration that L1 activity is dependent on and integrated with cellular phosphorylation regulatory cascades significantly increases our understanding of interactions between L1 and its host.
Collapse
|
76
|
Streva VA, Jordan VE, Linker S, Hedges DJ, Batzer MA, Deininger PL. Sequencing, identification and mapping of primed L1 elements (SIMPLE) reveals significant variation in full length L1 elements between individuals. BMC Genomics 2015; 16:220. [PMID: 25887476 PMCID: PMC4381410 DOI: 10.1186/s12864-015-1374-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 02/20/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND There are over a half a million copies of L1 retroelements in the human genome which are responsible for as much as 0.5% of new human genetic diseases. Most new L1 inserts arise from young source elements that are polymorphic in the human genome. Highly active polymorphic "hot" L1 source elements have been shown to be capable of extremely high levels of mobilization and result in numerous instances of disease. Additionally, hot polymorphic L1s have been described to be highly active within numerous cancer genomes. These hot L1s result in mutagenesis by insertion of new L1 copies elsewhere in the genome, but also have been shown to generate additional full length L1 insertions which are also hot and able to further retrotranspose. Through this mechanism, hot L1s may amplify within a tumor and result in a continued cycle of mutagenesis. RESULTS AND CONCLUSIONS We have developed a method to detect full-length, polymorphic L1 elements using a targeted next generation sequencing approach, Sequencing Identification and Mapping of Primed L1 Elements (SIMPLE). SIMPLE has 94% sensitivity and detects nearly all full-length L1 elements in a genome. SIMPLE will allow researchers to identify hot mutagenic full-length L1s as potential drivers of genome instability. Using SIMPLE we find that the typical individual has approximately 100 non-reference, polymorphic L1 elements in their genome. These elements are at relatively low population frequencies relative to previously identified polymorphic L1 elements and demonstrate the tremendous diversity in potentially active L1 elements in the human population.
Collapse
Affiliation(s)
- Vincent A Streva
- Tulane Cancer Center and Department of Epidemiology, Tulane University, New Orleans, LA, USA. .,Present Address: Division of Infectious Diseases, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
| | - Vallmer E Jordan
- Department of Biology, Louisiana State University, Baton Rouge, LA, USA.
| | - Sara Linker
- Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - Dale J Hedges
- Department of Internal Medicine, The Ohio State University, Columbus, OH, USA.
| | - Mark A Batzer
- Department of Biology, Louisiana State University, Baton Rouge, LA, USA.
| | - Prescott L Deininger
- Tulane Cancer Center and Department of Epidemiology, Tulane University, New Orleans, LA, USA.
| |
Collapse
|
77
|
White TB, McCoy AM, Streva VA, Fenrich J, Deininger PL. A droplet digital PCR detection method for rare L1 insertions in tumors. Mob DNA 2014; 5:30. [PMID: 25598847 PMCID: PMC4297411 DOI: 10.1186/s13100-014-0030-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Accepted: 11/19/2014] [Indexed: 01/17/2023] Open
Abstract
Background The active human mobile element, long interspersed element 1 (L1) currently populates human genomes in excess of 500,000 copies per haploid genome. Through its mobility via a process called target primed reverse transcription (TPRT), L1 mobilization has resulted in over 100 de novo cases of human disease and has recently been associated with various cancer types. Large advances in high-throughput sequencing (HTS) technology have allowed for an increased understanding of the role of L1 in human cancer; however, researchers are still limited by the ability to validate potentially rare L1 insertion events detected by HTS that may occur in only a small fraction of tumor cells. Additionally, HTS detection of rare events varies greatly as a function of read depth, and new tools for de novo element discovery are needed to fill in gaps created by HTS. Results We have employed droplet digital PCR (ddPCR) to detect rare L1 loci in mosaic human genomes. Our assay allows for the detection of L1 insertions as rare as one cell in every 10,000. Conclusions ddPCR represents a robust method to be used alongside HTS techniques for detecting, validating and quantitating rare L1 insertion events in tumors and other tissues. Electronic supplementary material The online version of this article (doi:10.1186/s13100-014-0030-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Travis B White
- Tulane Cancer Center, 1430 Tulane Avenue, New Orleans, LA 70112 USA
| | - Adam M McCoy
- Bio-Rad Laboratories, 750 Alfred Nobel Drive, Hercules, CA 94547 USA.,Present Address: Eureka Genomics, 2000 Alfred Nobel Drive, Hercules, CA 94547 USA
| | - Vincent A Streva
- Tulane Cancer Center, 1430 Tulane Avenue, New Orleans, LA 70112 USA.,Present Address: Division of Infectious Diseases, Boston Children's Hospital and Harvard Medical School, 300 Longwood Avenue, Boston, MA 02115 USA
| | - Joshua Fenrich
- Bio-Rad Laboratories, 750 Alfred Nobel Drive, Hercules, CA 94547 USA
| | | |
Collapse
|
78
|
Boissinot S, Alvarez L, Giraldo-Ramirez J, Tollis M. Neutral nuclear variation in Baboons (genus Papio) provides insights into their evolutionary and demographic histories. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2014; 155:621-34. [PMID: 25234435 PMCID: PMC4339869 DOI: 10.1002/ajpa.22618] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Revised: 08/20/2014] [Accepted: 09/08/2014] [Indexed: 12/31/2022]
Abstract
Baboons (genus Papio) are distributed over most of sub-Saharan Africa and in the southern portion of the Arabian Peninsula. Six distinct morphotypes, with clearly defined geographic distributions, are recognized (the olive, chacma, yellow, Guinea, Kinda, and hamadryas baboons). The evolutionary relationships among baboon forms have long been a controversial issue. Phylogenetic analyses based on mitochondrial DNA sequences revealed that the modern baboon morphotypes are mitochondrially paraphyletic or polyphyletic. The discordance between mitochondrial lineages and morphology is indicative of extensive introgressive hybridization between ancestral baboon populations. To gain insights into the evolutionary relationships among morphotypes and their demographic history, we performed an analysis of nuclear variation in baboons. We sequenced 13 noncoding, putatively neutral, nuclear regions, and scored the presence/absence of 18 polymorphic transposable elements in a sample of 45 baboons belonging to five of the six recognized baboon forms. We found that the chacma baboon is the sister-taxon to all other baboons and the yellow baboon is the sister-taxon to an unresolved northern clade containing the olive, Guinea, and hamadryas baboons. We estimated that the diversification of baboons occurred entirely in the Pleistocene, the earliest split dating ∼1.5 million years ago, and that baboons have experienced relatively large and constant effective population sizes for most of their evolutionary history (∼30,000 to 95,000 individuals).
Collapse
Affiliation(s)
- Stéphane Boissinot
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
- Ecology, Evolutionary Biology and Behavior, Graduate Center, the City University of New York, New York, NY, USA
| | - Lauren Alvarez
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
| | | | - Marc Tollis
- Department of Biology, Queens College, the City University of New York, Queens, NY, USA
- Ecology, Evolutionary Biology and Behavior, Graduate Center, the City University of New York, New York, NY, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| |
Collapse
|
79
|
Tollis M, Boissinot S. Lizards and LINEs: selection and demography affect the fate of L1 retrotransposons in the genome of the green anole (Anolis carolinensis). Genome Biol Evol 2014; 5:1754-68. [PMID: 24013105 PMCID: PMC3787681 DOI: 10.1093/gbe/evt133] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Autonomous retrotransposons lacking long terminal repeats (LTR) account for much of the variation in genome size and structure among vertebrates. Mammalian genomes contain hundreds of thousands of non-LTR retrotransposon copies, mostly resulting from the amplification of a single clade known as L1. The genomes of teleost fish and squamate reptiles contain a much more diverse array of non-LTR retrotransposon families, whereas copy number is relatively low. The majority of non-LTR retrotransposon insertions in nonmammalian vertebrates also appear to be very recent, suggesting strong purifying selection limits the accumulation of non-LTR retrotransposon copies. It is however unclear whether this turnover model, originally proposed in Drosophila, applies to nonmammalian vertebrates. Here, we studied the population dynamics of L1 in the green anole lizard (Anolis carolinensis). We found that although most L1 elements are recent in this genome, truncated insertions accumulate readily, and many are fixed at both the population and species level. In contrast, full-length L1 insertions are found at lower population frequencies, suggesting that the turnover model only applies to longer L1 elements in Anolis. We also found that full-length L1 inserts are more likely to be fixed in populations of small effective size, suggesting that the strength of purifying selection against deleterious alleles is highly dependent on host demographic history. Similar mechanisms seem to be controlling the fate of non-LTR retrotransposons in both Anolis and teleostean fish, which suggests that mammals have considerably diverged from the ancestral vertebrate in terms of how they interact with their intragenomic parasites.
Collapse
Affiliation(s)
- Marc Tollis
- Biology Department, Queens College, City University of New York, Flushing
| | | |
Collapse
|
80
|
Gnanakkan VP, Jaffe AE, Dai L, Fu J, Wheelan SJ, Levitsky HI, Boeke JD, Burns KH. TE-array--a high throughput tool to study transposon transcription. BMC Genomics 2013; 14:869. [PMID: 24325565 PMCID: PMC3878892 DOI: 10.1186/1471-2164-14-869] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2013] [Accepted: 11/27/2013] [Indexed: 12/17/2022] Open
Abstract
Background Although transposable element (TE) derived DNA accounts for more than half of mammalian genomes and initiates a significant proportion of RNA transcripts, high throughput methods are rarely leveraged specifically to detect expression from interspersed repeats. Results To characterize the contribution of transposons to mammalian transcriptomes, we developed a custom microarray platform with probes covering known human and mouse transposons in both sense and antisense orientations. We termed this platform the “TE-array” and profiled TE repeat expression in a panel of normal mouse tissues. Validation with nanoString® and RNAseq technologies demonstrated that TE-array is an effective method. Our data show that TE transcription occurs preferentially from the sense strand and is regulated in highly tissue-specific patterns. Conclusions Our results are consistent with the hypothesis that transposon RNAs frequently originate within genomic TE units and do not primarily accumulate as a consequence of random ‘read-through’ from gene promoters. Moreover, we find TE expression is highly dependent on the tissue context. This suggests that TE expression may be related to tissue-specific chromatin states or cellular phenotypes. We anticipate that TE-array will provide a scalable method to characterize transposable element RNAs.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Jef D Boeke
- The Institute of Genetic Medicine, The Johns Hopkins University School of Medicine, 733 North Broadway, Miller Research Building (MRB) Room 469, Baltimore, MD 21205, USA.
| | | |
Collapse
|
81
|
Macfarlane CM, Collier P, Rahbari R, Beck CR, Wagstaff JF, Igoe S, Moran JV, Badge RM. Transduction-specific ATLAS reveals a cohort of highly active L1 retrotransposons in human populations. Hum Mutat 2013; 34:974-85. [PMID: 23553801 DOI: 10.1002/humu.22327] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Accepted: 03/15/2013] [Indexed: 11/09/2022]
Abstract
Long INterspersed Element-1 (LINE-1 or L1) retrotransposons are the only autonomously active transposable elements in the human genome. The average human genome contains ∼80-100 active L1s, but only a subset of these L1s are highly active or 'hot'. Human L1s are closely related in sequence, making it difficult to decipher progenitor/offspring relationships using traditional phylogenetic methods. However, L1 mRNAs can sometimes bypass their own polyadenylation signal and instead utilize fortuitous polyadenylation signals in 3' flanking genomic DNA. Retrotransposition of the resultant mRNAs then results in lineage specific sequence "tags" (i.e., 3' transductions) that mark the descendants of active L1 progenitors. Here, we developed a method (Transduction-Specific Amplification Typing of L1 Active Subfamilies or TS-ATLAS) that exploits L1 3' transductions to identify active L1 lineages in a genome-wide context. TS-ATLAS enabled the characterization of a putative active progenitor of one L1 lineage that includes the disease causing L1 insertion L1RP , and the identification of new retrotransposition events within two other "hot" L1 lineages. Intriguingly, the analysis of the newly discovered transduction lineage members suggests that L1 polyadenylation, even within a lineage, is highly stochastic. Thus, TS-ATLAS provides a new tool to explore the dynamics of L1 lineage evolution and retrotransposon biology.
Collapse
|
82
|
Sookdeo A, Hepp CM, McClure MA, Boissinot S. Revisiting the evolution of mouse LINE-1 in the genomic era. Mob DNA 2013; 4:3. [PMID: 23286374 PMCID: PMC3600994 DOI: 10.1186/1759-8753-4-3] [Citation(s) in RCA: 115] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Accepted: 10/25/2012] [Indexed: 11/10/2022] Open
Abstract
Background LINE-1 (L1) is the dominant category of transposable elements in placental mammals. L1 has significantly affected the size and structure of all mammalian genomes and understanding the nature of the interactions between L1 and its mammalian host remains a question of crucial importance in comparative genomics. For this reason, much attention has been dedicated to the evolution of L1. Among the most studied elements is the mouse L1 which has been the subject of a number of studies in the 1980s and 1990s. These seminal studies, performed in the pre-genomic era when only a limited number of L1 sequences were available, have significantly improved our understanding of L1 evolution. Yet, no comprehensive study on the evolution of L1 in mouse has been performed since the completion of this genome sequence. Results Using the Genome Parsing Suite we performed the first evolutionary analysis of mouse L1 over the entire length of the element. This analysis indicates that the mouse L1 has recruited novel 5’UTR sequences more frequently than previously thought and that the simultaneous activity of non-homologous promoters seems to be one of the conditions for the co-existence of multiple L1 families or lineages. In addition the exchange of genetic information between L1 families is not limited to the 5’UTR as evidence of inter-family recombination was observed in ORF1, ORF2, and the 3’UTR. In contrast to the human L1, there was little evidence of rapid amino-acid replacement in the coiled-coil of ORF1, although this region is structurally unstable. We propose that the structural instability of the coiled-coil domain might be adaptive and that structural changes in this region are selectively equivalent to the rapid evolution at the amino-acid level reported in the human lineage. Conclusions The pattern of evolution of L1 in mouse shows some similarity with human suggesting that the nature of the interactions between L1 and its host might be similar in these two species. Yet, some notable differences, particularly in the evolution of ORF1, suggest that the molecular mechanisms involved in host-L1 interactions might be different in these two species.
Collapse
Affiliation(s)
- Akash Sookdeo
- Department of Biology, Queens College, the City University of New York, 65-30 Kissena Boulevard, Flushing, NY 11367-1597, USA.
| | | | | | | |
Collapse
|
83
|
Kim YJ, Lee J, Han K. Transposable Elements: No More 'Junk DNA'. Genomics Inform 2012; 10:226-33. [PMID: 23346034 PMCID: PMC3543922 DOI: 10.5808/gi.2012.10.4.226] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 11/16/2012] [Accepted: 11/17/2012] [Indexed: 01/03/2023] Open
Abstract
Since the advent of whole-genome sequencing, transposable elements (TEs), just thought to be 'junk' DNA, have been noticed because of their numerous copies in various eukaryotic genomes. Many studies about TEs have been conducted to discover their functions in their host genomes. Based on the results of those studies, it has been generally accepted that they have a function to cause genomic and genetic variations. However, their infinite functions are not fully elucidated. Through various mechanisms, including de novo TE insertions, TE insertion-mediated deletions, and recombination events, they manipulate their host genomes. In this review, we focus on Alu, L1, human endogenous retrovirus, and short interspersed element/variable number of tandem repeats/Alu (SVA) elements and discuss how they have affected primate genomes, especially the human and chimpanzee genomes, since their divergence.
Collapse
Affiliation(s)
- Yun-Ji Kim
- Department of Nanobiomedical Science, WCU Research Center, Dankook University, Cheonan 330-714, Korea
| | | | | |
Collapse
|
84
|
Solyom S, Ewing AD, Rahrmann EP, Doucet T, Nelson HH, Burns MB, Harris RS, Sigmon DF, Casella A, Erlanger B, Wheelan S, Upton KR, Shukla R, Faulkner GJ, Largaespada DA, Kazazian HH. Extensive somatic L1 retrotransposition in colorectal tumors. Genome Res 2012; 22:2328-38. [PMID: 22968929 PMCID: PMC3514663 DOI: 10.1101/gr.145235.112] [Citation(s) in RCA: 197] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
L1 retrotransposons comprise 17% of the human genome and are its only autonomous mobile elements. Although L1-induced insertional mutagenesis causes Mendelian disease, their mutagenic load in cancer has been elusive. Using L1-targeted resequencing of 16 colorectal tumor and matched normal DNAs, we found that certain cancers were excessively mutagenized by human-specific L1s, while no verifiable insertions were present in normal tissues. We confirmed de novo L1 insertions in malignancy by both validating and sequencing 69/107 tumor-specific insertions and retrieving both 5′ and 3′ junctions for 35. In contrast to germline polymorphic L1s, all insertions were severely 5′ truncated. Validated insertion numbers varied from up to 17 in some tumors to none in three others, and correlated with the age of the patients. Numerous genes with a role in tumorigenesis were targeted, including ODZ3, ROBO2, PTPRM, PCM1, and CDH11. Thus, somatic retrotransposition may play an etiologic role in colorectal cancer.
Collapse
Affiliation(s)
- Szilvia Solyom
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
85
|
Extracellular vesicles and their convergence with viral pathways. Adv Virol 2012; 2012:767694. [PMID: 22888349 PMCID: PMC3410301 DOI: 10.1155/2012/767694] [Citation(s) in RCA: 89] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 06/06/2012] [Indexed: 01/01/2023] Open
Abstract
Extracellular vesicles (microvesicles), such as exosomes and shed microvesicles, contain a variety of molecules including proteins, lipids, and nucleic acids. Microvesicles appear mostly to originate from multivesicular bodies or to bud from the plasma membrane. Here, we review the convergence of microvesicle biogenesis and aspects of viral assembly and release pathways. Herpesviruses and retroviruses, amongst others, recruit several elements from the microvesicle biogenesis pathways for functional virus release. In addition, noninfectious pleiotropic virus-like vesicles can be released, containing viral and cellular components. We highlight the heterogeneity of microvesicle function during viral infection, addressing microvesicles that can either block or enhance infection, or cause immune dysregulation through bystander action in the immune system. Finally, endogenous retrovirus and retrotransposon elements deposited in our genomes millions of years ago can be released from cells within microvesicles, suggestive of a viral origin of the microvesicle system or perhaps of an evolutionary conserved system of virus-vesicle codependence. More research is needed to further elucidate the complex function of the various microvesicles produced during viral infection, possibly revealing new therapeutic intervention strategies.
Collapse
|
86
|
Retargeting sleeping beauty transposon insertions by engineered zinc finger DNA-binding domains. Mol Ther 2012; 20:1852-62. [PMID: 22776959 DOI: 10.1038/mt.2012.126] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The Sleeping Beauty (SB) transposon is a nonviral, integrating vector system with proven efficacy in preclinical animal models, and thus holds promise for future clinical applications. However, SB has a close-to-random insertion profile that could lead to genotoxic effects, thereby presenting a potential safety issue. We evaluated zinc finger (ZF) DNA-binding domains (DBDs) for their abilities to introduce a bias into SB's insertion profile. E2C, that binds a unique site in the erbB-2 gene, mediated locus-specific transposon insertions at low frequencies. A novel ZF targeting LINE1 repeats, ZF-B, showed specific binding to an 18-bp site represented by ~12,000 copies in the human genome. We mapped SB insertions using linear-amplification (LAM)-PCR and Illumina sequencing. Targeted insertions with ZF-B peaked at approximately fourfold enrichment of transposition around ZF-B binding sites yielding ~45% overall frequency of insertion into LINE1. A decrease in the ZF-B dataset with respect to transposon insertions in genes was found, suggesting that LINE1 repeats act as a sponge that "soak up" a fraction of SB insertions and thereby redirect them away from genes. Improvements in ZF technology and a careful choice of targeted genomic regions may improve the safety profile of SB for future clinical applications.
Collapse
|
87
|
Ishizaka Y, Okudaira N, Tamura M, Iijima K, Shimura M, Goto M, Okamura T. Modes of retrotransposition of long interspersed element-1 by environmental factors. Front Microbiol 2012; 3:191. [PMID: 22666219 PMCID: PMC3364524 DOI: 10.3389/fmicb.2012.00191] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2012] [Accepted: 05/10/2012] [Indexed: 11/13/2022] Open
Abstract
Approximately 42% of the human genome is composed of endogenous retroelements, and the major retroelement component, long interspersed element-1 (L1), comprises ∼17% of the total genome. A single human cell has more than 5 × 10(5) copies of L1, 80∼100 copies of which are competent for retrotransposition (RTP). Notably, L1 can induce RTP of other retroelements, such as Alu and SVA, and is believed to function as a driving force of evolution. Although L1-RTP during early embryogenesis has been highlighted in the literature, recent observations revealed that L1-RTP also occurs in somatic cells. However, little is known about how environmental factors induce L1-RTP. Here, we summarize our current understanding of the mechanism of L1-RTP in somatic cells. We have focused on the mode of L1-RTP that is dependent on the basic helix-loop-helix/per-arnt-sim (bHLH/PAS) family of transcription factors. Along with the proposed function of bHLH/PAS proteins in environmental adaptation, we discuss the functional linking of L1-RTP and bHLH/PAS proteins for environmental adaptation and evolution.
Collapse
Affiliation(s)
- Yukihito Ishizaka
- Department of Intractable Diseases, National Center for Global Health and Medicine Tokyo, Japan
| | | | | | | | | | | | | |
Collapse
|
88
|
Nilsson MA, Janke A, Murchison EP, Ning Z, Hallström BM. Expansion of CORE-SINEs in the genome of the Tasmanian devil. BMC Genomics 2012; 13:172. [PMID: 22559330 PMCID: PMC3403934 DOI: 10.1186/1471-2164-13-172] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 05/06/2012] [Indexed: 11/22/2022] Open
Abstract
Background The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species’ survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. Results The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. Conclusions The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome.
Collapse
Affiliation(s)
- Maria A Nilsson
- LOEWE-Biodiversity and Climate Research Center, BiK-F, Senckenberganlage 25, Frankfurt am Main D-60325, Germany.
| | | | | | | | | |
Collapse
|
89
|
Muñoz-Lopez M, Garcia-Cañadas M, Macia A, Morell S, Garcia-Perez JL. Analysis of LINE-1 expression in human pluripotent cells. Methods Mol Biol 2012; 873:113-25. [PMID: 22528351 DOI: 10.1007/978-1-61779-794-1_7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Half of the human genome is composed of repeated DNA, and some types are mobile within our genome (transposons and retrotransposons). Despite their abundance, only a small fraction of them are currently active in our genome (Long Interspersed Element-1 (LINE-1), Alu, and SVA elements). LINE-1 or L1 elements are a family of active non-LTR retrotransposons, the ongoing mobilization of which still impacts our genome. As selfish DNA elements, L1 activity is more prominent in early human development, where new insertions would be transmitted to the progeny. Here, we describe the conventional methods aimed to determine the expression level of LINE-1 elements in pluripotent human cells.
Collapse
Affiliation(s)
- Martin Muñoz-Lopez
- Andalusian Stem Cell Bank, Consejeria de Salud Junta de Andalucia, Seville, Spain
| | | | | | | | | |
Collapse
|
90
|
Abstract
Transposons are DNA sequences capable of moving in genomes. Early evidence showed their accumulation in many species and suggested their continued activity in at least isolated organisms. In the past decade, with the development of various genomic technologies, it has become abundantly clear that ongoing activity is the rule rather than the exception. Active transposons of various classes are observed throughout plants and animals, including humans. They continue to create new insertions, have an enormous variety of structural and functional impact on genes and genomes, and play important roles in genome evolution. Transposon activities have been identified and measured by employing various strategies. Here, we summarize evidence of current transposon activity in various plant and animal genomes.
Collapse
Affiliation(s)
- Cheng Ran Lisa Huang
- Institute of Genetic Medicine and High Throughput Biology Center, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Kathleen H. Burns
- Department of Pathology, Department of Oncology, and High Throughput Biology Center, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Jef D. Boeke
- Molecular Biology and Genetics, Institute of Genetic Medicine, and High Throughput Biology Center, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| |
Collapse
|
91
|
Rosser JM, An W. L1 expression and regulation in humans and rodents. Front Biosci (Elite Ed) 2012; 4:2203-25. [PMID: 22202032 DOI: 10.2741/537] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Long interspersed elements type 1 (LINE-1s, or L1s) have impacted mammalian genomes at multiple levels. L1 transcription is mainly controlled by its 5' untranslated region (5'UTR), which differs significantly among active human and rodent L1 families. In this review, L1 expression and its regulation are examined in the context of human and rodent development. First, endogenous L1 expression patterns in three different species-human, rat, and mouse-are compared and contrasted. A detailed account of relevant experimental evidence is presented according to the source material, such as cell lines, tumors, and normal somatic and germline tissues from different developmental stages. Second, factors involved in the regulation of L1 expression at both transcriptional and posttranscriptional levels are discussed. These include transcription factors, DNA methylation, PIWI-interacting RNAs (piRNAs), RNA interference (RNAi), and posttranscriptional host factors. Similarities and differences between human and rodent L1s are highlighted. Third, recent findings from transgenic mouse models of L1 are summarized and contrasted with those from endogenous L1 studies. Finally, the challenges and opportunities for L1 mouse models are discussed.
Collapse
Affiliation(s)
- James M Rosser
- School of Molecular Biosciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164, USA
| | | |
Collapse
|
92
|
Solyom S, Ewing AD, Hancks DC, Takeshima Y, Awano H, Matsuo M, Kazazian HH. Pathogenic orphan transduction created by a nonreference LINE-1 retrotransposon. Hum Mutat 2011; 33:369-71. [PMID: 22095564 DOI: 10.1002/humu.21663] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Accepted: 11/10/2011] [Indexed: 11/08/2022]
Abstract
Long INterspersed Element-1 (LINE-1) retrotransposons comprise 17% of the human genome, and move by a potentially mutagenic "copy and paste" mechanism via an RNA intermediate. Recently, the retrotransposition-mediated insertion of a new transcript was described as a novel cause of genetic disease, Duchenne muscular dystrophy, in a Japanese male. The inserted sequence was presumed to derive from a single-copy, noncoding RNA transcribed from chromosome 11q22.3 that retrotransposed into the dystrophin gene. Here, we demonstrate that a nonreference full-length LINE-1 is situated in the proband and maternal genome at chromosome 11q22.3, directly upstream of the sequence, whose copy was inserted into the dystrophin gene. This LINE-1 is highly active in a cell culture assay. LINE-1 insertions are often associated with 3' transduction of adjacent genomic sequences. Thus, the likely explanation for the mutagenic insertion is a LINE-1-mediated 3' transduction with severe 5' truncation. This is the first example of LINE-1-induced human disease caused by an "orphan" 3' transduction.
Collapse
Affiliation(s)
- Szilvia Solyom
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | | | | | | | | | | | | |
Collapse
|
93
|
Piñeyro D, López-Panadès E, Lucena-Pérez M, Casacuberta E. Transcriptional analysis of the HeT-A retrotransposon in mutant and wild type stocks reveals high sequence variability at Drosophila telomeres and other unusual features. BMC Genomics 2011; 12:573. [PMID: 22111838 PMCID: PMC3235214 DOI: 10.1186/1471-2164-12-573] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Accepted: 11/23/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Telomere replication in Drosophila depends on the transposition of a domesticated retroelement, the HeT-A retrotransposon. The sequence of the HeT-A retrotransposon changes rapidly resulting in differentiated subfamilies. This pattern of sequence change contrasts with the essential function with which the HeT-A is entrusted and brings about questions concerning the extent of sequence variability, the telomere contribution of different subfamilies, and whether wild type and mutant Drosophila stocks show different HeT-A scenarios. RESULTS A detailed study on the variability of HeT-A reveals that both the level of variability and the number of subfamilies are higher than previously reported. Comparisons between GIII, a strain with longer telomeres, and its parental strain Oregon-R indicate that both strains have the same set of HeT-A subfamilies. Finally, the presence of a highly conserved splicing pattern only in its antisense transcripts indicates a putative regulatory, functional or structural role for the HeT-A RNA. Interestingly, our results also suggest that most HeT-A copies are actively expressed regardless of which telomere and where in the telomere they are located. CONCLUSIONS Our study demonstrates how the HeT-A sequence changes much faster than previously reported resulting in at least nine different subfamilies most of which could actively contribute to telomere extension in Drosophila. Interestingly, the only significant difference observed between Oregon-R and GIII resides in the nature and proportion of the antisense transcripts, suggesting a possible mechanism that would in part explain the longer telomeres of the GIII stock.
Collapse
Affiliation(s)
- David Piñeyro
- Institute of Evolutionary Biology (CSIC-UPF), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| | - Elisenda López-Panadès
- Institute of Evolutionary Biology (CSIC-UPF), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| | - María Lucena-Pérez
- Institute of Evolutionary Biology (CSIC-UPF), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| | - Elena Casacuberta
- Institute of Evolutionary Biology (CSIC-UPF), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| |
Collapse
|
94
|
The NF1 gene contains hotspots for L1 endonuclease-dependent de novo insertion. PLoS Genet 2011; 7:e1002371. [PMID: 22125493 PMCID: PMC3219598 DOI: 10.1371/journal.pgen.1002371] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Accepted: 09/19/2011] [Indexed: 12/19/2022] Open
Abstract
Long interspersed (L1) and Alu elements are actively amplified in the human genome through retrotransposition of their RNA intermediates by the -100 still retrotranspositionally fully competent L1 elements. Retrotransposition can cause inherited disease if such an element is inserted near or within a functional gene. Using direct cDNA sequencing as the primary assay for comprehensive NF1 mutation analysis, we uncovered in 18 unrelated index patients splicing alterations not readily explained at the genomic level by an underlying point-mutation or deletion. Improved PCR protocols avoiding allelic drop-out of the mutant alleles uncovered insertions of fourteen Alu elements, three L1 elements, and one poly(T) stretch to cause these splicing defects. Taken together, the 18 pathogenic L1 endonuclease-mediated de novo insertions represent the largest number of this type of mutations characterized in a single human gene. Our findings show that retrotransposon insertions account for as many as -0.4% of all NF1 mutations. Since altered splicing was the main effect of the inserted elements, the current finding was facilitated by the use of RNA-based mutation analysis protocols, resulting in improved detection compared to gDNA-based approaches. Six different insertions clustered in a relatively small 1.5-kb region (NF1 exons 21(16)-23(18)) within the 280-kb NF1 gene. Furthermore, three different specific integration sites, one of them located in this cluster region, were each used twice, i.e. NM_000267.3(NF1):c.1642-1_1642 in intron 14(10c), NM_000267.3(NF1):c.2835_2836 in exon 21(16), and NM_000267.3(NF1):c.4319_4320 in exon 33(25). Identification of three loci that each served twice as integration site for independent retrotransposition events as well as 1.5-kb cluster region harboring six independent insertions supports the notion of non-random insertion of retrotransposons in the human genome. Currently, little is known about which features make sites particularly vulnerable to L1 EN-mediated insertions. The here identified integration sites may serve to elucidate these features in future studies.
Collapse
|
95
|
Beck CR, Garcia-Perez JL, Badge RM, Moran JV. LINE-1 elements in structural variation and disease. Annu Rev Genomics Hum Genet 2011; 12:187-215. [PMID: 21801021 DOI: 10.1146/annurev-genom-082509-141802] [Citation(s) in RCA: 394] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The completion of the human genome reference sequence ushered in a new era for the study and discovery of human transposable elements. It now is undeniable that transposable elements, historically dismissed as junk DNA, have had an instrumental role in sculpting the structure and function of our genomes. In particular, long interspersed element-1 (LINE-1 or L1) and short interspersed elements (SINEs) continue to affect our genome, and their movement can lead to sporadic cases of disease. Here, we briefly review the types of transposable elements present in the human genome and their mechanisms of mobility. We next highlight how advances in DNA sequencing and genomic technologies have enabled the discovery of novel retrotransposons in individual genomes. Finally, we discuss how L1-mediated retrotransposition events impact human genomes.
Collapse
Affiliation(s)
- Christine R Beck
- Department of Human Genetics, University of MIchigan Medical School, Ann Arbor, Michigan 48109-5618, USA.
| | | | | | | |
Collapse
|
96
|
Callahan KE, Hickman AB, Jones CE, Ghirlando R, Furano AV. Polymerization and nucleic acid-binding properties of human L1 ORF1 protein. Nucleic Acids Res 2011; 40:813-27. [PMID: 21937507 PMCID: PMC3258132 DOI: 10.1093/nar/gkr728] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The L1 (LINE 1) retrotransposable element encodes two proteins, ORF1p and ORF2p. ORF2p is the L1 replicase, but the role of ORF1p is unknown. Mouse ORF1p, a coiled-coil-mediated trimer of ∼42-kDa monomers, binds nucleic acids and has nucleic acid chaperone activity. We purified human L1 ORF1p expressed in insect cells and made two findings that significantly advance our knowledge of the protein. First, in the absence of nucleic acids, the protein polymerizes under the very conditions (0.05 M NaCl) that are optimal for high (∼1 nM)-affinity nucleic acid binding. The non-coiled-coil C-terminal half mediates formation of the polymer, an active conformer that is instantly resolved to trimers, or multimers thereof, by nucleic acid. Second, the protein has a biphasic effect on mismatched double-stranded DNA, a proxy chaperone substrate. It protects the duplex from dissociation at 37°C before eventually melting it when largely polymeric. Therefore, polymerization of ORF1p seemingly affects its interaction with nucleic acids. Additionally, polymerization of ORF1p at its translation site could explain the heretofore-inexplicable phenomenon of cis preference-the favored retrotransposition of the actively translated L1 transcript, which is essential for L1 survival.
Collapse
Affiliation(s)
- Kathryn E Callahan
- The Laboratory of Molecular and Cellular Biology, National Institue of Diabetes, Digestive, and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | |
Collapse
|
97
|
Freeman P, Macfarlane C, Collier P, Jeffreys AJ, Badge RM. L1 hybridization enrichment: a method for directly accessing de novo L1 insertions in the human germline. Hum Mutat 2011; 32:978-88. [PMID: 21560187 PMCID: PMC3412225 DOI: 10.1002/humu.21533] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2011] [Accepted: 04/25/2011] [Indexed: 11/14/2022]
Abstract
Long interspersed nuclear element 1 (L1) retrotransposons are the only autonomously mobile human transposable elements. L1 retrotransposition has shaped our genome via insertional mutagenesis, sequence transduction, pseudogene formation, and ectopic recombination. However, L1 germline retrotransposition dynamics are poorly understood because de novo insertions occur very rarely: the frequency of disease-causing retrotransposon insertions suggests that one insertion event occurs in roughly 18–180 gametes. The method described here recovers full-length L1 insertions by using hybridization enrichment to capture L1 sequences from multiplex PCR-amplified DNA. Enrichment is achieved by hybridizing L1-specific biotinylated oligonucleotides to complementary molecules, followed by capture on streptavidin-coated paramagnetic beads. We show that multiplex, long-range PCR can amplify single molecules containing full-length L1 insertions for recovery by hybridization enrichment. We screened 600 µg of sperm DNA from one donor, but no bone fide de novo L1 insertions were found, suggesting a L1 retrotransposition frequency of <1 insertion in 400 haploid genomes. This lies below the lower bound of previous estimates, and indicates that L1 insertion, at least into the loci studied, is very rare in the male germline. It is a paradox that L1 replication is ongoing in the face of such apparently low activity. Hum Mutat 32:1–11, 2011. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Peter Freeman
- Department of Genetics, University of Leicester, University Road, Leicester, United Kingdom
| | | | | | | | | |
Collapse
|
98
|
Muñoz-Lopez M, Macia A, Garcia-Cañadas M, Badge RM, Garcia-Perez JL. An epi [c] genetic battle: LINE-1 retrotransposons and intragenomic conflict in humans. Mob Genet Elements 2011; 1:122-127. [PMID: 22016860 DOI: 10.4161/mge.1.2.16730] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 05/27/2011] [Indexed: 01/14/2023] Open
Abstract
The ongoing activity of the human retrotransposon Long Interspersed Element 1 (LINE-1 or L1) continues to impact the human genome in various ways. Throughout evolution, mammalian and primate genomes have been under selection to generate strategies to reduce the activity of selfish DNA like L1. Similarly, selfish DNA has evolved to elude these containment systems. This intragenomic conflict has left many inactive versions of LINEs and other Transposable Elements (TEs) littering the human genome, which together account for roughly half of our DNA. Here, we survey the distinct mechanisms operating in the human genome that seem to reduce the mobility of L1s. In addition, we discuss recent findings that strongly suggest epigenetic mechanisms specifically regulate L1 activity in pluripotent human cells.
Collapse
Affiliation(s)
- Martin Muñoz-Lopez
- Department of Human DNA Variability; GENYO (Centre Pfizer-University of Granada-Junta de Andalucía of Genomics and Oncology); Granada, Spain
| | | | | | | | | |
Collapse
|
99
|
Ray DA, Batzer MA. Reading TE leaves: new approaches to the identification of transposable element insertions. Genome Res 2011; 21:813-20. [PMID: 21632748 PMCID: PMC3106314 DOI: 10.1101/gr.110528.110] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Transposable elements (TEs) are a tremendous source of genome instability and genetic variation. Of particular interest to investigators of human biology and human evolution are retrotransposon insertions that are recent and/or polymorphic in the human population. As a consequence, the ability to assay large numbers of polymorphic TEs in a given genome is valuable. Five recent manuscripts each propose methods to scan whole human genomes to identify, map, and, in some cases, genotype polymorphic retrotransposon insertions in multiple human genomes simultaneously. These technologies promise to revolutionize our ability to analyze human genomes for TE-based variation important to studies of human variability and human disease. Furthermore, the approaches hold promise for researchers interested in nonhuman genomic variability. Herein, we explore the methods reported in the manuscripts and discuss their applications to aspects of human biology and the biology of other organisms.
Collapse
Affiliation(s)
- David A. Ray
- Department of Biochemistry and Molecular Biology, Mississippi State University, Mississippi State, Mississippi 39762, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| |
Collapse
|
100
|
Lukic S, Chen K. Human piRNAs are under selection in Africans and repress transposable elements. Mol Biol Evol 2011; 28:3061-7. [PMID: 21613236 PMCID: PMC3199439 DOI: 10.1093/molbev/msr141] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Piwi-interacting RNAs (piRNAs) are a recently discovered class of 24- to 30-nt noncoding RNAs whose best-understood function is to repress transposable elements (TEs) in animal germ lines. In humans, TE-derived sequences comprise ∼45% of the genome and there are several active TE families, including LINE-1 and Alu elements, which are a significant source of de novo mutations and intrapopulation variability. In the “ping-pong model,” piRNAs are thought to alternatively cleave sense and antisense TE transcripts in a positive feedback loop. Because piRNAs are poorly conserved between closely related species, including human and chimpanzee, we took a population genomics approach to study piRNA function and evolution. We found strong statistical evidence that piRNA sequences are under selective constraint in African populations. We then mapped the piRNA sequences to human TE sequences and found strong correlations between the age of each LINE-1 and Alu subfamily and the number of piRNAs mapping to the subfamily. This result supports the idea that piRNAs function as repressors of TEs in humans. Finally, we observed a significant depletion of piRNA matches in the reverse transcriptase region of the consensus human LINE-1 element but not of the consensus mouse LINE-1 element. This result suggests that reverse transcriptase might have an endogenous role specific to humans. Overall, our results elucidate the function and evolution of piRNAs in humans and highlight the utility of population genomics analysis for studying this rapidly evolving genetic system.
Collapse
Affiliation(s)
- Sergio Lukic
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | |
Collapse
|