1
|
Moore EC, Thomas GWC, Mortimer S, Kopania EEK, Hunnicutt KE, Clare-Salzler ZJ, Larson EL, Good JM. The evolution of widespread recombination suppression on the dwarf hamster (Phodopus) X chromosome. Genome Biol Evol 2022; 14:6596369. [PMID: 35642315 PMCID: PMC9185382 DOI: 10.1093/gbe/evac080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2022] [Indexed: 11/24/2022] Open
Abstract
The X chromosome of therian mammals shows strong conservation among distantly related species, limiting insights into the distinct selective processes that have shaped sex chromosome evolution. We constructed a chromosome-scale de novo genome assembly for the Siberian dwarf hamster (Phodopus sungorus), a species reported to show extensive recombination suppression across an entire arm of the X chromosome. Combining a physical genome assembly based on shotgun and long-range proximity ligation sequencing with a dense genetic map, we detected widespread suppression of female recombination across ∼65% of the Phodopus X chromosome. This region of suppressed recombination likely corresponds to the Xp arm, which has previously been shown to be highly heterochromatic. Using additional sequencing data from two closely related species (P. campbelli and P. roborovskii), we show that recombination suppression on Xp appears to be independent of major structural rearrangements. The suppressed Xp arm was enriched for several transposable element families and de-enriched for genes primarily expressed in placenta, but otherwise showed similar gene densities, expression patterns, and rates of molecular evolution when compared to the recombinant Xq arm. Phodopus Xp gene content and order was also broadly conserved relative to the more distantly related rat X chromosome. These data suggest that widespread suppression of recombination has likely evolved through the transient induction of facultative heterochromatin on the Phodopus Xp arm without major changes in chromosome structure or genetic content. Thus, substantial changes in the recombination landscape have so far had relatively subtle influences on patterns of X-linked molecular evolution in these species.
Collapse
Affiliation(s)
- Emily C Moore
- Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
| | - Gregg W C Thomas
- Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
| | - Sebastian Mortimer
- Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
| | - Emily E K Kopania
- Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
| | - Kelsie E Hunnicutt
- Department of Biological Sciences, The University of Denver, Denver, Colorado, 80208, USA
| | | | - Erica L Larson
- Department of Biological Sciences, The University of Denver, Denver, Colorado, 80208, USA
| | - Jeffrey M Good
- Division of Biological Sciences, The University of Montana, Missoula, Montana, 59812, USA
| |
Collapse
|
2
|
Almeida MV, Vernaz G, Putman AL, Miska EA. Taming transposable elements in vertebrates: from epigenetic silencing to domestication. Trends Genet 2022; 38:529-553. [DOI: 10.1016/j.tig.2022.02.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 02/14/2022] [Accepted: 02/15/2022] [Indexed: 12/20/2022]
|
3
|
Abstract
Transposable elements (TEs) are mobile DNA sequences that propagate within genomes. Through diverse invasion strategies, TEs have come to occupy a substantial fraction of nearly all eukaryotic genomes, and they represent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic behavior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species.
Collapse
Affiliation(s)
- Jonathan N Wells
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; ,
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; ,
| |
Collapse
|
4
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
5
|
Wu X, Luo C, Hu L, Chen X, Chen Y, Fan J, Cheng CY, Sun F. Unraveling epigenomic abnormality in azoospermic human males by WGBS, RNA-Seq, and transcriptome profiling analyses. J Assist Reprod Genet 2020; 37:789-802. [PMID: 32056059 DOI: 10.1007/s10815-020-01716-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 02/06/2020] [Indexed: 02/02/2023] Open
Abstract
PURPOSE To determine associations between genomic DNA methylation in testicular cells and azoospermia in human males. METHODS This was a case-control study investigating the differences and conservations in DNA methylation, genome-wide DNA methylation, and bulk RNA-Seq for transcriptome profiling using testicular biopsy tissues from NOA and OA patients. Differential methylation and different conserved methylation regions associated with azoospermia were identified by comparing genomic DNA methylation of testicular seminiferous cells derived from NOA and OA patients. RESULTS The genome methylation modification of testicular cells from NOA patients was disordered, and the reproductive-related gene expression was significantly different. CONCLUSION Our findings not only provide valuable knowledge of human spermatogenesis but also paved the way for the identification of genes/proteins involved in male germ cell development. The approach presented in this report provides a powerful tool to identify responsible biomolecules, and/or cellular changes (e.g., epigenetic abnormality) that induce male reproductive dysfunction such as OA and NOA.
Collapse
Affiliation(s)
- Xiaolong Wu
- Medical School, Institute of Reproductive Medicine, Nantong University, Nantong, 226001, Jiangsu, China
| | - Chunhai Luo
- Medical School, Institute of Reproductive Medicine, Nantong University, Nantong, 226001, Jiangsu, China
| | - Longfei Hu
- Singleron Biotechnologies Ltd., 211 Pubin Road, Nanjing, Jiangsu, People's Republic of China
| | - Xue Chen
- Medical School, Institute of Reproductive Medicine, Nantong University, Nantong, 226001, Jiangsu, China
| | - Yunmei Chen
- Singleron Biotechnologies Ltd., 211 Pubin Road, Nanjing, Jiangsu, People's Republic of China
| | - Jue Fan
- Singleron Biotechnologies Ltd., 211 Pubin Road, Nanjing, Jiangsu, People's Republic of China
| | - C Yan Cheng
- The Mary M. Wohlford Laboratory for Male Contraceptive Research, Center for Biomedical Research, Population Council, 1230 York Ave, New York, 10065, USA.
| | - Fei Sun
- Medical School, Institute of Reproductive Medicine, Nantong University, Nantong, 226001, Jiangsu, China.
| |
Collapse
|
6
|
[Virus-host coevolution: Endogenous RNA viral elements as pseudogenes]. Uirusu 2020; 70:49-56. [PMID: 33967113 DOI: 10.2222/jsv.70.49] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
RNA viruses do not need to take the form of DNAs, and RNAs alone complete their replication cycles. On the other hand, since the 1970s, it has been known that DNA fragments derived from RNA viruses can be detected in RNA virus-infected cells. Furthermore, in this decade, it has become clear that the eukaryotic genomes contain genetic sequences derived from non-retroviral RNA viruses. The DNA sequences derived from these RNA viruses are thought to be generatedby using a transposable mechanism of retrotransposon, such as LINE-1. Many endogenous RNA viral sequences are formed by the same mechanism as processed pseudogenes in eukaryotic cells, but the significance of the production of RNA viral "pseudogenes " in infected cells has not been elucidated. We have discovered endogenous bornavirus-like elements (EBLs), which derived from a negative-sense, single-stranded RNA virus, Bornaviruses, and have studied the evolution and function of EBLs in host animals. The analysis of EBLs provides us a clue to unravel the history of host-RNA virus coexistence. In this review, I overview about the function of endogenous RNA virus sequences, especially EBLs in mammalian genomes, and discuss the significance of endogenization of RNA viruses as viral pseudogenes in evolution.
Collapse
|
7
|
Jedlicka P, Lexa M, Vanat I, Hobza R, Kejnovsky E. Nested plant LTR retrotransposons target specific regions of other elements, while all LTR retrotransposons often target palindromes and nucleosome-occupied regions: in silico study. Mob DNA 2019; 10:50. [PMID: 31871489 PMCID: PMC6911290 DOI: 10.1186/s13100-019-0186-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 10/31/2019] [Indexed: 01/08/2023] Open
Abstract
Background Nesting is common in LTR retrotransposons, especially in large genomes containing a high number of elements. Results We analyzed 12 plant genomes and obtained 1491 pairs of nested and original (pre-existing) LTR retrotransposons. We systematically analyzed mutual nesting of individual LTR retrotransposons and found that certain families, more often belonging to the Ty3/gypsy than Ty1/copia superfamilies, showed a higher nesting frequency as well as a higher preference for older copies of the same family ("autoinsertions"). Nested LTR retrotransposons were preferentially located in the 3'UTR of other LTR retrotransposons, while coding and regulatory regions (LTRs) are not commonly targeted. Insertions displayed a weak preference for palindromes and were associated with a strong positional pattern of higher predicted nucleosome occupancy. Deviation from randomness in target site choice was also found in 13,983 non-nested plant LTR retrotransposons. Conclusions We reveal that nesting of LTR retrotransposons is not random. Integration is correlated with sequence composition, secondary structure and the chromatin environment. Insertion into retrotransposon positions with a low negative impact on family fitness supports the concept of the genome being viewed as an ecosystem of various elements.
Collapse
Affiliation(s)
- Pavel Jedlicka
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200 Brno, Czech Republic
| | - Matej Lexa
- 2Faculty of Informatics, Masaryk University, Botanicka 68a, 60200 Brno, Czech Republic
| | - Ivan Vanat
- 2Faculty of Informatics, Masaryk University, Botanicka 68a, 60200 Brno, Czech Republic
| | - Roman Hobza
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200 Brno, Czech Republic
| | - Eduard Kejnovsky
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200 Brno, Czech Republic
| |
Collapse
|
8
|
Damert A. Phylogenomic analysis reveals splicing as a mechanism of parallel evolution of non-canonical SVAs in hominine primates. Mob DNA 2018; 9:30. [PMID: 30237828 PMCID: PMC6139936 DOI: 10.1186/s13100-018-0135-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Accepted: 09/03/2018] [Indexed: 02/07/2023] Open
Abstract
SVA (SINE-R-VNTR-Alu) elements are non-autonomous non-LTR (Long Terminal Repeat) retrotransposons. They are found in all hominoid primates but did not amplify to appreciable numbers in gibbons. Recently, phylogenetic networks of hominid (orangutan, gorilla, chimpanzee, human) SVA elements based on comparison of overall sequence identity have been reported. Here I present a detailed phylogeny of SVA_D elements in gorilla, chimpanzee and humans based on sorting of co-segregating substitutions. Complementary comparative genomics analysis revealed that the majority (1763 out of 1826-97%) of SVA_D elements in gorilla represent species-specific insertions - indicating very low activity of the subfamily before the gorilla/chimpanzee-human split. The origin of the human-specific subfamily SVA_F could be traced back to a source element in the hominine common ancestor. The major expanding lineage-specific subfamilies were found to differ between chimpanzee and humans. Precursors of the dominant chimpanzee SVA_D subfamily are present in humans; however, they did not expand to appreciable levels. The analysis also uncovered that one of the chimpanzee-specific subfamilies was formed by splicing of the STK40 first exon to the SVA Alu-like region. Many of the 94 subfamily members contain additional 5' transductions - among them exons of 8 different other genes. Striking similarities to the MAST2-containing human SVA_F1 suggest parallel evolution of non-canonical SVAs in chimpanzees and humans.
Collapse
Affiliation(s)
- Annette Damert
- Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
| |
Collapse
|
9
|
Banuelos M, Sindi S. Modeling transposable element dynamics with fragmentation equations. Math Biosci 2018; 302:46-66. [DOI: 10.1016/j.mbs.2018.05.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 04/02/2018] [Accepted: 05/11/2018] [Indexed: 12/16/2022]
|
10
|
Faulkner GJ, Billon V. L1 retrotransposition in the soma: a field jumping ahead. Mob DNA 2018; 9:22. [PMID: 30002735 PMCID: PMC6035798 DOI: 10.1186/s13100-018-0128-1] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Accepted: 06/27/2018] [Indexed: 12/13/2022] Open
Abstract
Retrotransposons are transposable elements (TEs) capable of "jumping" in germ, embryonic and tumor cells and, as is now clearly established, in the neuronal lineage. Mosaic TE insertions form part of a broader landscape of somatic genome variation and hold significant potential to generate phenotypic diversity, in the brain and elsewhere. At present, the LINE-1 (L1) retrotransposon family appears to be the most active autonomous TE in most mammals, based on experimental data obtained from disease-causing L1 mutations, engineered L1 reporter systems tested in cultured cells and transgenic rodents, and single-cell genomic analyses. However, the biological consequences of almost all somatic L1 insertions identified thus far remain unknown. In this review, we briefly summarize the current state-of-the-art in the field, including estimates of L1 retrotransposition rate in neurons. We bring forward the hypothesis that an extensive subset of retrotransposition-competent L1s may be de-repressed and mobile in the soma but largely inactive in the germline. We discuss recent reports of non-canonical L1-associated sequence variants in the brain and propose that the elevated L1 DNA content reported in several neurological disorders may predominantly comprise accumulated, unintegrated L1 nucleic acids, rather than somatic L1 insertions. Finally, we consider the main objectives and obstacles going forward in elucidating the biological impact of somatic retrotransposition.
Collapse
Affiliation(s)
- Geoffrey J. Faulkner
- Mater Research Institute – University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
- School of Biomedical Sciences, University of Queensland, Brisbane, QLD 4072 Australia
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072 Australia
| | - Victor Billon
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072 Australia
- Biology Department, École Normale Supérieure Paris-Saclay, 61 Avenue du Président Wilson, 94230 Cachan, France
| |
Collapse
|
11
|
Lavi E, Carmel L. Alu exaptation enriches the human transcriptome by introducing new gene ends. RNA Biol 2018; 15:715-725. [PMID: 29493382 DOI: 10.1080/15476286.2018.1429880] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
In mammals, transposable elements are largely silenced, but under fortuitous circumstances may be co-opted to play a functional role. Here, we show that when Alu elements are inserted within or nearby genes in sense orientation, they may contribute to the transcriptome diversity by forming new cleavage and polyadenylation sites. We mapped these new gene ends in human onto the Alu sequence and identified three hotspots of cleavage and polyadenylation site formation. Interestingly, the native Alu sequence does not contain any canonical polyadenylation signal. We therefore studied what evolutionary processes might explain the formation of these specific hotspots of novel gene ends. We show that two of the three hotspots might have emerged from mutational processes that turned sequences that resemble polyadenylation signals into full-blown canonical signals, whereas one hotspot is tightly linked to the process of Alu insertion into the genome. Overall, Alu elements may lie behind the formation of 302 new gene end variants, affecting a total of 243 genes. Intergenic Alu elements may elongate genes by creating a downstream cleavage site, intronic Alu elements may lead to gene variants which code for truncated proteins, and 3'UTR Alu elements may result in gene variants with alternative 3'UTR.
Collapse
Affiliation(s)
- Eitan Lavi
- a Department of Genetics , The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem , Jerusalem , Israel
| | - Liran Carmel
- a Department of Genetics , The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem , Jerusalem , Israel
| |
Collapse
|
12
|
Marnetto D, Mantica F, Molineris I, Grassi E, Pesando I, Provero P. Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion. Am J Hum Genet 2018; 102:207-218. [PMID: 29357977 DOI: 10.1016/j.ajhg.2017.12.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 12/15/2017] [Indexed: 01/09/2023] Open
Abstract
Genome expansion is believed to be an important driver of the evolution of gene regulation. To investigate the role of a newly arising sequence in rewiring regulatory networks, we estimated the age of each region of the human genome by applying maximum parsimony to genome-wide alignments with 100 vertebrates. We then studied the age distribution of several types of functional regions, with a focus on regulatory elements. The age distribution of regulatory elements reveals the extensive use of newly formed genomic sequence in the evolution of regulatory interactions. Many transcription factors have expanded their repertoire of targets through waves of genomic expansions that can be traced to specific evolutionary times. Repeated elements contributed a major part of such expansion: many classes of such elements are enriched in binding sites of one or a few specific transcription factors, whose binding sites are localized in specific portions of the element and characterized by distinctive motif words. These features suggest that the binding sites were available as soon as the new sequence entered the genome, rather than being created later by accumulation of point mutations. By comparing the age of regulatory regions to the evolutionary shift in expression of nearby genes, we show that rewiring through genome expansion played an important role in shaping human regulatory networks.
Collapse
|
13
|
Ou S, Jiang N. LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons. PLANT PHYSIOLOGY 2018; 176:1410-1422. [PMID: 29233850 PMCID: PMC5813529 DOI: 10.1104/pp.17.01310] [Citation(s) in RCA: 598] [Impact Index Per Article: 99.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Accepted: 12/10/2017] [Indexed: 05/18/2023]
Abstract
Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTR_retriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTR_retriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTR_retriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4.5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5'-TG…CA-3' termini, LTR_retriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs are Copia elements, with which the LTR is four times shorter than that of other Copia elements, which may be a result of their target specificity. Strikingly, non-TGCA Copia elements are often located in genic regions and preferentially insert nearby or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
14
|
Jacob-Hirsch J, Eyal E, Knisbacher BA, Roth J, Cesarkas K, Dor C, Farage-Barhom S, Kunik V, Simon AJ, Gal M, Yalon M, Moshitch-Moshkovitz S, Tearle R, Constantini S, Levanon EY, Amariglio N, Rechavi G. Whole-genome sequencing reveals principles of brain retrotransposition in neurodevelopmental disorders. Cell Res 2018; 28:187-203. [PMID: 29327725 DOI: 10.1038/cr.2018.8] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 11/10/2017] [Accepted: 11/20/2017] [Indexed: 02/07/2023] Open
Abstract
Neural progenitor cells undergo somatic retrotransposition events, mainly involving L1 elements, which can be potentially deleterious. Here, we analyze the whole genomes of 20 brain samples and 80 non-brain samples, and characterized the retrotransposition landscape of patients affected by a variety of neurodevelopmental disorders including Rett syndrome, tuberous sclerosis, ataxia-telangiectasia and autism. We report that the number of retrotranspositions in brain tissues is higher than that observed in non-brain samples and even higher in pathologic vs normal brains. The majority of somatic brain retrotransposons integrate into pre-existing repetitive elements, preferentially A/T rich L1 sequences, resulting in nested insertions. Our findings document the fingerprints of encoded endonuclease independent mechanisms in the majority of L1 brain insertion events. The insertions are "non-classical" in that they are truncated at both ends, integrate in the same orientation as the host element, and their target sequences are enriched with a CCATT motif in contrast to the classical endonuclease motif of most other retrotranspositions. We show that L1Hs elements integrate preferentially into genes associated with neural functions and diseases. We propose that pre-existing retrotransposons act as "lightning rods" for novel insertions, which may give fine modulation of gene expression while safeguarding from deleterious events. Overwhelmingly uncontrolled retrotransposition may breach this safeguard mechanism and increase the risk of harmful mutagenesis in neurodevelopmental disorders.
Collapse
Affiliation(s)
- Jasmine Jacob-Hirsch
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel.,Mina and Everard Goodman Faculty of Life Sciences, Bar Ilan University, Israel
| | - Eran Eyal
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | | | - Jonathan Roth
- Department of Pediatric Neurosurgery, Dana Children's Hospital, Tel Aviv Medical Center, Israel.,Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Karen Cesarkas
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Chen Dor
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Sarit Farage-Barhom
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Vered Kunik
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Amos J Simon
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Moran Gal
- Mina and Everard Goodman Faculty of Life Sciences, Bar Ilan University, Israel
| | - Michal Yalon
- Department of Pediatric Hematology-Oncology, Edmond and Lily Safra Children's Hospital, The Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Sharon Moshitch-Moshkovitz
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel
| | - Rick Tearle
- Complete Genomics, 2071 Stierlin Court, Mountain View, CA 94043, USA
| | - Shlomi Constantini
- Department of Pediatric Neurosurgery, Dana Children's Hospital, Tel Aviv Medical Center, Israel.,Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Erez Y Levanon
- Mina and Everard Goodman Faculty of Life Sciences, Bar Ilan University, Israel
| | - Ninette Amariglio
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel.,Mina and Everard Goodman Faculty of Life Sciences, Bar Ilan University, Israel
| | - Gideon Rechavi
- Cancer Research Center and the Wohl Institute of Translational Medicine, the Chaim Sheba Medical Center, Tel Hashomer, Israel.,Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
15
|
Farré M, Robinson TJ, Ruiz-Herrera A. An Integrative Breakage Model of genome architecture, reshuffling and evolution: The Integrative Breakage Model of genome evolution, a novel multidisciplinary hypothesis for the study of genome plasticity. Bioessays 2015; 37:479-88. [PMID: 25739389 DOI: 10.1002/bies.201400174] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2014] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 12/23/2022]
Abstract
Our understanding of genomic reorganization, the mechanics of genomic transmission to offspring during germ line formation, and how these structural changes contribute to the speciation process, and genetic disease is far from complete. Earlier attempts to understand the mechanism(s) and constraints that govern genome remodeling suffered from being too narrowly focused, and failed to provide a unified and encompassing view of how genomes are organized and regulated inside cells. Here, we propose a new multidisciplinary Integrative Breakage Model for the study of genome evolution. The analysis of the high-level structural organization of genomes (nucleome), together with the functional constrains that accompany genome reshuffling, provide insights into the origin and plasticity of genome organization that may assist with the detection and isolation of therapeutic targets for the treatment of complex human disorders.
Collapse
Affiliation(s)
- Marta Farré
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Campus UAB, Barcelona, Spain
| | | | | |
Collapse
|
16
|
An Z, Tang Z, Ma B, Mason AS, Guo Y, Yin J, Gao C, Wei L, Li J, Fu D. Transposon variation by order during allopolyploidisation between Brassica oleracea and Brassica rapa. PLANT BIOLOGY (STUTTGART, GERMANY) 2014; 16:825-35. [PMID: 24176077 DOI: 10.1111/plb.12121] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 09/23/2013] [Indexed: 05/02/2023]
Abstract
Although many studies have shown that transposable element (TE) activation is induced by hybridisation and polyploidisation in plants, much less is known on how different types of TE respond to hybridisation, and the impact of TE-associated sequences on gene function. We investigated the frequency and regularity of putative transposon activation for different types of TE, and determined the impact of TE-associated sequence variation on the genome during allopolyploidisation. We designed different types of TE primers and adopted the Inter-Retrotransposon Amplified Polymorphism (IRAP) method to detect variation in TE-associated sequences during the process of allopolyploidisation between Brassica rapa (AA) and Brassica oleracea (CC), and in successive generations of self-pollinated progeny. In addition, fragments with TE insertions were used to perform Blast2GO analysis to characterise the putative functions of the fragments with TE insertions. Ninety-two primers amplifying 548 loci were used to detect variation in sequences associated with four different orders of TE sequences. TEs could be classed in ascending frequency into LTR-REs, TIRs, LINEs, SINEs and unknown TEs. The frequency of novel variation (putative activation) detected for the four orders of TEs was highest from the F1 to F2 generations, and lowest from the F2 to F3 generations. Functional annotation of sequences with TE insertions showed that genes with TE insertions were mainly involved in metabolic processes and binding, and preferentially functioned in organelles. TE variation in our study severely disturbed the genetic compositions of the different generations, resulting in inconsistencies in genetic clustering. Different types of TE showed different patterns of variation during the process of allopolyploidisation.
Collapse
Affiliation(s)
- Z An
- Engineering Research Center of South Upland Agriculture of Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing, China; Crop Research Institute, Gansu Academy of Agricultural Sciences, Lanzhou, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Campos-Sánchez R, Kapusta A, Feschotte C, Chiaromonte F, Makova KD. Genomic landscape of human, bat, and ex vivo DNA transposon integrations. Mol Biol Evol 2014; 31:1816-32. [PMID: 24809961 DOI: 10.1093/molbev/msu138] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The integration and fixation preferences of DNA transposons, one of the major classes of eukaryotic transposable elements, have never been evaluated comprehensively on a genome-wide scale. Here, we present a detailed study of the distribution of DNA transposons in the human and bat genomes. We studied three groups of DNA transposons that integrated at different evolutionary times: 1) ancient (>40 My) and currently inactive human elements, 2) younger (<40 My) bat elements, and 3) ex vivo integrations of piggyBat and Sleeping Beauty elements in HeLa cells. Although the distribution of ex vivo elements reflected integration preferences, the distribution of human and (to a lesser extent) bat elements was also affected by selection. We used regression techniques (linear, negative binomial, and logistic regression models with multiple predictors) applied to 20-kb and 1-Mb windows to investigate how the genomic landscape in the vicinity of DNA transposons contributes to their integration and fixation. Our models indicate that genomic landscape explains 16-79% of variability in DNA transposon genome-wide distribution. Importantly, we not only confirmed previously identified predictors (e.g., DNA conformation and recombination hotspots) but also identified several novel predictors (e.g., signatures of double-strand breaks and telomere hexamer). Ex vivo integrations showed a bias toward actively transcribed regions. Older DNA transposons were located in genomic regions scarce in most conserved elements-likely reflecting purifying selection. Our study highlights how DNA transposons are integral to the evolution of bat and human genomes, and has implications for the development of DNA transposon assays for gene therapy and mutagenesis applications.
Collapse
Affiliation(s)
- Rebeca Campos-Sánchez
- Genetics Program, The Huck Institutes of the Life Sciences, Penn State University, University Park, PA
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
| | - Francesca Chiaromonte
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PADepartment of Statistics, Penn State University, University Park, PA
| | - Kateryna D Makova
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PADepartment of Biology, Penn State University, University Park, PA
| |
Collapse
|
18
|
David M, Mustafa H, Brudno M. Detecting Alu insertions from high-throughput sequencing data. Nucleic Acids Res 2013; 41:e169. [PMID: 23921633 PMCID: PMC3783187 DOI: 10.1093/nar/gkt612] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing technologies have allowed for the cataloguing of variation in personal human genomes. In this manuscript, we present alu-detect, a tool that combines read-pair and split-read information to detect novel Alus and their precise breakpoints directly from either whole-genome or whole-exome sequencing data while also identifying insertions directly in the vicinity of existing Alus. To set the parameters of our method, we use simulation of a faux reference, which allows us to compute the precision and recall of various parameter settings using real sequencing data. Applying our method to 100 bp paired Illumina data from seven individuals, including two trios, we detected on average 1519 novel Alus per sample. Based on the faux-reference simulation, we estimate that our method has 97% precision and 85% recall. We identify 808 novel Alus not previously described in other studies. We also demonstrate the use of alu-detect to study the local sequence and global location preferences for novel Alu insertions.
Collapse
Affiliation(s)
- Matei David
- Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, ON M5S 3G4, Canada and Centre for Computational Medicine, Genetics and Genome Biology Program, The Hospital for Sick Children, 555 University Avenue, Toronto, ON M5G 1X8, Canada
| | | | | |
Collapse
|
19
|
Gao C, Xiao M, Ren X, Hayward A, Yin J, Wu L, Fu D, Li J. Characterization and functional annotation of nested transposable elements in eukaryotic genomes. Genomics 2012; 100:222-30. [PMID: 22800764 DOI: 10.1016/j.ygeno.2012.07.004] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2011] [Revised: 06/26/2012] [Accepted: 07/03/2012] [Indexed: 11/18/2022]
Abstract
The movement of transposable elements (TE) in eukaryotic genomes can often result in the occurrence of nested TEs (the insertion of TEs into pre-existing TEs). We performed a general TE assessment using available databases to detect nested TEs and analyze their characteristics and putative functions in eukaryote genomes. A total of 802 TEs were found to be inserted into 690 host TEs from a total number of 11,329 TEs. We reveal that repetitive sequences are associated with an increased occurrence of nested TEs and sequence biased of TE insertion. A high proportion of the genes which were associated with nested TEs are predicted to localize to organelles and participate in nucleic acid and protein binding. Many of these function in metabolic processes, and encode important enzymes for transposition and integration. Therefore, nested TEs in eukaryotic genomes may negatively influence genome expansion, and enrich the diversity of gene expression or regulation.
Collapse
Affiliation(s)
- Caihua Gao
- Engineering Research Center of South Upland Agriculture, Ministry of Education, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Characterization of transcriptional activation and inserted-into-gene preference of various transposable elements in the Brassica species. Mol Biol Rep 2012; 39:7513-23. [DOI: 10.1007/s11033-012-1585-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2011] [Accepted: 01/30/2012] [Indexed: 12/24/2022]
|
21
|
Linheiro RS, Bergman CM. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster. PLoS One 2012; 7:e30008. [PMID: 22347367 PMCID: PMC3276498 DOI: 10.1371/journal.pone.0030008] [Citation(s) in RCA: 99] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2011] [Accepted: 12/11/2011] [Indexed: 12/20/2022] Open
Abstract
Transposable elements are mobile DNA sequences that integrate into host genomes using diverse mechanisms with varying degrees of target site specificity. While the target site preferences of some engineered transposable elements are well studied, the natural target preferences of most transposable elements are poorly characterized. Using population genomic resequencing data from 166 strains of Drosophila melanogaster, we identified over 8,000 new insertion sites not present in the reference genome sequence that we used to decode the natural target preferences of 22 families of transposable element in this species. We found that terminal inverted repeat transposon and long terminal repeat retrotransposon families present clade-specific target site duplications and target site sequence motifs. Additionally, we found that the sequence motifs at transposable element target sites are always palindromes that extend beyond the target site duplication. Our results demonstrate the utility of population genomics data for high-throughput inference of transposable element targeting preferences in the wild and establish general rules for terminal inverted repeat transposon and long terminal repeat retrotransposon target site selection in eukaryotic genomes.
Collapse
Affiliation(s)
- Raquel S. Linheiro
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Casey M. Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
22
|
Cook GW, Konkel MK, Major JD, Walker JA, Han K, Batzer MA. Alu pair exclusions in the human genome. Mob DNA 2011; 2:10. [PMID: 21943335 PMCID: PMC3215922 DOI: 10.1186/1759-8753-2-10] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Accepted: 09/23/2011] [Indexed: 12/16/2022] Open
Abstract
Background The human genome contains approximately one million Alu elements which comprise more than 10% of human DNA by mass. Alu elements possess direction, and are distributed almost equally in positive and negative strand orientations throughout the genome. Previously, it has been shown that closely spaced Alu pairs in opposing orientation (inverted pairs) are found less frequently than Alu pairs having the same orientation (direct pairs). However, this imbalance has only been investigated for Alu pairs separated by 650 or fewer base pairs (bp) in a study conducted prior to the completion of the draft human genome sequence. Results We performed a comprehensive analysis of all (> 800,000) full-length Alu elements in the human genome. This large sample size permits detection of small differences in the ratio between inverted and direct Alu pairs (I:D). We have discovered a significant depression in the full-length Alu pair I:D ratio that extends to repeat pairs separated by ≤ 350,000 bp. Within this imbalance bubble (those Alu pairs separated by ≤ 350,000 bp), direct pairs outnumber inverted pairs. Using PCR, we experimentally verified several examples of inverted Alu pair exclusions that were caused by deletions. Conclusions Over 50 million full-length Alu pairs reside within the I:D imbalance bubble. Their collective impact may represent one source of Alu element-related human genomic instability that has not been previously characterized.
Collapse
Affiliation(s)
- George W Cook
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, 70803, USA.
| | | | | | | | | | | |
Collapse
|
23
|
Abstract
Transposons are found in virtually all organisms and play fundamental roles in genome evolution. They can also acquire new functions in the host organism and some have been developed as incisive genetic tools for transformation and mutagenesis. The hAT transposon superfamily contains members from the plant and animal kingdoms, some of which are active when introduced into new host organisms. We have identified two new active hAT transposons, AeBuster1, from the mosquito Aedes aegypti and TcBuster from the red flour beetle Tribolium castaneum. Activity of both transposons is illustrated by excision and transposition assays performed in Drosophila melanogaster and Ae. aegypti and by in vitro strand transfer assays. These two active insect transposons are more closely related to the Buster sequences identified in humans than they are to the previously identified active hAT transposons, Ac, Tam3, Tol2, hobo, and Hermes. We therefore reexamined the structural and functional relationships of hAT and hAT-like transposase sequences extracted from genome databases and found that the hAT superfamily is divided into at least two families. This division is supported by a difference in target-site selections generated by active transposons of each family. We name these families the Ac and Buster families after the first identified transposon or transposon-like sequence in each. We find that the recently discovered SPIN transposons of mammals are located within the family of Buster elements.
Collapse
|
24
|
Churakov G, Grundmann N, Kuritzin A, Brosius J, Makałowski W, Schmitz J. A novel web-based TinT application and the chronology of the Primate Alu retroposon activity. BMC Evol Biol 2010; 10:376. [PMID: 21126360 PMCID: PMC3014933 DOI: 10.1186/1471-2148-10-376] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2010] [Accepted: 12/02/2010] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND DNA sequences afford access to the evolutionary pathways of life. Particularly mobile elements that constantly co-evolve in genomes encrypt recent and ancient information of their host's history. In mammals there is an extraordinarily abundant activity of mobile elements that occurs in a dynamic succession of active families, subfamilies, types, and subtypes of retroposed elements. The high frequency of retroposons in mammals implies that, by chance, such elements also insert into each other. While inactive elements are no longer able to retropose, active elements retropose by chance into other active and inactive elements. Thousands of such directional, element-in-element insertions are found in present-day genomes. To help analyze these events, we developed a computational algorithm (Transpositions in Transpositions, or TinT) that examines the different frequencies of nested transpositions and reconstructs the chronological order of retroposon activities. RESULTS By examining the different frequencies of such nested transpositions, the TinT application reconstructs the chronological order of retroposon activities. We use such activity patterns as a comparative tool to (1) delineate the historical rise and fall of retroposons and their relations to each other, (2) understand the retroposon-induced complexity of recent genomes, and (3) find selective informative homoplasy-free markers of phylogeny. The efficiency of the new application is demonstrated by applying it to dimeric Alu Short INterspersed Elements (SINE) to derive a complete chronology of such elements in primates. CONCLUSION The user-friendly, web-based TinT interface presented here affords an easy, automated screening for nested transpositions from genome assemblies or trace data, assembles them in a frequency-matrix, and schematically displays their chronological activity history.
Collapse
Affiliation(s)
- Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Norbert Grundmann
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Andrej Kuritzin
- Department of Physics and Mathematics, Saint Petersburg State Institute of Technology, 26 Moskovsky av., St.-Petersburg 198013, Russia
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Wojciech Makałowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| |
Collapse
|
25
|
The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates. Genome Biol 2010; 11:R59. [PMID: 20525173 PMCID: PMC2911107 DOI: 10.1186/gb-2010-11-6-r59] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Revised: 04/27/2010] [Accepted: 06/02/2010] [Indexed: 01/29/2023] Open
Abstract
Background Transposable elements (TEs) have played an important role in the diversification and enrichment of mammalian transcriptomes through various mechanisms such as exonization and intronization (the birth of new exons/introns from previously intronic/exonic sequences, respectively), and insertion into first and last exons. However, no extensive analysis has compared the effects of TEs on the transcriptomes of mammals, non-mammalian vertebrates and invertebrates. Results We analyzed the influence of TEs on the transcriptomes of five species, three invertebrates and two non-mammalian vertebrates. Compared to previously analyzed mammals, there were lower levels of TE introduction into introns, significantly lower numbers of exonizations originating from TEs and a lower percentage of TE insertion within the first and last exons. Although the transcriptomes of vertebrates exhibit significant levels of exonization of TEs, only anecdotal cases were found in invertebrates. In vertebrates, as in mammals, the exonized TEs are mostly alternatively spliced, indicating that selective pressure maintains the original mRNA product generated from such genes. Conclusions Exonization of TEs is widespread in mammals, less so in non-mammalian vertebrates, and very low in invertebrates. We assume that the exonization process depends on the length of introns. Vertebrates, unlike invertebrates, are characterized by long introns and short internal exons. Our results suggest that there is a direct link between the length of introns and exonization of TEs and that this process became more prevalent following the appearance of mammals.
Collapse
|