1
|
Frith MC. Paleozoic Protein Fossils Illuminate the Evolution of Vertebrate Genomes and Transposable Elements. Mol Biol Evol 2022; 39:6555113. [PMID: 35348724 PMCID: PMC9004415 DOI: 10.1093/molbev/msac068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genomes hold a treasure trove of protein fossils: fragments of formerly protein-coding DNA, which mainly come from transposable elements (TEs) or host genes. These fossils reveal ancient evolution of TEs and genomes, and many fossils have been exapted to perform diverse functions important for the host's fitness. However, old and highly-degraded fossils are hard to identify, standard methods (e.g. BLAST) are not optimized for this task, and few Paleozoic protein fossils have been found. Here, a recently optimized method is used to find protein fossils in vertebrate genomes. It finds Paleozoic fossils predating the amphibian/amniote divergence from most major TE categories, including virus-related Polinton and Gypsy elements. It finds 10 fossils in the human genome (8 from TEs and 2 from host genes) that predate the last common ancestor of all jawed vertebrates, probably from the Ordovician period. It also finds types of transposon and retrotransposon not found in human before. These fossils have extreme sequence conservation, indicating exaptation: some have evidence of gene-regulatory function, and they tend to lienearest to developmental genes. Some ancient fossils suggest "genome tectonics", where two fragments of one TE have drifted apart by up to megabases, possibly explaining gene deserts and large introns. This paints a picture of great TE diversity in our aquatic ancestors, with patchy TE inheritance by later vertebrates, producing new genes and regulatory elements on the way. Host-gene fossils too have contributed anciently-conserved DNA segments. This paves the way to further studies of ancient protein fossils.
Collapse
Affiliation(s)
- Martin C Frith
- Artificial Intelligence Research Center, AIST, Tokyo, Japan.,Graduate School of Frontier Sciences, University of Tokyo, Chiba, Japan.,Computational Bio Big-Data Open Innovation Laboratory, AIST, Tokyo, Japan
| |
Collapse
|
2
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
3
|
Seibt KM, Schmidt T, Heitkam T. The conserved 3' Angio-domain defines a superfamily of short interspersed nuclear elements (SINEs) in higher plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:681-699. [PMID: 31610059 DOI: 10.1111/tpj.14567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 09/13/2019] [Accepted: 09/17/2019] [Indexed: 06/10/2023]
Abstract
Repetitive sequences are ubiquitous components of eukaryotic genomes affecting genome size and evolution as well as gene regulation. Among them, short interspersed nuclear elements (SINEs) are non-coding retrotransposons usually shorter than 1000 bp. They contain only few short conserved structural motifs, in particular an internal promoter derived from cellular RNAs and a mostly AT-rich 3' tail, whereas the remaining regions are highly variable. SINEs emerge and vanish during evolution, and often diversify into numerous families and subfamilies that are usually specific for only a limited number of species. In contrast, at the 3' end of multiple plant SINEs we detected the highly conserved 'Angio-domain'. This 37 bp segment defines the Angio-SINE superfamily, which encompasses 24 plant SINE families widely distributed across 13 orders within the plant kingdom. We retrieved 28 433 full-length Angio-SINE copies from genome assemblies of 46 plant species, frequently located in genes. Compensatory mutations in and adjacent to the Angio-domain imply selective restraints maintaining its RNA structure. Angio-SINE families share segmental sequence similarities, indicating a modular evolution with strong Angio-domain preservation. We suggest that the conserved domain contributes to the evolutionary success of Angio-SINEs through either structural interactions between SINE RNA and proteins increasing their transpositional efficiency, or by enhancing their accumulation in genes.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Thomas Schmidt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| |
Collapse
|
4
|
Nishiyama E, Ohshima K. Cross-Kingdom Commonality of a Novel Insertion Signature of RTE-Related Short Retroposons. Genome Biol Evol 2018; 10:1471-1483. [PMID: 29850801 PMCID: PMC6007223 DOI: 10.1093/gbe/evy098] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2018] [Indexed: 12/15/2022] Open
Abstract
In multicellular organisms, such as vertebrates and flowering plants, horizontal transfer (HT) of genetic information is thought to be a rare event. However, recent findings unveiled unexpectedly frequent HT of RTE-clade LINEs. To elucidate the molecular footprints of the genomic integration machinery of RTE-related retroposons, the sequence patterns surrounding the insertion sites of plant Au-like SINE families were analyzed in the genomes of a wide variety of flowering plants. A novel and remarkable finding regarding target site duplications (TSDs) for SINEs was they start with thymine approximately one helical pitch (ten nucleotides) downstream of a thymine stretch. This TSD pattern was found in RTE-clade LINEs, which share the 3'-end sequence of these SINEs, in the genome of leguminous plants. These results demonstrably show that Au-like SINEs were mobilized by the enzymatic machinery of RTE-clade LINEs. Further, we discovered the same TSD pattern in animal SINEs from lizard and mammals, in which the RTE-clade LINEs sharing the 3'-end sequence with these animal SINEs showed a distinct TSD pattern. Moreover, a significant correlation was observed between the first nucleotide of TSDs and microsatellite-like sequences found at the 3'-ends of SINEs and LINEs. We propose that RTE-encoded protein could preferentially bind to a DNA region that contains a thymine stretch to cleave a phosphodiester bond downstream of the stretch. Further, determination of cleavage sites and/or efficiency of primer sites for reverse transcription may depend on microsatellite-like repeats in the RNA template. Such a unique mechanism may have enabled retroposons to successfully expand in frontier genomes after HT.
Collapse
Affiliation(s)
- Eri Nishiyama
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| | - Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| |
Collapse
|
5
|
Mao H, Wang H. Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs). Genome Biol Evol 2018; 9:2048-2056. [PMID: 28903462 PMCID: PMC5585668 DOI: 10.1093/gbe/evx145] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2017] [Indexed: 02/06/2023] Open
Abstract
Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family.
Collapse
Affiliation(s)
- Hongliang Mao
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China
| | - Hao Wang
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China.,Department of Genetics, University of Georgia
| |
Collapse
|
6
|
Šiukšta R, Vaitkūnienė V, Rančelis V. Is auxin involved in the induction of genetic instability in barley homeotic double mutants? PLANTA 2018; 247:483-498. [PMID: 29080070 DOI: 10.1007/s00425-017-2802-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Accepted: 10/24/2017] [Indexed: 06/07/2023]
Abstract
The triggers of genetic instability in barley homeotic double mutants are tweaky spike -type mutations associated with an auxin imbalance in separate spike phytomeres. Barley homeotic tweaky spike;Hooded (tw;Hd) double mutants are characterized by an inherited instability of spike and flower development, which is absent in the single parental constituents. The aim of the present study was to show that the trigger of genetic instability in the double mutants is the tw mutations, which are associated with an auxin imbalance in the developing spikes. Their pleiotropic effects on genes related to spike/flower development may cause the genetic instability of double mutants. The study of four double-mutant groups composed of different mutant alleles showed that the instability arose only if the mutant allele tw was a constituent of the double mutants. Application of auxin inhibitors and 2,4-dichlorophenoxyacetic acid (2,4-D) demonstrated the relationship of the instability of the double mutants and the phenotype of the tw mutants to auxin imbalance. 2,4-D induced phenocopies of the tw mutation in wild-type plants and rescued the phenotypes of three allelic tw mutants. The differential display (dd-PCR) method allowed the identification of several putative candidate genes in tw that may be responsible for the initiation of instability in the double mutants by pleiotropic variations of their expression in the tw mutant associated with auxin imbalance in the developing spikes. The results of the present study linked the genetic instability of homeotic double mutants with an auxin imbalance caused by one of the constituents (tw). The genetic instability of the double mutants in relation to auxin imbalance was studied for the first time. A matrocliny on instability expression was also observed.
Collapse
Affiliation(s)
- Raimondas Šiukšta
- Life Sciences Centre, Institute of Biosciences, Vilnius University, Saulėtekis Ave. 7, 10257, Vilnius, Lithuania.
- Botanical Garden of Vilnius University, Kairėnai Str. 43, 10239, Vilnius, Lithuania.
| | - Virginija Vaitkūnienė
- Life Sciences Centre, Institute of Biosciences, Vilnius University, Saulėtekis Ave. 7, 10257, Vilnius, Lithuania
- Botanical Garden of Vilnius University, Kairėnai Str. 43, 10239, Vilnius, Lithuania
| | - Vytautas Rančelis
- Life Sciences Centre, Institute of Biosciences, Vilnius University, Saulėtekis Ave. 7, 10257, Vilnius, Lithuania
| |
Collapse
|
7
|
Peccoud J, Cordaux R, Gilbert C. Analyzing Horizontal Transfer of Transposable Elements on a Large Scale: Challenges and Prospects. Bioessays 2017; 40. [DOI: 10.1002/bies.201700177] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Revised: 11/22/2017] [Indexed: 02/06/2023]
Affiliation(s)
- Jean Peccoud
- UMR CNRS 7267; Ecologie et Biologie des Interactions; Equipe Ecologie Evolution Symbiose; Université de Poitiers; 86000 Poitiers France
| | - Richard Cordaux
- UMR CNRS 7267; Ecologie et Biologie des Interactions; Equipe Ecologie Evolution Symbiose; Université de Poitiers; 86000 Poitiers France
| | - Clément Gilbert
- UMR CNRS 9191; UMR 247 IRD Laboratoire Evolution, Génomes, Comportement, Écologie; Université Paris-Sud,; 91198 Gif-sur-Yvette France
| |
Collapse
|
8
|
Kögler A, Schmidt T, Wenke T. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:676-695. [PMID: 28857316 DOI: 10.1111/tpj.13676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2017] [Revised: 08/18/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families.
Collapse
Affiliation(s)
- Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| |
Collapse
|
9
|
Yin H, Wu X, Shi D, Chen Y, Qi K, Ma Z, Zhang S. TGTT and AACA: two transcriptionally active LTR retrotransposon subfamilies with a specific LTR structure and horizontal transfer in four Rosaceae species. Mob DNA 2017; 8:14. [PMID: 29093758 PMCID: PMC5659011 DOI: 10.1186/s13100-017-0098-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 10/18/2017] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Long terminal repeat retrotransposons (LTR-RTs) are major components of plant genomes. Common LTR-RTs contain the palindromic dinucleotide 5'-'TG'-'CA'-3' motif at the ends. Thus, further analyses of non-canonical LTR-RTs with non-palindromic motifs will enhance our understanding of their structures and evolutionary history. RESULTS Here, we report two new LTR-RT subfamilies (TGTT and AACA) with atypical dinucleotide ends of 5'-'TG'-'TT'-3', and 5'-'AA'-'CA'-3' in pear, apple, peach and mei. In total, 91 intact LTR-RTs were identified and classified into four TGTT and four AACA families. A structural annotation analysis showed that the four TGTT families, together with AACA1 and AACA2, belong to the Copia-like superfamily, whereas AACA3 and AACA4 appeared to be TRIM elements. The average amplification time frames for the eight families ranged from 0.05 to 2.32 million years. Phylogenetics coupled with sequence analyses revealed that the TGTT1 elements of peach were horizontally transferred from apple. In addition, 32 elements from two TGTT and three AACA families had detectable transcriptional activation, and a qRT-PCR analysis indicated that their expression levels varied dramatically in different species, organs and stress treatments. CONCLUSIONS Two novel LTR-RT subfamilies that terminated with non-palindromic dinucleotides at the ends of their LTRs were identified in four Rosaceae species, and a deep analysis showed their recent activity, horizontal transfer and varied transcriptional levels in different species, organs and stress treatments. This work enhances our understanding of the structural variation and evolutionary history of LTR-RTs in plants and also provides a valuable resource for future investigations of LTR-RTs having specific structures in other species.
Collapse
Affiliation(s)
- Hao Yin
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Xiao Wu
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Dongqing Shi
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Yangyang Chen
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Kaijie Qi
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Zhengqiang Ma
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
- College of Agricultural Sciences, Nanjing Agricultural University, Nanjing, China
| | - Shaoling Zhang
- Center of Pear Engineering Technology Research, College of Horticulture, Nanjing Agricultural University, Nanjing, 210095 China
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|