1
Kögler A, Schmidt T, Wenke T. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:676-695. [PMID: 28857316 DOI: 10.1111/tpj.13676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 07/15/2017] [Revised: 08/18/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families.
Affiliation(s)
- Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
- Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
- Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
2
Schwichtenberg K, Wenke T, Zakrzewski F, Seibt KM, Minoche A, Dohm JC, Weisshaar B, Himmelbauer H, Schmidt T. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 85:229-44. [PMID: 26676716 DOI: 10.1111/tpj.13103] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Received: 09/02/2015] [Revised: 11/23/2015] [Accepted: 11/26/2015] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs.
Affiliation(s)
- Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
- Falk Zakrzewski
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
- Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
- André Minoche
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
- Juliane C Dohm
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
- Bernd Weisshaar
- CeBiTec & Department of Biology, University of Bielefeld, 33615, Bielefeld, Germany
- Heinz Himmelbauer
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
- Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
3
Mehra M, Gangwar I, Shankar R. A Deluge of Complex Repeats: The Solanum Genome. PLoS One 2015; 10:e0133962. [PMID: 26241045 PMCID: PMC4524691 DOI: 10.1371/journal.pone.0133962] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Received: 02/15/2015] [Accepted: 07/06/2015] [Indexed: 12/18/2022]
Abstract
Repetitive elements have lately emerged as key components of the genome, performing a variety of roles. It has now become necessary to have an account of repeats for every genome to understand its dynamics and state. Recently, the genomes of two major Solanaceae species, Solanum tuberosum and Solanum lycopersicum, were sequenced. These species are important crops with high commercial significance as well as value as model species. However, there is a considerable gap in information about repetitive elements and their possible roles in genome regulation for these species. The present study aimed at detailed identification and characterization of complex repetitive elements in these genomes, at studying their possible functional associations, and at assessing which repetitive elements may be transcriptionally active. In this study, it was found that ~50-60% of the genomes of S. tuberosum and S. lycopersicum were composed of repetitive elements. It was also found that complex repetitive elements were associated with >95% of genes in both species. These two genomes are mostly composed of LTR retrotransposons. Two novel repeat families very similar to LTR/ERV1 and LINE/RTE-BovB are reported for the first time. The activity of complex repeats was estimated by measuring their transcriptional abundance using Next Generation Sequencing read data and microarray platforms. A considerable number of regulatory components such as transcription factor binding sites and miRNAs appear to be under the influence of these complex repetitive elements in these species, while several genes appeared to possess exonized repeats.
MESH Headings
- Base Sequence
- Binding Sites
- Chromosomes, Plant/genetics
- DNA, Plant/genetics
- Evolution, Molecular
- Exons/genetics
- Gene Expression Regulation, Plant/genetics
- Genome, Plant
- Humans
- INDEL Mutation
- Solanum lycopersicum/genetics
- MicroRNAs/genetics
- Molecular Sequence Data
- Phylogeny
- Plant Proteins/metabolism
- RNA, Messenger/biosynthesis
- RNA, Messenger/genetics
- RNA, Plant/biosynthesis
- RNA, Plant/genetics
- Repetitive Sequences, Nucleic Acid
- Retroelements/genetics
- Sequence Alignment
- Solanum tuberosum/genetics
- Species Specificity
- Terminal Repeat Sequences
- Transcription Factors/metabolism
- Transcription, Genetic
Affiliation(s)
- Mrigaya Mehra
- Studio of Computational Biology & Bioinformatics, Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, 176061, HP, India
- Academy of Scientific & Innovative Research, Chennai, India
- Indu Gangwar
- Studio of Computational Biology & Bioinformatics, Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, 176061, HP, India
- Academy of Scientific & Innovative Research, Chennai, India
- Ravi Shankar
- Studio of Computational Biology & Bioinformatics, Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, 176061, HP, India
- Academy of Scientific & Innovative Research, Chennai, India
4
Wenke T, Seibt KM, Döbel T, Muders K, Schmidt T. Inter-SINE Amplified Polymorphism (ISAP) for rapid and robust plant genotyping. Methods Mol Biol 2015; 1245:183-192. [PMID: 25373758 DOI: 10.1007/978-1-4939-1966-6_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 06/04/2023]
Abstract
The unambiguous differentiation of crop genotypes is often laborious or expensive. A rapid, robust, and cost-efficient marker system is required for routine genotyping in plant breeding and marker-assisted selection. We describe the Inter-SINE Amplified Polymorphism (ISAP) system that is based on standard molecular methods resulting in genotype-specific fingerprints at high resolution. These markers are derived from Short Interspersed Nuclear Elements (SINEs) which are dispersed repetitive sequences present in most if not all plant genomes and can be efficiently extracted from plant genome sequences. The ISAP method was developed on potato as model plant but is also transferable to other plant species.
Affiliation(s)
- Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Mommsenstraße 11, Dresden, 01062, Germany
5
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
6
Shu Y, Li Y, Bai X, Cai H, Ji W, Ji Z, Guo C, Zhu Y. Identification and characterization of a new member of the SINE Au retroposon family (GmAu1) in the soybean, Glycine max (L.) Merr., genome and its potential application. PLANT CELL REPORTS 2011; 30:2207-13. [PMID: 21796384 DOI: 10.1007/s00299-011-1126-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 03/22/2011] [Revised: 07/06/2011] [Accepted: 07/12/2011] [Indexed: 05/31/2023]
Abstract
A plant short interspersed element (SINE) was identified in Glycine max after re-sequencing of soybean sequence characterized amplified region (SCAR) markers. Detailed analysis revealed that this newly recognized SINE consists of a tRNA-related region, a tRNA-unrelated region, direct flanking repeat sequences, and a short stretch of Ts at the 3'-terminal region. These features are similar to those of previously characterized SINEs. To investigate the evolution of the SINE retroposon, BLASTN was used to search the genome sequences of other plants. Because it is homologous to the retroposon Au from Aegilops umbellulata (wheat) and occurs in soybean, the SINE was named GmAu1. Genome analysis of Glycine max var. Williams 82 uncovered more than 847 copies of GmAu1 per haploid genome of soybean. Examination of the regions flanking the inserted GmAu1 sequences indicated a preference for introns over exons or other noncoding regions. Based on the sequences flanking the insertions, 146 primers were designed to detect insertion mutations by a PCR-based method. Seventy-seven primers displayed polymorphism and were used to develop corresponding GmAu1-based SCAR markers. The retroposon GmAu1 and its related SCAR markers identified in this study will prove valuable to future investigations into the genetic mapping, phylogeny, and evolution of the genus Glycine.
Affiliation(s)
- Yongjun Shu
- Key Laboratory of Molecular Cytogenetics and Genetic Breeding, College of Life Science and Technology, Harbin Normal University, Harbin, 150025, Heilongjiang, People's Republic of China
7
Wenke T, Döbel T, Sörensen TR, Junghans H, Weisshaar B, Schmidt T. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes. THE PLANT CELL 2011; 23:3117-28. [PMID: 21908723 PMCID: PMC3203444 DOI: 10.1105/tpc.111.088682] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Received: 06/30/2011] [Revised: 08/23/2011] [Accepted: 08/26/2011] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms, and identified 17,829 full-length and truncated SINEs falling into 31 families, showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5' truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3' end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family, followed by successful colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families.
Affiliation(s)
- Torsten Wenke
- Department of Biology, Dresden University of Technology, D-01062 Dresden, Germany.
8
Yagi E, Akita T, Kawahara T. A novel Au SINE sequence found in a gymnosperm. Genes Genet Syst 2011; 86:19-25. [PMID: 21498919 DOI: 10.1266/ggs.86.19] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Indexed: 11/23/2022]
Abstract
Although many SINE families have been identified in the animal kingdom, only a few SINE families have been identified in plants, and their distribution is somewhat limited. The Au SINE (Au) has been found discontinuously in basal angiosperms, monocots, and eudicots. In this study, we examined the presence of Au in gymnosperms and ferns by PCR using internal primers for Au. As a result, we found Au in a gymnosperm species, Ephedra ciliata. Therefore, Au is presumed to have been present in the common ancestor of angiosperms and gymnosperms. The Au in E. ciliata was 15 bp shorter than the consensus sequence, which is similar to the Au SINE found in Glycine. However, the 3' end of the Au found in E. ciliata was more similar to the 3' end of the Medicago-type Au than to that of the Glycine-type Au. A phylogenetic tree indicated that the Au sequence from E. ciliata is more closely related to the sequence found in Glycine than to that found in Medicago/Lotus. These results indicate that Au is present in both angiosperms and gymnosperms.
Affiliation(s)
- Eiki Yagi
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Japan.
9
Alix K, Joets J, Ryder CD, Moore J, Barker GC, Bailey JP, King GJ, Pat Heslop-Harrison JS. The CACTA transposon Bot1 played a major role in Brassica genome divergence and gene proliferation. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2008; 56:1030-44. [PMID: 18764926 DOI: 10.1111/j.1365-313x.2008.03660.x] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Indexed: 05/11/2023]
Abstract
We isolated and characterized a Brassica C genome-specific CACTA element, which was designated Bot1 (Brassica oleracea transposon 1). After analysing phylogenetic relationships, copy numbers and sequence similarity of Bot1 and Bot1 analogues in B. oleracea (C genome) versus Brassica rapa (A genome), we concluded that Bot1 has encountered several rounds of amplification in the oleracea genome only, and has played a major role in the recent rapa and oleracea genome divergence. We performed in silico analyses of the genomic organization and internal structure of Bot1, and established which segment of Bot1 is C-genome specific. Our work reports a fully characterized Brassica repetitive sequence that can distinguish the Brassica A and C chromosomes in the allotetraploid Brassica napus, by fluorescent in situ hybridization. We demonstrated that Bot1 carries a host S locus-associated SLL3 gene copy. We speculate that Bot1 was involved in the proliferation of SLL3 around the Brassica genome. The present study reinforces the assumption that transposons are a major driver of genome and gene evolution in higher plants.
Affiliation(s)
- Karine Alix
- UMR de Génétique Végétale INRA/Univ Paris-Sud/CNRS/AgroParisTech, Ferme du Moulon, F-91190 Gif-sur-Yvette, France.
10
Tsuchimoto S, Hirao Y, Ohtsubo E, Ohtsubo H. New SINE families from rice, OsSN, with poly(A) at the 3' ends. Genes Genet Syst 2008; 83:227-36. [PMID: 18670134 DOI: 10.1266/ggs.83.227] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Indexed: 11/23/2022]
Abstract
A database search of the sequences flanking a member of rice retrotransposon RIRE7 revealed that a 298-bp sequence in the region downstream of the member is a repetitive sequence interspersed in the genome of Oryza sativa cv. Nipponbare. Most of the repetitive sequences were flanked by a direct repeat of a target-site sequence, about 14 bp in length. The consensus sequence, 293 bp in length, had no regions encoding any proteins but had sequence motifs of an internal promoter of RNA polymerase III. These indicate that the sequence is a retroposon SINE, designated OsSN1 (Oryza sativa SINE1). OsSN1 is a new rice SINE, because it has no homology with any of the three p-SINE families previously identified from rice, and because it has a stretch of A at the 3' end, unlike p-SINE and any other Gramineae SINEs which have a stretch of T at the 3' end. The Nipponbare genome was found to have many members related to OsSN1, forming two additional new SINE families (designated OsSN2 and OsSN3). OsSN2 and OsSN3 are highly homologous to the 3' and 5' regions of OsSN1, respectively. This suggests that OsSN1 has a mosaic structure, which is generated by sequence exchange (or shuffling) between ancestral OsSN2 and OsSN3. Despite the absence of homology in the 3' regions between OsSN1 (or OsSN2) and OsSN3, a sequence, 5'-TTCTC-3', is commonly present in the region preceding the A stretch at the 3' end. This sequence together with the A stretch and a stem-loop structure found in the region near the A stretch are assumed to be important for retroposition. OsSN members were present in strains of Oryza species, as were p-SINE members. Some of the members showed insertion polymorphism at the respective loci among the rice strains. p-SINE had such polymorphic members, which are useful for classification and phylogenetic analysis of various strains of Oryza species. The polymorphic members of OsSN were more frequently found than those of p-SINE, and therefore, such members are likely to be useful for extensive taxonomic and phylogenetic studies on various rice strains.
Affiliation(s)
- Suguru Tsuchimoto
- Institute of Molecular and Cellular Biosciences, the University of Tokyo, Tokyo, Japan.
11
Deragon JM, Zhang X. Short interspersed elements (SINEs) in plants: origin, classification, and use as phylogenetic markers. Syst Biol 2007; 55:949-56. [PMID: 17345676 DOI: 10.1080/10635150601047843] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Indexed: 10/23/2022]
Abstract
Short interspersed elements (SINEs) are a class of dispersed mobile sequences that use RNA as an intermediate in an amplification process called retroposition. The presence or absence of a SINE at a given locus has been used as a meaningful classification criterion to evaluate phylogenetic relations among species. We review here recent developments in the characterisation of plant SINEs and their use as molecular markers to retrace phylogenetic relations among wild and cultivated Oryza and Brassica species. In Brassicaceae, further use of SINE markers is limited by our partial knowledge of endogenous SINE families (their origin and evolutionary histories) and by the absence of a clear classification. To solve this problem, phylogenetic relations among all known Brassicaceae SINEs were analyzed and a new classification, grouping SINEs in 15 different families, is proposed. The relative age and size of each Brassicaceae SINE family was evaluated and new phylogenetically supported subfamilies were described. We also present evidence suggesting that new potentially active SINEs recently emerged in Brassica oleracea from the shuffling of preexisting SINE portions. Finally, the comparative evolutionary history of SINE families present in Arabidopsis thaliana and Brassica oleracea revealed that SINEs were in general more active in the Brassica lineage. The importance of these new data for the use of Brassicaceae SINEs as molecular markers in future applications is discussed.
Affiliation(s)
- Jean-Marc Deragon
- CNRS UMR6547, GDR2157 Biomove, Université Blaise Pascal, 24 Avenue des Landais, 63177, Aubière, France.