1
|
Gao D. Introduction of Plant Transposon Annotation for Beginners. BIOLOGY 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that contribute large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires necessary expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also bring difficulties for genome-wide transposon discovery. This review summarizes the three major strategies for transposon detection including repeat-based, structure-based, and homology-based annotation, and introduces the transposon superfamilies identified in plants thus far, and some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review also discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
| |
Collapse
|
2
|
Firsov SY, Kosherova KA, Mukha DV. Identification and functional characterization of the German cockroach, Blattella germanica, short interspersed nuclear elements. PLoS One 2022; 17:e0266699. [PMID: 35696390 PMCID: PMC9191728 DOI: 10.1371/journal.pone.0266699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 03/24/2022] [Indexed: 11/18/2022] Open
Abstract
In recent decades, experimental data has accumulated indicating that short interspersed nuclear elements (SINEs) can play a significant functional role in the regulation of gene expression in the host genome. In addition, molecular markers based on SINE insertion polymorphisms have been developed and are widely used for genetic differentiation of populations of eukaryotic organisms. Using routine bioinformatics analysis and publicly available genomic DNA and small RNA-seq data, we first described nine SINEs in the genome of the German cockroach, Blattella germanica. All described SINEs have tRNA promoters, and the start of their transcription begins 11 bp upstream of an "A" box of these promoters. The number of copies of the described SINEs in the B. germanica genome ranges from several copies to more than a thousand copies in a SINE-specific manner. Some of the described SINEs and their degenerate copies can be localized both in the introns of genes and loci known as piRNA clusters. piRNAs originating from piRNA clusters are shown to be mapped to seven of the nine types of SINEs described, including copies of SINEs localized in gene introns. We speculate that SINEs, localized in the introns of certain genes, may regulate the level of expression of these genes by a PIWI-related molecular mechanism.
Collapse
Affiliation(s)
- Sergei Yu. Firsov
- Vavilov Institute of General Genetics Russian Academy of Sciences, Moscow, Russia
| | - Karina A. Kosherova
- Vavilov Institute of General Genetics Russian Academy of Sciences, Moscow, Russia
| | - Dmitry V. Mukha
- Vavilov Institute of General Genetics Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
3
|
Li Y, Jiang N, Sun Y. AnnoSINE: a short interspersed nuclear elements annotation tool for plant genomes. PLANT PHYSIOLOGY 2022; 188:955-970. [PMID: 34792587 PMCID: PMC8825457 DOI: 10.1093/plphys/kiab524] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
Short interspersed nuclear elements (SINEs) are a widespread type of small transposable element (TE). With increasing evidence for their impact on gene function and genome evolution in plants, accurate genome-scale SINE annotation becomes a fundamental step for studying the regulatory roles of SINEs and their relationship with other components in the genomes. Despite the overall promising progress made in TE annotation, SINE annotation remains a major challenge. Unlike some other TEs, SINEs are short and heterogeneous, and they usually lack well-conserved sequence or structural features. Thus, current SINE annotation tools have either low sensitivity or high false discovery rates. Given the demand and challenges, we aimed to provide a more accurate and efficient SINE annotation tool for plant genomes. The pipeline starts with maximizing the pool of SINE candidates via profile hidden Markov model-based homology search and de novo SINE search using structural features. Then, it excludes the false positives by integrating all known features of SINEs and the features of other types of TEs that can often be misannotated as SINEs. As a result, the pipeline substantially improves the tradeoff between sensitivity and accuracy, with both values close to or over 90%. We tested our tool in Arabidopsis thaliana and rice (Oryza sativa), and the results show that our tool competes favorably against existing SINE annotation tools. The simplicity and effectiveness of this tool would potentially be useful for generating more accurate SINE annotations for other plant species. The pipeline is freely available at https://github.com/yangli557/AnnoSINE.
Collapse
Affiliation(s)
- Yang Li
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| |
Collapse
|
4
|
Chalopin D, Clark LG, Wysocki WP, Park M, Duvall MR, Bennetzen JL. Integrated Genomic Analyses From Low-Depth Sequencing Help Resolve Phylogenetic Incongruence in the Bamboos (Poaceae: Bambusoideae). FRONTIERS IN PLANT SCIENCE 2021; 12:725728. [PMID: 34567039 PMCID: PMC8456298 DOI: 10.3389/fpls.2021.725728] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 08/13/2021] [Indexed: 06/13/2023]
Abstract
The bamboos (Bambusoideae, Poaceae) comprise a major grass lineage with a complex evolutionary history involving ancient hybridization and allopolyploidy. About 1700 described species are classified into three tribes, Olyreae (herbaceous bamboos), Bambuseae (tropical woody bamboos), and Arundinarieae (temperate woody bamboos). Nuclear analyses strongly support monophyly of the woody tribes, whereas plastome analyses strongly support paraphyly, with Bambuseae sister to Olyreae. Our objectives were to clarify the origin(s) of the woody bamboo tribes and resolve the nuclear vs. plastid conflict using genomic tools. For the first time, plastid and nuclear genomic information from the same bamboo species were combined in a single study. We sampled 51 species of bamboos representing the three tribes, estimated their genome sizes and generated low-depth sample sequence data, from which plastomes were assembled and nuclear repeats were analyzed. The distribution of repeat families was found to agree with nuclear gene phylogenies, but also provides novel insights into nuclear evolutionary history. We infer two early, independent hybridization events, one between an Olyreae ancestor and a woody ancestor giving rise to the two Bambuseae lineages, and another between two woody ancestors giving rise to the Arundinarieae. Retention of the Olyreae plastome associated with differential dominance of nuclear genomes and subsequent diploidization in some lineages explains the paraphyly observed in plastome phylogenetic estimations. We confirm ancient hybridization and allopolyploidy in the origins of the extant woody bamboo lineages and propose biased fractionation and diploidization as important factors in their evolution.
Collapse
Affiliation(s)
- Domitille Chalopin
- Department of Genetics, University of Georgia, Athens, GA, United States
| | - Lynn G. Clark
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, United States
| | - William P. Wysocki
- Center for Translational Data Science, University of Chicago, Chicago, IL, United States
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, United States
| | - Melvin R. Duvall
- Department of Biology and Institute for the Study of the Environment, Sustainability, and Energy, Northern Illinois University, DeKalb, IL, United States
| | | |
Collapse
|
5
|
Besse P. Guidelines for the Choice of Sequences for Molecular Plant Taxonomy. Methods Mol Biol 2021; 2222:39-55. [PMID: 33301086 DOI: 10.1007/978-1-0716-0997-2_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
This chapter presents an overview of the major plant DNA sequences and molecular methods available for plant taxonomy. Guidelines are provided for the choice of sequences and methods to be used, based on the DNA compartment (nuclear, chloroplastic, mitochondrial), evolutionary mechanisms, and the level of taxonomic differentiation of the plants under survey.
Collapse
Affiliation(s)
- Pascale Besse
- UMR PVBMT, Universite de la Reunion, St Pierre, Réunion, France.
| |
Collapse
|
6
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
7
|
Seibt KM, Schmidt T, Heitkam T. The conserved 3' Angio-domain defines a superfamily of short interspersed nuclear elements (SINEs) in higher plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:681-699. [PMID: 31610059 DOI: 10.1111/tpj.14567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 09/13/2019] [Accepted: 09/17/2019] [Indexed: 06/10/2023]
Abstract
Repetitive sequences are ubiquitous components of eukaryotic genomes affecting genome size and evolution as well as gene regulation. Among them, short interspersed nuclear elements (SINEs) are non-coding retrotransposons usually shorter than 1000 bp. They contain only few short conserved structural motifs, in particular an internal promoter derived from cellular RNAs and a mostly AT-rich 3' tail, whereas the remaining regions are highly variable. SINEs emerge and vanish during evolution, and often diversify into numerous families and subfamilies that are usually specific for only a limited number of species. In contrast, at the 3' end of multiple plant SINEs we detected the highly conserved 'Angio-domain'. This 37 bp segment defines the Angio-SINE superfamily, which encompasses 24 plant SINE families widely distributed across 13 orders within the plant kingdom. We retrieved 28 433 full-length Angio-SINE copies from genome assemblies of 46 plant species, frequently located in genes. Compensatory mutations in and adjacent to the Angio-domain imply selective restraints maintaining its RNA structure. Angio-SINE families share segmental sequence similarities, indicating a modular evolution with strong Angio-domain preservation. We suggest that the conserved domain contributes to the evolutionary success of Angio-SINEs through either structural interactions between SINE RNA and proteins increasing their transpositional efficiency, or by enhancing their accumulation in genes.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Thomas Schmidt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| |
Collapse
|
8
|
Nishiyama E, Ohshima K. Cross-Kingdom Commonality of a Novel Insertion Signature of RTE-Related Short Retroposons. Genome Biol Evol 2018; 10:1471-1483. [PMID: 29850801 PMCID: PMC6007223 DOI: 10.1093/gbe/evy098] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2018] [Indexed: 12/15/2022] Open
Abstract
In multicellular organisms, such as vertebrates and flowering plants, horizontal transfer (HT) of genetic information is thought to be a rare event. However, recent findings unveiled unexpectedly frequent HT of RTE-clade LINEs. To elucidate the molecular footprints of the genomic integration machinery of RTE-related retroposons, the sequence patterns surrounding the insertion sites of plant Au-like SINE families were analyzed in the genomes of a wide variety of flowering plants. A novel and remarkable finding regarding target site duplications (TSDs) for SINEs was they start with thymine approximately one helical pitch (ten nucleotides) downstream of a thymine stretch. This TSD pattern was found in RTE-clade LINEs, which share the 3'-end sequence of these SINEs, in the genome of leguminous plants. These results demonstrably show that Au-like SINEs were mobilized by the enzymatic machinery of RTE-clade LINEs. Further, we discovered the same TSD pattern in animal SINEs from lizard and mammals, in which the RTE-clade LINEs sharing the 3'-end sequence with these animal SINEs showed a distinct TSD pattern. Moreover, a significant correlation was observed between the first nucleotide of TSDs and microsatellite-like sequences found at the 3'-ends of SINEs and LINEs. We propose that RTE-encoded protein could preferentially bind to a DNA region that contains a thymine stretch to cleave a phosphodiester bond downstream of the stretch. Further, determination of cleavage sites and/or efficiency of primer sites for reverse transcription may depend on microsatellite-like repeats in the RNA template. Such a unique mechanism may have enabled retroposons to successfully expand in frontier genomes after HT.
Collapse
Affiliation(s)
- Eri Nishiyama
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| | - Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| |
Collapse
|
9
|
Mao H, Wang H. Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs). Genome Biol Evol 2018; 9:2048-2056. [PMID: 28903462 PMCID: PMC5585668 DOI: 10.1093/gbe/evx145] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2017] [Indexed: 02/06/2023] Open
Abstract
Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family.
Collapse
Affiliation(s)
- Hongliang Mao
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China
| | - Hao Wang
- Department of Physics, T-Life Research Center, Fudan University, Shanghai, P.R. China.,Department of Genetics, University of Georgia
| |
Collapse
|
10
|
Keidar D, Doron C, Kashkush K. Genome-wide analysis of a recently active retrotransposon, Au SINE, in wheat: content, distribution within subgenomes and chromosomes, and gene associations. PLANT CELL REPORTS 2018; 37:193-208. [PMID: 29164313 PMCID: PMC5787218 DOI: 10.1007/s00299-017-2213-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Accepted: 10/05/2017] [Indexed: 05/02/2023]
Abstract
Here, we show that Au SINE elements have strong associations with protein-coding genes in wheat. Most importantly Au SINE insertion within introns causes allelic variation and might induce intron retention. The impact of transposable elements (TEs) on genome structure and function is intensively studied in eukaryotes, especially in plants where TEs can reach up to 90% of the genome in some cases, such as in wheat. Here, we have performed a genome-wide in-silico analysis using the updated publicly available genome draft of bread wheat (T. aestivum), in addition to the updated genome drafts of the diploid donor species, T. urartu and Ae. tauschii, to retrieve and analyze a non-LTR retrotransposon family, termed Au SINE, which was found to be widespread in plant species. Then, we have performed site-specific PCR and realtime RT-PCR analyses to assess the possible impact of Au SINE on gene structure and function. To this end, we retrieved 133, 180 and 1886 intact Au SINE insertions from T. urartu, Ae. tauschii and T. aestivum genome drafts, respectively. The 1886 Au SINE insertions were distributed in the seven homoeologous chromosomes of T. aestivum, while ~ 67% of the insertions were associated with genes. Detailed analysis of 40 genes harboring Au SINE revealed allelic variation of those genes in the Triticum-Aegilops genus. In addition, expression analysis revealed that both regular transcripts and alternative Au SINE-containing transcripts were simultaneously amplified in the same tissue, indicating retention of Au SINE-containing introns. Analysis of the wheat transcriptome revealed that hundreds of protein-coding genes harbor Au SINE in at least one of their mature splice variants. Au SINE might play a prominent role in speciation by creating transcriptome variation.
Collapse
Affiliation(s)
- Danielle Keidar
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel
| | - Chen Doron
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel
| | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel.
| |
Collapse
|
11
|
Kögler A, Schmidt T, Wenke T. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:676-695. [PMID: 28857316 DOI: 10.1111/tpj.13676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2017] [Revised: 08/18/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families.
Collapse
Affiliation(s)
- Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Dresden, 01069, Germany
| |
Collapse
|
12
|
Luchetti A, Plazzi F, Mantovani B. Evolution of Two Short Interspersed Elements in Callorhinchus milii (Chondrichthyes, Holocephali) and Related Elements in Sharks and the Coelacanth. Genome Biol Evol 2017; 9:3824762. [PMID: 28505260 PMCID: PMC5499810 DOI: 10.1093/gbe/evx094] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/11/2017] [Indexed: 12/11/2022] Open
Abstract
Short interspersed elements (SINEs) are non-autonomous retrotransposons. Although they usually show fast evolutionary rates, in some instances highly conserved domains (HCDs) have been observed in elements with otherwise divergent sequences and from distantly related species. Here, we document the life history of two HCD-SINE families in the elephant shark Callorhinchus milii, one specific to the holocephalan lineage (CmiSINEs) and another one (SacSINE1-CM) with homologous elements in sharks and the coelacanth (SacSINE1s, LmeSINE1s). The analyses of their relationships indicated that these elements share the same 3′-tail, which would have allowed both elements to rise to high copy number by exploiting the C. milii L2-2_CM long interspersed element (LINE) enzymes. Molecular clock analysis on SINE activity in C. milii genome evidenced two replication bursts occurring right after two major events in the holocephalan evolution: the end-Permian mass extinction and the radiation of modern Holocephali. Accordingly, the same analysis on the coelacanth homologous elements, LmeSINE1, identified a replication wave close to the split age of the two extant Latimeria species. The genomic distribution of the studied SINEs pointed out contrasting results: some elements were preferentially sorted out from gene regions, but accumulated in flanking regions, while others appear more conserved within genes. Moreover, data from the C. milii transcriptome suggest that these SINEs could be involved in miRNA biogenesis and may be targets for miRNA-based regulation.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Federico Plazzi
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| |
Collapse
|
13
|
Identification and characterisation of Short Interspersed Nuclear Elements in the olive tree (Olea europaea L.) genome. Mol Genet Genomics 2016; 292:53-61. [PMID: 27714457 DOI: 10.1007/s00438-016-1255-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2016] [Accepted: 10/03/2016] [Indexed: 10/20/2022]
Abstract
Short Interspersed Nuclear Elements (SINEs) are nonautonomous retrotransposons in the genome of most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, SINE identification has been carried out only in a limited number of plant species. This lack of information is apparent especially in non-model plants whose genome has not been sequenced yet. The aim of this work was to produce a specific bioinformatics pipeline for analysing second generation sequence reads of a non-model species and identifying SINEs. We have identified, for the first time, 227 putative SINEs of the olive tree (Olea europaea), that constitute one of the few sets of such sequences in dicotyledonous species. The identified SINEs ranged from 140 to 362 bp in length and were characterised with regard to the occurrence of the tRNA domain in their sequence. The majority of identified elements resulted in single copy or very lowly repeated, often in association with genic sequences. Analysis of sequence similarity allowed us to identify two major groups of SINEs showing different abundances in the olive tree genome, the former with sequence similarity to SINEs of Scrophulariaceae and Solanaceae and the latter to SINEs of Salicaceae. A comparison of sequence conservation between olive SINEs and LTR retrotransposon families suggested that SINE expansion in the genome occurred especially in very ancient times, before LTR retrotransposon expansion, and presumably before the separation of the rosids (to which Oleaceae belong) from the Asterids. Besides providing data on olive SINEs, our results demonstrate the suitability of the pipeline employed for SINE identification. Applying this pipeline will favour further structural and functional analyses on these relatively unknown elements to be performed also in other plant species, even in the absence of a reference genome, and will allow establishing general evolutionary patterns for this kind of repeats in plants.
Collapse
|
14
|
Seibt KM, Wenke T, Muders K, Truberg B, Schmidt T. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 86:268-285. [PMID: 26996788 DOI: 10.1111/tpj.13170] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 03/11/2016] [Accepted: 03/14/2016] [Indexed: 06/05/2023]
Abstract
Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | | | | | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
15
|
Susek K, Bielski WK, Hasterok R, Naganowska B, Wolko B. A First Glimpse of Wild Lupin Karyotype Variation As Revealed by Comparative Cytogenetic Mapping. FRONTIERS IN PLANT SCIENCE 2016; 7:1152. [PMID: 27516770 PMCID: PMC4964750 DOI: 10.3389/fpls.2016.01152] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 07/11/2016] [Indexed: 05/19/2023]
Abstract
Insight into plant genomes at the cytomolecular level provides useful information about their karyotype structure, enabling inferences about taxonomic relationships and evolutionary origins. The Old World lupins (OWL) demonstrate a high level of genomic diversification involving variation in chromosome numbers (2n = 32-52), basic chromosome numbers (x = 5-7, 9, 13) and in nuclear genome size (2C DNA = 0.97-2.68 pg). Lupins comprise both crop and wild species and provide an intriguing system to study karyotype evolution. In order to investigate lupin chromosome structure, heterologous FISH was used. Sixteen BACs that had been generated as chromosome markers for the reference species, Lupinus angustifolius, were used to identify chromosomes in the wild species and explore karyotype variation. While all "single-locus" in L. angustifolius, in the wild lupins these clones proved to be "single-locus," "single-locus" with additional signals, "repetitive" or had no detectable BAC-FISH signal. The diverse distribution of the clones in the targeted genomes suggests a complex evolution history, which possibly involved multiple chromosomal changes such as fusions/fissions and repetitive sequence amplification. Twelve BACs were sequenced and we found numerous transposable elements including DNA transposons as well as LTR and non-LTR retrotransposons with varying quantity and composition among the different lupin species. However, at this preliminary stage, no correlation was observed between the pattern of BAC-FISH signals and the repeat content in particular BACs. Here, we describe the first BAC-based chromosome-specific markers for the wild species: L. cosentinii, L. cryptanthus, L. pilosus, L. micranthus and one New World lupin, L. multiflorus. These BACs could constitute the basis for an assignment of the chromosomal and genetic maps of other lupins, e.g., L. albus and L. luteus. Moreover, we identified karyotype variation that helps illustrate the relationships between the lupins and the extensive cytological diversity within this group. In this study we premise that lupin genomes underwent at least two rounds of fusion and fission events resulting in the reduction in chromosome number from 2n = 52 through 2n = 40 to 2n = 32, followed by chromosome number increment to 2n = 42.
Collapse
Affiliation(s)
- Karolina Susek
- Department of Genomics, Institute of Plant Genetics, Polish Academy of SciencesPoznan, Poland
- *Correspondence: Karolina Susek
| | - Wojciech K. Bielski
- Department of Genomics, Institute of Plant Genetics, Polish Academy of SciencesPoznan, Poland
| | - Robert Hasterok
- Department of Plant Anatomy and Cytology, University of Silesia in KatowiceKatowice, Poland
| | - Barbara Naganowska
- Department of Genomics, Institute of Plant Genetics, Polish Academy of SciencesPoznan, Poland
| | - Bogdan Wolko
- Department of Genomics, Institute of Plant Genetics, Polish Academy of SciencesPoznan, Poland
| |
Collapse
|
16
|
Schwichtenberg K, Wenke T, Zakrzewski F, Seibt KM, Minoche A, Dohm JC, Weisshaar B, Himmelbauer H, Schmidt T. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 85:229-44. [PMID: 26676716 DOI: 10.1111/tpj.13103] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2015] [Revised: 11/23/2015] [Accepted: 11/26/2015] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs.
Collapse
Affiliation(s)
| | - Torsten Wenke
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Falk Zakrzewski
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - André Minoche
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
| | - Juliane C Dohm
- Max Planck Institute for Molecular Genetics, 14195, Berlin, Germany
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
| | - Bernd Weisshaar
- CeBiTec & Department of Biology, University of Bielefeld, 33615, Bielefeld, Germany
| | - Heinz Himmelbauer
- Garvan Institute of Medical Research, 2010, Sydney, NSW, Australia
- Department of Biotechnology, University of Natural Resources and Life Sciences (BOKU), 1190, Vienna, Austria
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| |
Collapse
|
17
|
Wenke T, Seibt KM, Döbel T, Muders K, Schmidt T. Inter-SINE Amplified Polymorphism (ISAP) for rapid and robust plant genotyping. Methods Mol Biol 2015; 1245:183-192. [PMID: 25373758 DOI: 10.1007/978-1-4939-1966-6_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The unambiguous differentiation of crop genotypes is often laborious or expensive. A rapid, robust, and cost-efficient marker system is required for routine genotyping in plant breeding and marker-assisted selection. We describe the Inter-SINE Amplified Polymorphism (ISAP) system that is based on standard molecular methods resulting in genotype-specific fingerprints at high resolution. These markers are derived from Short Interspersed Nuclear Elements (SINEs) which are dispersed repetitive sequences present in most if not all plant genomes and can be efficiently extracted from plant genome sequences. The ISAP method was developed on potato as model plant but is also transferable to other plant species.
Collapse
Affiliation(s)
- Torsten Wenke
- Institute of Botany, Technische Universität Dresden, Mommsenstraße 11, Dresden, 01062, Germany
| | | | | | | | | |
Collapse
|
18
|
Genetics of cryptic speciation within an Arctic mustard, Draba nivalis. PLoS One 2014; 9:e93834. [PMID: 24691072 PMCID: PMC3972243 DOI: 10.1371/journal.pone.0093834] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2013] [Accepted: 03/09/2014] [Indexed: 01/13/2023] Open
Abstract
Crossing experiments indicate that hybrid sterility barriers frequently have developed within diploid, circumpolar plant species of the genus Draba. To gain insight into the rapid evolution of postzygotic reproductive isolation in this system, we augmented the linkage map of one of these species, D. nivalis, and searched for quantitative trait loci (QTLs) associated with reproductive isolation. The map adds 63 new dominant markers to a previously published dataset of 31 co-dominant microsatellites. These markers include 52 amplified fragment length polymorphisms (AFLPs) and 11 sequence-specific amplified polymorphisms (SSAPs) based on retrotransposon sequence. 22 markers displaying transmission ratio distortion were further included in the map. We resolved eight linkage groups with a total map length of 894 cM. Significant genotype-trait associations, or quantitative trait loci (QTL), were detected for reproductive phenotypes including pollen fertility (4 QTLs), seed set (3 QTLs), flowering time (3 QTLs) and number of flowers (4 QTLs). Observed patterns of inheritance were consistent with the influence of both nuclear-nuclear interactions and chromosomal changes on these traits. All seed set QTLs and one pollen fertility QTL displayed underdominant effects suggestive of the involvement of chromosomal rearrangements in hybrid sterility. Interestingly, D. nivalis is predominantly self-fertilizing, which may facilitate the establishment of underdominant loci and contribute to reproductive isolation.
Collapse
|
19
|
Abstract
This chapter presents an overview of the major plant DNA sequences and molecular methods available for plant taxonomy. Guidelines are provided for the choice of sequences and methods to be used, based on the DNA compartment (nuclear, chloroplastic, mitochondrial), evolutionary mechanisms, and the level of taxonomic differentiation of the plants under survey.
Collapse
|
20
|
Ben-David S, Yaakov B, Kashkush K. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 76:201-10. [PMID: 23855320 PMCID: PMC4223381 DOI: 10.1111/tpj.12285] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 06/04/2013] [Accepted: 07/03/2013] [Indexed: 05/02/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation.
Collapse
|
21
|
Schulman AH. Retrotransposon replication in plants. Curr Opin Virol 2013; 3:604-14. [PMID: 24035277 DOI: 10.1016/j.coviro.2013.08.009] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 08/16/2013] [Accepted: 08/19/2013] [Indexed: 12/31/2022]
Abstract
Retrotransposons comprise the bulk of large plant genomes, replicating via an RNA intermediate whereby the original, integrated element remains in place. Of the two main orders, the LTR retrotransposons considerably outnumber the LINEs. LINEs integrate into target sites simultaneously with the RNA transcript being copied into cDNA by target-primed reverse transcription. LTR retrotransposon replication is basically equivalent to the intracellular phase of retroviral life cycles. The envelope gene giving extracellular mobility to retroviruses is in fact widespread in plants and their retrotransposons. Evolutionary analyses of the retrotransposons and retroviruses suggest that both form an ancient monophyletic group. The particular adaptations of LTR retrotransposons to plant life cycles enabling their success remain to be clarified.
Collapse
Affiliation(s)
- Alan H Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, P.O. Box 65, Helsinki FIN-00014, Finland; Biotechnology and Food Research, MTT Agrifood Research Finland, Jokioinen FIN-31600, Finland.
| |
Collapse
|
22
|
Luchetti A, Mantovani B. Conserved domains and SINE diversity during animal evolution. Genomics 2013; 102:296-300. [PMID: 23981965 DOI: 10.1016/j.ygeno.2013.08.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Revised: 06/25/2013] [Accepted: 08/14/2013] [Indexed: 11/28/2022]
Abstract
Eukaryotic genomes harbour a number of mobile genetic elements (MGEs); moving from one genomic location to another, they are known to impact on the host genome. Short interspersed elements (SINEs) are well-represented, non-autonomous retroelements and they are likely the most diversified MGEs. In some instances, sequence domains conserved across unrelated SINEs have been identified; remarkably, one of these, called Nin, has been conserved since the Radiata-Bilateria splitting. Here we report on two new domains: Inv, derived from Nin, identified in insects and in deuterostomes, and Pln, restricted to polyneopteran insects. The identification of Inv and Pln sequences allowed us to retrieve new SINEs, two in insects and one in a hemichordate. The diverse structural combination of the different domains in different SINE families, during metazoan evolution, offers a clearer view of SINE diversity and their frequent de novo emergence through module exchange, possibly underlying the high evolutionary success of SINEs.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dip. Scienze Biologiche, Geologiche e Ambientali (BiGeA) - Università di Bologna, via Selmi 3, 40126 Bologna, Italy.
| | | |
Collapse
|
23
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
24
|
Abstract
SINEBase (http://sines.eimb.ru) integrates the revisited body of knowledge about short interspersed elements (SINEs). A set of formal definitions concerning SINEs was introduced. All available sequence data were screened through these definitions and the genetic elements misidentified as SINEs were discarded. As a result, 175 SINE families have been recognized in animals, flowering plants and green algae. These families were classified by the modular structure of their nucleotide sequences and the frequencies of different patterns were evaluated. These data formed the basis for the database of SINEs. The SINEBase website can be used in two ways: first, to explore the database of SINE families, and second, to analyse candidate SINE sequences using specifically developed tools. This article presents an overview of the database and the process of SINE identification and analysis.
Collapse
Affiliation(s)
- Nikita S Vassetzky
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Moscow 119991, Russia
| | | |
Collapse
|
25
|
Shu Y, Li Y, Bai X, Cai H, Ji W, Ji Z, Guo C, Zhu Y. Identification and characterization of a new member of the SINE Au retroposon family (GmAu1) in the soybean, Glycine max (L.) Merr., genome and its potential application. PLANT CELL REPORTS 2011; 30:2207-13. [PMID: 21796384 DOI: 10.1007/s00299-011-1126-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2011] [Revised: 07/06/2011] [Accepted: 07/12/2011] [Indexed: 05/31/2023]
Abstract
A plant short interspersed element (SINE) was identified in Glycine max after re-sequencing of the soybean sequence characterized amplified region (SCAR) markers. Detailed analysis revealed that this newly recognized SINE element consisted of a tRNA-related region, a tRNA non-related region, direct flanking repeat sequences, and a short stretch of Ts at the 3'-terminal region. These features are similar to previously characterized SINEs. To investigate the evolution of the SINE retroposon, BLASTN was used to search against genome sequences of other plants. Since it is homologous with the retroposon Au in Aegilops umbellulata (wheat) and its homology in soybean, the SINE is named as GmAu1. Genome analysis of the Glycine max var. Willimas 82 uncovered more than 847 copies of GmAu1 per haploid genome of soybean. Examination of the regions flanking the inserted GmAu1 sequences indicated a preference for introns over exons or other noncoding regions. Considering the flanking insertion sequences, 146 primers were designed in order to detect insertion mutations by a PCR-based method. Seventy-seven primers displayed polymorphism and were used to develop corresponding GmAu1-based SCAR markers. The retroposon GmAu1 and its related SCAR markers identified in this study will prove valuable to future investigations into the genetic mapping, phylogeny, and evolution of the Glycine genus.
Collapse
Affiliation(s)
- Yongjun Shu
- Key Laboratory of Molecular Cytogenetics and Genetic Breeding, College of Life Science and Technology, Harbin Normal University, Harbin, 150025, Heilongjiang, People's Republic of China
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Makarova SS, Minina EA, Makarov VV, Semenyuk PI, Kopertekh L, Schiemann J, Serebryakova MV, Erokhina TN, Solovyev AG, Morozov SY. Orthologues of a plant-specific At-4/1 gene in the genus Nicotiana and the structural properties of bacterially expressed 4/1 protein. Biochimie 2011; 93:1770-8. [PMID: 21712068 DOI: 10.1016/j.biochi.2011.06.018] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2011] [Accepted: 06/14/2011] [Indexed: 11/24/2022]
Abstract
Arabidopsis thaliana At-4/1 is the protein of unknown function capable of polar localization in plant cells and intercellular trafficking. In this work, we cloned cDNAs and chromosomal genes of At-4/1 orthologues from several Nicotiana species. Similarly to the 4/1 genes of A. thaliana and Oryza sativa, Nicotiana 4/1 genes have eight exons and seven introns but are considerably longer due to their larger introns. The allotetraploid genome of Nicotiana tabacum, which is known to consist of the 'S genome' originated from Nicotiana sylvestris and the 'T genome' derived from Nicotiana tomentosiformis, encodes two 4/1 genes. The T genome-encoded 4/1 gene, but not that of the S genome, contains a SINE-like transposable element in its intron 2. The 4/1 genes of Nicotiana hesperis and Nicotiana benthamiana lack such an element in the intron 2, but possess a related SINE-like sequence in their intron 4. Collectively, the sequence analysis data provide an insight into the organization of 4/1 genes in flowering plants and the patterns of evolution in the genus Nicotiana. The Nicotiana 4/1 proteins and those of other flowering plants show a significant level of sequence similarity. Computer-assisted analysis was further used to compare their predicted secondary structures. Several algorithms confidently predicted the presence of several coiled-coil domains occupying similar positions in different 4/1 proteins. Analysis of circular dichroism spectra carried out for bacterially expressed N. tabacum 4/1 protein (Nt-4/1) and its N- and C-terminally truncated mutants confirmed that the secondary structure of Nt-4/1 is generally alpha-helical. The C-terminal region of Nt-4/1 was found to undergo a partial proteolysis in Escherichia coli cells. Differential scanning calorimetry of Nt-4/1 protein and its mutants revealed three calorimetric domains most probably corresponding to the N-terminal, central, and C-terminal structural domains of the protein.
Collapse
Affiliation(s)
- Svetlana S Makarova
- Department of Virology, Biological Faculty, Moscow State University, Moscow 119992, Russia
| | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Wenke T, Döbel T, Sörensen TR, Junghans H, Weisshaar B, Schmidt T. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes. THE PLANT CELL 2011; 23:3117-28. [PMID: 21908723 PMCID: PMC3203444 DOI: 10.1105/tpc.111.088682] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Revised: 08/23/2011] [Accepted: 08/26/2011] [Indexed: 05/18/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5' truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3' end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families.
Collapse
Affiliation(s)
- Torsten Wenke
- Department of Biology, Dresden University of Technology, D-01062 Dresden, Germany.
| | | | | | | | | | | |
Collapse
|
28
|
Yagi E, Akita T, Kawahara T. A novel Au SINE sequence found in a gymnosperm. Genes Genet Syst 2011; 86:19-25. [PMID: 21498919 DOI: 10.1266/ggs.86.19] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Although many SINE families have been identified in the animal kingdom, only a few SINE families have been identified in plants, and their distribution is somewhat limited. The Au SINE (Au) has been found discontinuously in basal angiosperms, monocots, and eudicots. In this study, we examined the presence of the Au in gymnosperms and ferns by PCR using internal primers for Au. As a result, we found Au in a gymnosperm species, Ephedra ciliata. Therefore, Au was supposed to be present in the common ancestor of angiosperms and gymnosperms. The Au in E. ciliate was 15 bp shorter than the consensus sequence, which is similar to the Au SINE found in Glycine. However, the 3'end of the Au found in E. ciliate was more similar to the 3'end of the Medicago-type Au than that of the Glycine-type Au. A phylogenetic tree indicated that the Au sequence from E. ciliate is more closely related to the sequence found in Glycine than that found in Medicago/Lotus. These results indicated that Au were present in both angiosperms and gymnosperms.
Collapse
Affiliation(s)
- Eiki Yagi
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Japan.
| | | | | |
Collapse
|
29
|
Pélissier T, Clavel M, Chaparro C, Pouch-Pélissier MN, Vaucheret H, Deragon JM. Double-stranded RNA binding proteins DRB2 and DRB4 have an antagonistic impact on polymerase IV-dependent siRNA levels in Arabidopsis. RNA (NEW YORK, N.Y.) 2011; 17:1502-10. [PMID: 21700726 PMCID: PMC3153974 DOI: 10.1261/rna.2680711] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Accepted: 05/20/2011] [Indexed: 05/17/2023]
Abstract
Biogenesis of the vast majority of plant siRNAs depends on the activity of the plant-specific RNA polymerase IV (PolIV) enzyme. As part of the RNA-dependent DNA methylation (RdDM) process, PolIV-dependent siRNAs (p4-siRNAs) are loaded onto an ARGONAUTE4-containing complex and guide de novo DNA methyltransferases to target loci. Here we show that the double-stranded RNA binding proteins DRB2 and DRB4 are required for proper accumulation of p4-siRNAs. In flowers, loss of DRB2 results in increased accumulation of p4-siRNAs but not ta-siRNAs, inverted repeat (IR)-derived siRNAs, or miRNA. Loss of DRB2 does not impair uniparental expression of p4-dependent siRNAs in developing endosperm, indicating that p4-siRNA increased accumulation is not the result of the activation of the polIV pathway in the male gametophyte. In contrast to drb2, drb4 mutants exhibit reduced p4-siRNA levels, but the extent of this reduction is variable, according to the nature and size of the p4-siRNAs. Loss of DRB4 also leads to a spectacular increase of p4-independent IR-derived 24-nt siRNAs, suggesting a reallocation of factors from p4-dependent to p4-independent siRNA pathways in drb4. Opposite effects of drb2 and drb4 mutations on the accumulation of p4-siRNAs were also observed in vegetative tissues. Moreover, transgenic plants overexpressing DRB2 mimicked drb4 mutants at the morphological and molecular levels, confirming the antagonistic roles of DRB2 and DRB4.
Collapse
Affiliation(s)
- Thierry Pélissier
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, 66860 Perpignan Cedex, France
| | - Marion Clavel
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, 66860 Perpignan Cedex, France
| | - Cristian Chaparro
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, 66860 Perpignan Cedex, France
| | | | - Hervé Vaucheret
- Institut Jean-Pierre Bourgin, Unité Mixte de Recherche 1318, Institut National de la Recherche Agronomique, 78026 Versailles Cedex, France
| | - Jean-Marc Deragon
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, 66860 Perpignan Cedex, France
- Corresponding author.E-mail
| |
Collapse
|
30
|
Abstract
Short interspersed elements (SINEs) are mobile genetic elements that invade the genomes of many eukaryotes. Since their discovery about 30 years ago, many gaps in our understanding of the biology and function of SINEs have been filled. This review summarizes the past and recent advances in the studies of SINEs. The structure and origin of SINEs as well as the processes involved in their amplification, transcription, RNA processing, reverse transcription, and integration of a SINE copy into the genome are considered. Then we focus on the significance of SINEs for the host genomes. While these genomic parasites can be deleterious to the cell, the long-term being in the genome has made SINEs a valuable source of genetic variation providing regulatory elements for gene expression, alternative splice sites, polyadenylation signals, and even functional RNA genes.
Collapse
Affiliation(s)
- Dmitri A Kramerov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia.
| | | |
Collapse
|
31
|
Abstract
Short interspersed elements (SINEs) are one of the two most prolific mobile genomic elements in most of the higher eukaryotes. Although their biology is still not thoroughly understood, unusual life cycle of these simple elements amplified as genomic parasites makes their evolution unique in many ways. In contrast to most genetic elements including other transposons, SINEs emerged de novo many times in evolution from available molecules (for example, tRNA). The involvement of reverse transcription in their amplification cycle, huge number of genomic copies and modular structure allow variation mechanisms in SINEs uncommon or rare in other genetic elements (module exchange between SINE families, dimerization, and so on.). Overall, SINE evolution includes their emergence, progressive optimization and counteraction to the cell's defense against mobile genetic elements.
Collapse
|
32
|
Michaud M, Cognat V, Duchêne AM, Maréchal-Drouard L. A global picture of tRNA genes in plant genomes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2011; 66:80-93. [PMID: 21443625 DOI: 10.1111/j.1365-313x.2011.04490.x] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Although transfer RNA (tRNA) has a fundamental role in cell life, little is known about tRNA gene organization and expression on a genome-wide scale in eukaryotes, particularly plants. Here, we analyse the content and distribution of tRNA genes in five flowering plants and one green alga. The tRNA gene content is homogenous in plants, and is mostly correlated with genome size. The number of tRNA pseudogenes and organellar-like tRNA genes present in nuclear genomes varies greatly from one plant species to another. These pseudogenes or organellar-like genes appear to be generated or inserted randomly during evolution. Interestingly, we identified a new family of tRNA-related short interspersed nuclear elements (SINEs) in the Populus trichocarpa nuclear genome. In higher plants, intron-containing tRNA genes are rare, and correspond to genes coding for tRNA(Tyr) and tRNA(Mete) . By contrast, in green algae, more than half of the tRNA genes contain an intron. This suggests divergent means of intron acquisition and the splicing process between green algae and land plants. Numerous tRNAs are co-transcribed in Chlamydomonas, but they are mostly transcribed as a single unit in flowering plants. The only exceptions are tRNA(Gly) -snoRNA and tRNA(Mete) -snoRNA cotranscripts in dicots and monocots, respectively. The internal or external motifs required for efficient transcription of tRNA genes by RNA polymerase III are well conserved among angiosperms. A brief analysis of the mitochondrial and plastidial tRNA gene populations is also provided.
Collapse
Affiliation(s)
- Morgane Michaud
- Institut de Biologie Moléculaire des Plantes, UPR 2357-CNRS, Université de Strasbourg, 12 rue du Général Zimmer, F-67084 Strasbourg Cedex, France
| | | | | | | |
Collapse
|
33
|
Gadzalski M, Sakowicz T. Novel SINEs families in Medicago truncatula and Lotus japonicus: bioinformatic analysis. Gene 2011; 480:21-7. [PMID: 21352903 DOI: 10.1016/j.gene.2011.01.020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2010] [Revised: 12/01/2010] [Accepted: 01/31/2011] [Indexed: 02/02/2023]
Abstract
Although short interspersed elements (SINEs) were discovered nearly 30 years ago, the studies of these genomic repeats were mostly limited to animal genomes. Very little is known about SINEs in legumes--one of the most important plant families. Here we report identification, genomic distribution and molecular features of six novel SINE elements in Lotus japonicus (named LJ_SINE-1, -2, -3) and Medicago truncatula (MT_SINE-1, -2, -3), model species of legume. They possess all the structural features commonly found in short interspersed elements including RNA polymerase III promoter, polyA tail and flanking repeats. SINEs described here are present in low to moderate copy numbers from 150 to 3000. Bioinformatic analyses were used to searched public databases, we have shown that three of new SINE elements from M. truncatula seem to be characteristic of Medicago and Trifolium genera. Two SINE families have been found in L. japonicus and one is present in both M. truncatula and L. japonicus. In addition, we are discussing potential activities of the described elements.
Collapse
Affiliation(s)
- Marek Gadzalski
- Department of General Genetics, Plant Molecular Biology and Biotechnology, University of Lodz, Banacha 12/16, Lodz, Poland.
| | | |
Collapse
|
34
|
Lisch D, Slotkin RK. Strategies for silencing and escape: the ancient struggle between transposable elements and their hosts. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2011; 292:119-52. [PMID: 22078960 DOI: 10.1016/b978-0-12-386033-0.00003-7] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the past several years, there has been an explosion in our understanding of the mechanisms by which plant transposable elements (TEs) are epigenetically silenced and maintained in an inactive state over long periods of time. This highly efficient process results in vast numbers of inactive TEs; indeed, the majority of many plant genomes are composed of these quiescent elements. This observation has led to the rather static view that TEs represent an essentially inert portion of plant genomes. However, recent work has demonstrated that TE silencing is a highly dynamic process that often involves transcription of TEs at particular times and places during plant development. Plants appear to use transcripts from silenced TEs as an ongoing source of information concerning the mobile portion of the genome. In contrast to our understanding of silencing pathways, we know relatively little about the ways in which TEs evade silencing. However, vast differences in TE content between even closely related plant species suggest that they are often wildly successful at doing so. Here, we discuss TE activity in plants as the result of a constantly shifting balance between host strategies for TE silencing and TE strategies for escape and amplification.
Collapse
Affiliation(s)
- Damon Lisch
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| | | |
Collapse
|
35
|
Baucom RS, Estill JC, Chaparro C, Upshaw N, Jogi A, Deragon JM, Westerman RP, SanMiguel PJ, Bennetzen JL. Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome. PLoS Genet 2009; 5:e1000732. [PMID: 19936065 PMCID: PMC2774510 DOI: 10.1371/journal.pgen.1000732] [Citation(s) in RCA: 245] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2009] [Accepted: 10/19/2009] [Indexed: 11/29/2022] Open
Abstract
Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and ∼35,000 copies, respectively, or a combined ∼1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially occupy and exploit this genomic diversity. Although TEs are a major component of all studied plant genomes, and are the most significant contributors to genome structure and evolution in almost all eukaryotes that have been investigated, their properties and reasons for existence are not well understood in any eukaryotic genome. In order to begin a comprehensive study of TE contributions to the structure, function, and evolution of both genes and genomes, we first identified all of the TEs in maize and then investigated whether there were non-random patterns in their dispersal. We used homology and TE structure criteria in an effort to discover all of the retroelements in the recently sequenced genome from maize inbred B73. We found that the retroelements are incredibly diverse in maize, with many hundreds of families that show different insertion and/or retention specificities across the maize chromosomes. Most of these element families are present in low copy numbers and had been missed by previous searches that relied on a high-copy-number criterion. Different element families exhibited very different biases for accumulation across the chromosomes, indicating that they can detect and utilize many different chromatin environments.
Collapse
Affiliation(s)
- Regina S. Baucom
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - James C. Estill
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Cristian Chaparro
- Université de Perpignan, Via Domitia, CNRS UMR5096 LGDP, Perpignan, France
| | - Naadira Upshaw
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Ansuya Jogi
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Jean-Marc Deragon
- Université de Perpignan, Via Domitia, CNRS UMR5096 LGDP, Perpignan, France
| | - Richard P. Westerman
- Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, United States of America
| | - Phillip J. SanMiguel
- Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, United States of America
| | - Jeffrey L. Bennetzen
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
- * E-mail:
| |
Collapse
|
36
|
Gogolevsky KP, Vassetzky NS, Kramerov DA. 5S rRNA-derived and tRNA-derived SINEs in fruit bats. Genomics 2009; 93:494-500. [PMID: 19442632 DOI: 10.1016/j.ygeno.2009.02.001] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Revised: 02/04/2009] [Accepted: 02/04/2009] [Indexed: 11/24/2022]
Abstract
Most short retroposons (SINEs) descend from cellular tRNA of 7SL RNA. Here, four new SINEs were found in megabats (Megachiroptera) but neither in microbats nor in other mammals. Two of them, MEG-RS and MEG-RL, descend from another cellular RNA, 5S rRNA; one (MEG-T2) is a tRNA-derived SINE; and MEG-TR is a hybrid tRNA/5S rRNA SINE. Insertion locus analysis suggests that these SINEs were active in the recent fruit bat evolution. Analysis of MEG-RS and MEG-RL in comparison with other few 5S rRNA-derived SINEs demonstrates that the internal RNA polymerase III promoter is their most invariant region, while the secondary structure is more variable. The mechanisms underlying the modular structure of these and other SINEs as well as their variation are discussed. The scenario of evolution of MEG SINEs is proposed.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., Moscow 119991, Russia
| | | | | |
Collapse
|
37
|
Tsuchimoto S, Hirao Y, Ohtsubo E, Ohtsubo H. New SINE families from rice, OsSN, with poly(A) at the 3' ends. Genes Genet Syst 2008; 83:227-36. [PMID: 18670134 DOI: 10.1266/ggs.83.227] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A database search of the sequences flanking a member of rice retrotransposon RIRE7 revealed that a 298-bp sequence in the region downstream of the member is a repetitive sequence interspersed in the genome of Oryza sativa cv. Nipponbare. Most of the repetitive sequences were flanked by a direct repeat of a target-site sequence, about 14 bp in length. The consensus sequence, 293 bp in length, had no regions encoding any proteins but had sequence motifs of an internal promoter of RNA polymerase III. These indicate that the sequence is a retroposon SINE, designated OsSN1 (Oryza sativa SINE1). OsSN1 is a new rice SINE, because it has no homology with any of the three p-SINE families previously identified from rice, and because it has a stretch of A at the 3' end, unlike p-SINE and any other Gramineae SINEs which have a stretch of T at the 3' end. The Nipponbare genome was found to have many members related to OsSN1, forming two additional new SINE families (designated OsSN2 and OsSN3). OsSN2 and OsSN3 are highly homologous to the 3' and 5' regions of OsSN1, respectively. This suggests that OsSN1 has a mosaic structure, which is generated by sequence exchange (or shuffling) between ancestral OsSN2 and OsSN3. Despite the absence of homology in the 3' regions between OsSN1 (or OsSN2) and OsSN3, a sequence, 5'-TTCTC-3', is commonly present in the region preceding the A stretch at the 3' end. This sequence together with the A stretch and a stem-loop structure found in the region near the A stretch are assumed to be important for retroposition. OsSN members were present in strains of Oryza species, as were p-SINE members. Some of the members showed insertion polymorphism at the respective loci among the rice strains. p-SINE had such polymorphic members, which are useful for classification and phylogenetic analysis of various strains of Oryza species. The polymorphic members of OsSN were more frequently found than those of p-SINE, and therefore, such members are likely to be useful for extensive taxonomic and phylogenetic studies on various rice strains.
Collapse
Affiliation(s)
- Suguru Tsuchimoto
- Institute of Molecular and Cellular Biosciences, the University of Tokyo, Tokyo, Japan.
| | | | | | | |
Collapse
|
38
|
Pouch-Pélissier MN, Pélissier T, Elmayan T, Vaucheret H, Boko D, Jantsch MF, Deragon JM. SINE RNA induces severe developmental defects in Arabidopsis thaliana and interacts with HYL1 (DRB1), a key member of the DCL1 complex. PLoS Genet 2008; 4:e1000096. [PMID: 18551175 PMCID: PMC2408557 DOI: 10.1371/journal.pgen.1000096] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2008] [Accepted: 05/13/2008] [Indexed: 11/18/2022] Open
Abstract
The proper temporal and spatial expression of genes during plant development is governed, in part, by the regulatory activities of various types of small RNAs produced by the different RNAi pathways. Here we report that transgenic Arabidopsis plants constitutively expressing the rapeseed SB1 SINE retroposon exhibit developmental defects resembling those observed in some RNAi mutants. We show that SB1 RNA interacts with HYL1 (DRB1), a double-stranded RNA-binding protein (dsRBP) that associates with the Dicer homologue DCL1 to produce microRNAs. RNase V1 protection assays mapped the binding site of HYL1 to a SB1 region that mimics the hairpin structure of microRNA precursors. We also show that HYL1, upon binding to RNA substrates, induces conformational changes that force single-stranded RNA regions to adopt a structured helix-like conformation. Xenopus laevis ADAR1, but not Arabidopsis DRB4, binds SB1 RNA in the same region as HYL1, suggesting that SINE RNAs bind only a subset of dsRBPs. Consistently, DCL4-DRB4-dependent miRNA accumulation was unchanged in SB1 transgenic Arabidopsis, whereas DCL1-HYL1-dependent miRNA and DCL1-HYL1-DCL4-DRB4-dependent tasiRNA accumulation was decreased. We propose that SINE RNA can modulate the activity of the RNAi pathways in plants and possibly in other eukaryotes. Short interspersed elements (SINEs) are transposable elements in eukaryotic genomes that mobilize through an RNA intermediate. Recently, mammalian SINE RNAs were shown to have roles as noncoding riboregulators in stress situations or in specific tissues. Mammalian SINE RNAs modulate the level of mRNAs and proteins by interacting with key proteins involved in gene transcription and translation. Here we show that constitutive production of a plant SINE RNA induces developmental defects in Arabidopsis thaliana and that this SINE RNA interacts with HYL1, a double-stranded RNA-binding protein required for the production of microRNA and trans-acting small interfering (tasi)RNA. We mapped the binding site of HYL1 to a SINE RNA region that mimics the hairpin structure of microRNA precursors. We also found that HYL1 induces conformational changes upon binding to RNA substrates. These data suggest that SINE RNAs modulate the activity of RNAi pathways in Arabidopsis.
Collapse
Affiliation(s)
| | - Thierry Pélissier
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, Perpignan, France
| | - Taline Elmayan
- INRA Laboratoire de Biologie Cellulaire, Versailles, France
| | | | - Drasko Boko
- Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Vienna, Austria
| | - Michael F. Jantsch
- Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Vienna, Austria
| | - Jean-Marc Deragon
- Université de Perpignan Via Domitia, CNRS UMR5096 LGDP, Perpignan, France
- * E-mail:
| |
Collapse
|
39
|
Cognat V, Deragon JM, Vinogradova E, Salinas T, Remacle C, Maréchal-Drouard L. On the evolution and expression of Chlamydomonas reinhardtii nucleus-encoded transfer RNA genes. Genetics 2008; 179:113-23. [PMID: 18493044 PMCID: PMC2390591 DOI: 10.1534/genetics.107.085688] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2007] [Accepted: 03/18/2008] [Indexed: 11/18/2022] Open
Abstract
In Chlamydomonas reinhardtii, 259 tRNA genes were identified and classified into 49 tRNA isoaccepting families. By constructing phylogenetic trees, we determined the evolutionary history for each tRNA gene family. The majority of the tRNA sequences are more closely related to their plant counterparts than to animals ones. Northern experiments also permitted us to show that at least one member of each tRNA isoacceptor family is transcribed and correctly processed in vivo. A short stretch of T residues known to be a signal for termination of polymerase III transcription was found downstream of most tRNA genes. It allowed us to propose that the vast majority of the tRNA genes are expressed and to confirm that numerous tRNA genes separated by short spacers are indeed cotranscribed. Interestingly, in silico analyses and hybridization experiments show that the cellular tRNA abundance is correlated with the number of tRNA genes and is adjusted to the codon usage to optimize translation efficiency. Finally, we studied the origin of SINEs, short interspersed elements related to tRNAs, whose presence in Chlamydomonas is exceptional. Phylogenetic analysis strongly suggests that tRNA(Asp)-related SINEs originate from a prokaryotic-type tRNA either horizontally transferred from a bacterium or originally present in mitochondria or chloroplasts.
Collapse
Affiliation(s)
- Valérie Cognat
- Institut de Biologie Moléculaire des Plantes, Unité Propre de Recherche 2357, Conventionnée avec l'Université Louis Pasteur (Strasbourg 1), Centre National de la Recherche Scientifique, 67084 Strasbourg Cedex, France
| | | | | | | | | | | |
Collapse
|
40
|
Weil C, Martienssen R. Epigenetic interactions between transposons and genes: lessons from plants. Curr Opin Genet Dev 2008; 18:188-92. [PMID: 18339541 DOI: 10.1016/j.gde.2008.01.015] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2008] [Revised: 01/18/2008] [Accepted: 01/18/2008] [Indexed: 11/17/2022]
Abstract
Transposons replicate, increase in copy number and persist in nature by moving, but insertion into genes is generally mutagenic. There is thus a strong selection for transposons that can achieve a balance between their own replication and minimal damage to their host. Epigenetic regulation proves to be a widespread way to achieve this balance, quieting transposition on the one hand, yet reversible on the other. As our understanding of epigenetics improves, the subtleties and the scope of how transposons can affect gene expression, both directly and indirectly, are becoming clearer.
Collapse
Affiliation(s)
- Cliff Weil
- Department of Agronomy, Purdue University, West Lafayette, IN, United States
| | | |
Collapse
|
41
|
Chan SWL, Zhang X, Bernatavichute YV, Jacobsen SE. Two-step recruitment of RNA-directed DNA methylation to tandem repeats. PLoS Biol 2006; 4:e363. [PMID: 17105345 PMCID: PMC1618421 DOI: 10.1371/journal.pbio.0040363] [Citation(s) in RCA: 137] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2006] [Accepted: 08/29/2006] [Indexed: 11/18/2022] Open
Abstract
Tandem repeat sequences are frequently associated with gene silencing phenomena. The Arabidopsis thaliana FWA gene contains two tandem repeats and is an efficient target for RNA-directed de novo DNA methylation when it is transformed into plants. We showed that the FWA tandem repeats are necessary and sufficient for de novo DNA methylation and that repeated character rather than intrinsic sequence is likely important. Endogenous FWA can adopt either of two stable epigenetic states: methylated and silenced or unmethylated and active. Surprisingly, we found small interfering RNAs (siRNAs) associated with FWA in both states. Despite this, only the methylated form of endogenous FWA could recruit further RNA-directed DNA methylation or cause efficient de novo methylation of transgenic FWA. This suggests that RNA-directed DNA methylation occurs in two steps: first, the initial recruitment of the siRNA-producing machinery, and second, siRNA-directed DNA methylation either in cis or in trans. The efficiency of this second step varies depending on the nature of the siRNA-producing locus, and at some loci, it may require pre-existing chromatin modifications such as DNA methylation itself. Enhancement of RNA-directed DNA methylation by pre-existing DNA methylation could create a self-reinforcing system to enhance the stability of silencing. Tandem repeats throughout the Arabidopsis genome produce siRNAs, suggesting that repeat acquisition may be a general mechanism for the evolution of gene silencing. InArabidopsis, methylation and silencing mediated by theFWA transgene are found to require the tandem repeat sequences upstream ofFWA and to be independent of siRNA association.
Collapse
Affiliation(s)
- Simon W.-L Chan
- Department of Molecular, Cell, and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Xiaoyu Zhang
- Department of Molecular, Cell, and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Yana V Bernatavichute
- Department of Molecular, Cell, and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
| | - Steven E Jacobsen
- Department of Molecular, Cell, and Developmental Biology, University of California Los Angeles, Los Angeles, California, United States of America
- Howard Hughes Medical Institute, University of California Los Angeles, Los Angeles, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|