1
|
Kojima KK. Daidara: A gigantic Gypsy LTR retrotransposon lineage in the springtail Allacma fusca genome. Genes Cells 2023; 28:746-752. [PMID: 37650155 DOI: 10.1111/gtc.13062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/17/2023] [Accepted: 08/20/2023] [Indexed: 09/01/2023]
Abstract
Long terminal repeat (LTR) retrotransposons are the major contributor to genome size expansion, as in the cases of the maize genome or the axolotl genome. Despite their impact on the genome size, the length of each retrotransposon is limited, compared to DNA transposons, which sometimes exceed over 100 kb. The longest LTR retrotransposon known to date is Burro-1 from the planarian Schmidtea medierranea, which is around 35.7 kb long. Here through bioinformatics analysis, a new lineage of gigantic LTR retrotransposons, designated Daidara, is reported from the springtail Allacma fusca genome. Their entire length (25-33 kb) rivals Burro families, while their LTRs are shorter than 1.5 kb, in contrast to other gigantic LTR retrotransposon lineages Burro and Ogre, whose LTRs are around 5 kb long. Daidara encodes three core proteins corresponding to gag, pol, and an additional protein of unknown function. The phylogenetic analysis supports the independent gigantification of Daidara from Burro or Ogre.
Collapse
Affiliation(s)
- Kenji K Kojima
- Genetic Information Research Institute, Cupertino, California, USA
| |
Collapse
|
2
|
Ramakrishnan M, Papolu PK, Mullasseri S, Zhou M, Sharma A, Ahmad Z, Satheesh V, Kalendar R, Wei Q. The role of LTR retrotransposons in plant genetic engineering: how to control their transposition in the genome. PLANT CELL REPORTS 2023; 42:3-15. [PMID: 36401648 DOI: 10.1007/s00299-022-02945-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 10/23/2022] [Indexed: 06/16/2023]
Abstract
We briefly discuss that the similarity of LTR retrotransposons to retroviruses is a great opportunity for the development of a genetic engineering tool that exploits intragenic elements in the plant genome for plant genetic improvement. Long terminal repeat (LTR) retrotransposons are very similar to retroviruses but do not have the property of being infectious. While spreading between its host cells, a retrovirus inserts a DNA copy of its genome into the cells. The ability of retroviruses to cause infection with genome integration allows genes to be delivered to cells and tissues. Retrovirus vectors are, however, only specific to animals and insects, and, thus, are not relevant to plant genetic engineering. However, the similarity of LTR retrotransposons to retroviruses is an opportunity to explore the former as a tool for genetic engineering. Although recent long-read sequencing technologies have advanced the knowledge about transposable elements (TEs), the integration of TEs is still unable either to control them or to direct them to specific genomic locations. The use of existing intragenic elements to achieve the desired genome composition is better than using artificial constructs like vectors, but it is not yet clear how to control the process. Moreover, most LTR retrotransposons are inactive and unable to produce complete proteins. They are also highly mutable. In addition, it is impossible to find a full active copy of a LTR retrotransposon out of thousands of its own copies. Theoretically, if these elements were directly controlled and turned on or off using certain epigenetic mechanisms (inducing by stress or infection), LTR retrotransposons could be a great opportunity to develop a genetic engineering tool using intragenic elements in the plant genome. In this review, the recent developments in uncovering the nature of LTR retrotransposons and the possibility of using these intragenic elements as a tool for plant genetic engineering are briefly discussed.
Collapse
Affiliation(s)
- Muthusamy Ramakrishnan
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
| | - Pradeep K Papolu
- State Key Laboratory of Subtropical Silviculture, Institute of Bamboo Research, Zhejiang A&F University, Lin'an, Hangzhou, 311300, Zhejiang, China
| | - Sileesh Mullasseri
- Department of Zoology, St. Albert's College (Autonomous), Kochi, 682018, Kerala, India
| | - Mingbing Zhou
- State Key Laboratory of Subtropical Silviculture, Institute of Bamboo Research, Zhejiang A&F University, Lin'an, Hangzhou, 311300, Zhejiang, China
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-Efficiency Utilization, Zhejiang A&F University, Lin'an, Hangzhou, 311300, Zhejiang, China
| | - Anket Sharma
- State Key Laboratory of Subtropical Silviculture, Institute of Bamboo Research, Zhejiang A&F University, Lin'an, Hangzhou, 311300, Zhejiang, China
- Department of Plant Science and Landscape Architecture, University of Maryland, College Park, USA
| | - Zishan Ahmad
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China
| | - Viswanathan Satheesh
- Shanghai Center for Plant Stress Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Ruslan Kalendar
- Helsinki Institute of Life Science HiLIFE, University of Helsinki, Biocenter 3, Viikinkaari 1, F1-00014, Helsinki, Finland.
- Institute of Plant Biology and Biotechnology (IPBB), Timiryazev Street 45, 050040, Almaty, Kazakhstan.
| | - Qiang Wei
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China.
| |
Collapse
|
3
|
Neumann P, Novák P, Hoštáková N, Macas J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob DNA 2019; 10:1. [PMID: 30622655 PMCID: PMC6317226 DOI: 10.1186/s13100-018-0144-1] [Citation(s) in RCA: 198] [Impact Index Per Article: 39.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/20/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Plant LTR-retrotransposons are classified into two superfamilies, Ty1/copia and Ty3/gypsy. They are further divided into an enormous number of families which are, due to the high diversity of their nucleotide sequences, usually specific to a single or a group of closely related species. Previous attempts to group these families into broader categories reflecting their phylogenetic relationships were limited either to analyzing a narrow range of plant species or to analyzing a small numbers of elements. Furthermore, there is no reference database that allows for similarity based classification of LTR-retrotransposons. RESULTS We have assembled a database of retrotransposon encoded polyprotein domains sequences extracted from 5410 Ty1/copia elements and 8453 Ty3/gypsy elements sampled from 80 species representing major groups of green plants (Viridiplantae). Phylogenetic analysis of the three most conserved polyprotein domains (RT, RH and INT) led to dividing Ty1/copia and Ty3/gypsy retrotransposons into 16 and 14 lineages respectively. We also characterized various features of LTR-retrotransposon sequences including additional polyprotein domains, extra open reading frames and primer binding sites, and found that the occurrence and/or type of these features correlates with phylogenies inferred from the three protein domains. CONCLUSIONS We have established an improved classification system applicable to LTR-retrotransposons from a wide range of plant species. This system reflects phylogenetic relationships as well as distinct sequence and structural features of the elements. A comprehensive database of retrotransposon protein domains (REXdb) that reflects this classification provides a reference for efficient and unified annotation of LTR-retrotransposons in plant genomes. Access to REXdb related tools is implemented in the RepeatExplorer web server (https://repeatexplorer-elixir.cerit-sc.cz/) or using a standalone version of REXdb that can be downloaded seaparately from RepeatExplorer web page (http://repeatexplorer.org/).
Collapse
Affiliation(s)
- Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Nina Hoštáková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| |
Collapse
|
4
|
Macas J, Novák P, Pellicer J, Čížková J, Koblížková A, Neumann P, Fuková I, Doležel J, Kelly LJ, Leitch IJ. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae. PLoS One 2015; 10:e0143424. [PMID: 26606051 PMCID: PMC4659654 DOI: 10.1371/journal.pone.0143424] [Citation(s) in RCA: 118] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 11/04/2015] [Indexed: 01/30/2023] Open
Abstract
The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Collapse
Affiliation(s)
- Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
- * E-mail:
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaume Pellicer
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| | - Jana Čížková
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Andrea Koblížková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Iva Fuková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaroslav Doležel
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Laura J. Kelly
- School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom
| | - Ilia J. Leitch
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| |
Collapse
|
5
|
Ustyantsev K, Novikova O, Blinov A, Smyshlyaev G. Convergent evolution of ribonuclease h in LTR retrotransposons and retroviruses. Mol Biol Evol 2015; 32:1197-207. [PMID: 25605791 PMCID: PMC4408406 DOI: 10.1093/molbev/msv008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Ty3/Gypsy long terminals repeat (LTR) retrotransposons are structurally and phylogenetically close to retroviruses. Two notable structural differences between these groups of genetic elements are 1) the presence in retroviruses of an additional envelope gene, env, which mediates infection, and 2) a specific dual ribonuclease H (RNH) domain encoded by the retroviral pol gene. However, similar to retroviruses, many Ty3/Gypsy LTR retrotransposons harbor additional env-like genes, promoting concepts of the infective mode of these retrotransposons. Here, we provide a further line of evidence of similarity between retroviruses and some Ty3/Gypsy LTR retrotransposons. We identify that, together with their additional genes, plant Ty3/Gypsy LTR retrotransposons of the Tat group have a second RNH, as do retroviruses. Most importantly, we show that the resulting dual RNHs of Tat LTR retrotransposons and retroviruses emerged independently, providing strong evidence for their convergent evolution. The convergent resemblance of Tat LTR retrotransposons and retroviruses may indicate similar selection pressures acting on these diverse groups of elements and reveal potential evolutionary constraints on their structure. We speculate that dual RNH is required to accelerate retrotransposon evolution through increased rates of strand transfer events and subsequent recombination events.
Collapse
Affiliation(s)
- Kirill Ustyantsev
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Olga Novikova
- Department of Biological Sciences and RNA Institute, University at Albany
| | - Alexander Blinov
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Georgy Smyshlyaev
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
6
|
Kubat Z, Zluvova J, Vogel I, Kovacova V, Cermak T, Cegan R, Hobza R, Vyskot B, Kejnovsky E. Possible mechanisms responsible for absence of a retrotransposon family on a plant Y chromosome. THE NEW PHYTOLOGIST 2014; 202:662-678. [PMID: 24456522 DOI: 10.1111/nph.12669] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2013] [Accepted: 11/25/2013] [Indexed: 05/18/2023]
Abstract
Some transposable elements (TEs) show extraordinary variance in abundance along sex chromosomes but the mechanisms responsible for this variance are unknown. Here, we studied Ogre long terminal repeat (LTR) retrotransposons in Silene latifolia, a dioecious plant with evolutionarily young heteromorphic sex chromosomes. Ogre elements are ubiquitous in the S. latifolia genome but surprisingly absent on the Y chromosome. Bacterial artificial chromosome (BAC) library analysis and fluorescence in situ hybridization (FISH) were used to determine Ogre structure and chromosomal localization. Next generation sequencing (NGS) data were analysed to assess the transcription level and abundance of small RNAs. Methylation of Ogres was determined by bisulphite sequencing. Phylogenetic analysis was used to determine mobilization time and selection forces acting on Ogre elements. We characterized three Ogre families ubiquitous in the S. latifolia genome. One family is nearly absent on the Y chromosome despite all the families having similar structures and spreading mechanisms. We showed that Ogre retrotransposons evolved before sex chromosomes appeared but were mobilized after formation of the Y chromosome. Our data suggest that the absence of one Ogre family on the Y chromosome may be caused by 24-nucleotide (24-nt) small RNA-mediated silencing leading to female-specific spreading. Our findings highlight epigenetic silencing mechanisms as potentially crucial factors in sex-specific spreading of some TEs, but other possible mechanisms are also discussed.
Collapse
Affiliation(s)
- Zdenek Kubat
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
- Laboratory of Genome Dynamics, CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, 62500, Czech Republic
| | - Jitka Zluvova
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
| | - Ivan Vogel
- Laboratory of Genome Dynamics, CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, 62500, Czech Republic
| | - Viera Kovacova
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
| | - Tomas Cermak
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
| | - Radim Cegan
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
| | - Roman Hobza
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Sokolovska 6, Olomouc, 77200, Czech Republic
| | - Boris Vyskot
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
| | - Eduard Kejnovsky
- Department of Plant Developmental Genetics, Institute of Biophysics ASCR, Kralovopolska 135, Brno, 61200, Czech Republic
- Laboratory of Genome Dynamics, CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, 62500, Czech Republic
| |
Collapse
|
7
|
Chang W, Jääskeläinen M, Li SP, Schulman AH. BARE retrotransposons are translated and replicated via distinct RNA pools. PLoS One 2013; 8:e72270. [PMID: 23940808 PMCID: PMC3735527 DOI: 10.1371/journal.pone.0072270] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 07/14/2013] [Indexed: 01/02/2023] Open
Abstract
The replication of Long Terminal Repeat (LTR) retrotransposons, which can constitute over 80% of higher plant genomes, resembles that of retroviruses. A major question for retrotransposons and retroviruses is how the two conflicting roles of their transcripts, in translation and reverse transcription, are balanced. Here, we show that the BARE retrotransposon, despite its organization into just one open reading frame, produces three distinct classes of transcripts. One is capped, polyadenylated, and translated, but cannot be copied into cDNA. The second is not capped or polyadenylated, but is destined for packaging and ultimate reverse transcription. The third class is capped, polyadenylated, and spliced to favor production of a subgenomic RNA encoding only Gag, the protein forming virus-like particles. Moreover, the BARE2 subfamily, which cannot synthesize Gag and is parasitic on BARE1, does not produce the spliced sub-genomic RNA for translation but does make the replication competent transcripts, which are packaged into BARE1 particles. To our knowledge, this is first demonstration of distinct RNA pools for translation and transcription for any retrotransposon.
Collapse
Affiliation(s)
- Wei Chang
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
| | - Marko Jääskeläinen
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
| | - Song-ping Li
- Genome-Scale Biology Program, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Alan H. Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
- Biotechnology and Food Research, MTT Agrifood Research Finland, Jokioinen, Finland
- * E-mail:
| |
Collapse
|
8
|
Steinbauerová V, Neumann P, Novák P, Macas J. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons. Genetica 2012; 139:1543-55. [PMID: 22544262 DOI: 10.1007/s10709-012-9654-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/16/2012] [Indexed: 01/21/2023]
Abstract
Long terminal repeat (LTR) retrotransposons make up substantial parts of most higher plant genomes where they accumulate due to their replicative mode of transposition. Although the transposition is facilitated by proteins encoded within the gag-pol region which is common to all autonomous elements, some LTR retrotransposons were found to potentially carry an additional protein coding capacity represented by extra open reading frames located upstream or downstream of gag-pol. In this study, we performed a comprehensive in silico survey and comparative analysis of these extra open reading frames (ORFs) in the group of Ty3/gypsy LTR retrotransposons as the first step towards our understanding of their origin and function. We found that extra ORFs occur in all three major lineages of plant Ty3/gypsy elements, being the most frequent in the Tat lineage where most (77 %) of identified elements contained extra ORFs. This lineage was also characterized by the highest diversity of extra ORF arrangement (position and orientation) within the elements. On the other hand, all of these ORFs could be classified into only two broad groups based on their mutual similarities or the presence of short conserved motifs in their inferred protein sequences. In the Athila lineage, the extra ORFs were confined to the element 3' regions but they displayed much higher sequence diversity compared to those found in Tat. In the lineage of Chromoviruses the extra ORFs were relatively rare, occurring only in 5' regions of a group of elements present in a single plant family (Poaceae). In all three lineages, most extra ORFs lacked sequence similarities to characterized gene sequences or functional protein domains, except for two Athila-like elements with similarities to LOGL4 gene and part of the Chromoviruses extra ORFs that displayed partial similarity to histone H3 gene. Thus, in these cases the extra ORFs most likely originated by transduction or recombination of cellular gene sequences. In addition, the protein domain which is otherwise associated with DNA transposons have been detected in part of the Tat-like extra ORFs, pointing to their origin from an insertion event of a mobile element.
Collapse
Affiliation(s)
- Veronika Steinbauerová
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branišovská 31, Ceske Budejovice, Czech Republic
| | | | | | | |
Collapse
|
9
|
|