1
|
Martin GT, Solares E, Guadardo-Mendez J, Muyle A, Bousios A, Gaut BS. miRNA-like secondary structures in maize ( Zea mays) genes and transposable elements correlate with small RNAs, methylation, and expression. Genome Res 2023; 33:1932-1946. [PMID: 37918960 PMCID: PMC10760457 DOI: 10.1101/gr.277459.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 10/16/2023] [Indexed: 11/04/2023]
Abstract
RNA molecules carry information in their primary sequence and also their secondary structure. Secondary structure can confer important functional information, but it is also a signal for an RNAi-like host epigenetic response mediated by small RNAs (smRNAs). In this study, we used two bioinformatic methods to predict local secondary structures across features of the maize genome, focusing on small regions that had similar folding properties to pre-miRNA loci. We found miRNA-like secondary structures to be common in genes and most, but not all, superfamilies of RNA and DNA transposable elements (TEs). The miRNA-like regions map to a higher diversity of smRNAs than regions without miRNA-like structure, explaining up to 27% of variation in smRNA mapping for some TE superfamilies. This mapping bias is more pronounced among putatively autonomous TEs relative to nonautonomous TEs. Genome-wide, miRNA-like regions are also associated with elevated methylation levels, particularly in the CHH context. Among genes, those with miRNA-like secondary structure are 1.5-fold more highly expressed, on average, than other genes. However, these genes are also more variably expressed across the 26 nested association mapping founder lines, and this variability positively correlates with the number of mapping smRNAs. We conclude that local miRNA-like structures are a nearly ubiquitous feature of expressed regions of the maize genome, that they correlate with higher smRNA mapping and methylation, and that they may represent a trade-off between functional requirements and the potentially negative consequences of smRNA production.
Collapse
Affiliation(s)
- Galen T Martin
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
| | - Edwin Solares
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
- Department of Ecology and Evolutionary Biology, University of California, Davis, California 95616, USA
| | - Jeanelle Guadardo-Mendez
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
| | - Aline Muyle
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
- CEFE, University of Montpellier, CNRS, EPHE, IRD, 34090 Montpellier, France
| | - Alexandros Bousios
- School of Life Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA;
| |
Collapse
|
2
|
Planta J, Liang YY, Xin H, Chansler MT, Prather LA, Jiang N, Jiang J, Childs KL. Chromosome-scale genome assemblies and annotations for Poales species Carex cristatella, Carex scoparia, Juncus effusus, and Juncus inflexus. G3 GENES|GENOMES|GENETICS 2022; 12:6670624. [PMID: 35976112 PMCID: PMC9526063 DOI: 10.1093/g3journal/jkac211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 07/18/2022] [Indexed: 12/03/2022]
Abstract
The majority of sequenced genomes in the monocots are from species belonging to Poaceae, which include many commercially important crops. Here, we expand the number of sequenced genomes from the monocots to include the genomes of 4 related cyperids: Carex cristatella and Carex scoparia from Cyperaceae and Juncus effusus and Juncus inflexus from Juncaceae. The high-quality, chromosome-scale genome sequences from these 4 cyperids were assembled by combining whole-genome shotgun sequencing of Nanopore long reads, Illumina short reads, and Hi-C sequencing data. Some members of the Cyperaceae and Juncaceae are known to possess holocentric chromosomes. We examined the repeat landscapes in our sequenced genomes to search for potential repeats associated with centromeres. Several large satellite repeat families, comprising 3.2–9.5% of our sequenced genomes, showed dispersed distribution of large satellite repeat clusters across all Carex chromosomes, with few instances of these repeats clustering in the same chromosomal regions. In contrast, most large Juncus satellite repeats were clustered in a single location on each chromosome, with sporadic instances of large satellite repeats throughout the Juncus genomes. Recognizable transposable elements account for about 20% of each of the 4 genome assemblies, with the Carex genomes containing more DNA transposons than retrotransposons while the converse is true for the Juncus genomes. These genome sequences and annotations will facilitate better comparative analysis within monocots.
Collapse
Affiliation(s)
- Jose Planta
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
- National Institute of Molecular Biology and Biotechnology, University of the Philippines , Diliman, Quezon City 1101, Philippines
| | - Yu-Ya Liang
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Haoyang Xin
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Matthew T Chansler
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - L Alan Prather
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Ning Jiang
- Department of Horticulture, MSU AgBioResearch, Michigan State University , East Lansing, MI 48824, USA
| | - Jiming Jiang
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
- Department of Horticulture, MSU AgBioResearch, Michigan State University , East Lansing, MI 48824, USA
| | - Kevin L Childs
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| |
Collapse
|
3
|
Dazenière J, Bousios A, Eyre-Walker A. Patterns of selection in the evolution of a transposable element. G3 GENES|GENOMES|GENETICS 2022; 12:6545286. [PMID: 35262706 PMCID: PMC9073684 DOI: 10.1093/g3journal/jkac056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 02/14/2022] [Indexed: 11/23/2022]
Abstract
Transposable elements are a major component of most eukaryotic genomes. Here, we present a new approach which allows us to study patterns of natural selection in the evolution of transposable elements over short time scales. The method uses the alignment of all elements with intact gag/pol genes of a transposable element family from a single genome. We predict that the ratio of nonsynonymous to synonymous variants in the alignment should decrease as a function of the frequency of the variants, because elements with nonsynonymous variants that reduce transposition will have fewer progeny. We apply our method to Sirevirus long-terminal repeat retrotransposons that are abundant in maize and other plant species and show that nonsynonymous to synonymous variants declines as variant frequency increases, indicating that negative selection is acting strongly on the Sirevirus genome. The asymptotic value of nonsynonymous to synonymous variants suggests that at least 85% of all nonsynonymous mutations in the transposable element reduce transposition. Crucially, these patterns in nonsynonymous to synonymous variants are only predicted to occur if the gene products from a particular transposable element insertion preferentially promote the transposition of the same insertion. Overall, by using large numbers of intact elements, this study sheds new light on the selective processes that act on transposable elements.
Collapse
Affiliation(s)
- Julie Dazenière
- School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9RH, UK
| | - Alexandros Bousios
- School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9RH, UK
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9RH, UK
| |
Collapse
|
4
|
Muyle A, Seymour D, Darzentas N, Primetis E, Gaut BS, Bousios A. Gene capture by transposable elements leads to epigenetic conflict in maize. MOLECULAR PLANT 2021; 14:237-252. [PMID: 33171302 DOI: 10.1016/j.molp.2020.11.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 10/15/2020] [Accepted: 11/05/2020] [Indexed: 06/11/2023]
Abstract
Transposable elements (TEs) regularly capture fragments of genes. When the host silences these TEs, siRNAs homologous to the captured regions may also target the genes. This epigenetic crosstalk establishes an intragenomic conflict: silencing the TEs has the cost of silencing the genes. If genes are important, however, natural selection may maintain function by moderating the silencing response, which may also advantage the TEs. In this study, we examined this model by focusing on Helitrons, Pack-MULEs, and Sirevirus LTR retrotransposons in the maize genome. We documented 1263 TEs containing exon fragments from 1629 donor genes. Consistent with epigenetic conflict, donor genes mapped more siRNAs and were more methylated than genes with no evidence of capture. However, these patterns differed between syntelog versus translocated donor genes. Syntelogs appeared to maintain function, as measured by gene expression, consistent with moderation of silencing for functionally important genes. Epigenetic marks did not spread beyond their captured regions and 24nt crosstalk siRNAs were linked with CHH methylation. Translocated genes, in contrast, bore the signature of silencing. They were highly methylated and less expressed, but also overrepresented among donor genes and located away from chromosomal arms, which suggests a link between capture and gene movement. Splitting genes into potential functional categories based on evolutionary constraint supported the synteny-based findings. TE families captured genes in different ways, but the evidence for their advantage was generally less obvious; nevertheless, TEs with captured fragments were older, mapped fewer siRNAs, and were slightly less methylated than TEs without captured fragments. Collectively, our results argue that TE capture triggers an intragenomic conflict that may not affect the function of important genes but may lead to the pseudogenization of less-constrained genes.
Collapse
Affiliation(s)
- Aline Muyle
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697, USA
| | - Danelle Seymour
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697, USA; Department of Botany and Plant Sciences, UC Riverside, Riverside, CA 92521, USA
| | - Nikos Darzentas
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Elias Primetis
- School of Life Sciences, University of Sussex, Brighton, UK
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697, USA.
| | | |
Collapse
|
5
|
Borredá C, Pérez-Román E, Ibanez V, Terol J, Talon M. Reprogramming of Retrotransposon Activity during Speciation of the Genus Citrus. Genome Biol Evol 2020; 11:3478-3495. [PMID: 31710678 PMCID: PMC7145672 DOI: 10.1093/gbe/evz246] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/04/2019] [Indexed: 12/13/2022] Open
Abstract
Speciation of the genus Citrus from a common ancestor has recently been established to begin ∼8 Ma during the late Miocene, a period of major climatic alterations. Here, we report the changes in activity of Citrus LTR retrotransposons during the process of diversification that gave rise to the current Citrus species. To reach this goal, we analyzed four pure species that diverged early during Citrus speciation, three recent admixtures derived from those species and an outgroup of the Citrus clade. More than 30,000 retrotransposons were grouped in ten linages. Estimations of LTR insertion times revealed that retrotransposon activity followed a species-specific pattern of change that could be ascribed to one of three different models. In some genomes, the expected pattern of gradual transposon accumulation was suddenly arrested during the radiation of the ancestor that gave birth to the current Citrus species. The individualized analyses of retrotransposon lineages showed that in each and every species studied, not all lineages follow the general pattern of the species itself. For instance, in most of the genomes, the retrotransposon activity of elements from the SIRE lineage reached its highest level just before Citrus speciation, while for Retrofit elements, it has been steadily growing. Based on these observations, we propose that Citrus retrotransposons may respond to stressful conditions driving speciation as a part of the genetic response involved in adaptation. This proposal implies that the evolving conditions of each species interact with the internal regulatory mechanisms of the genome controlling the proliferation of mobile elements.
Collapse
Affiliation(s)
- Carles Borredá
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
| | - Estela Pérez-Román
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
| | - Victoria Ibanez
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
| | - Javier Terol
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
| | - Manuel Talon
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias (IVIA), Valencia, Spain
| |
Collapse
|
6
|
Roessler K, Muyle A, Diez CM, Gaut GRJ, Bousios A, Stitzer MC, Seymour DK, Doebley JF, Liu Q, Gaut BS. The genome-wide dynamics of purging during selfing in maize. NATURE PLANTS 2019; 5:980-990. [PMID: 31477888 DOI: 10.1038/s41477-019-0508-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 07/26/2019] [Indexed: 05/07/2023]
Abstract
Self-fertilization (also known as selfing) is an important reproductive strategy in plants and a widely applied tool for plant genetics and plant breeding. Selfing can lead to inbreeding depression by uncovering recessive deleterious variants, unless these variants are purged by selection. Here we investigated the dynamics of purging in a set of eleven maize lines that were selfed for six generations. We show that heterozygous, putatively deleterious single nucleotide polymorphisms are preferentially lost from the genome during selfing. Deleterious single nucleotide polymorphisms were lost more rapidly in regions of high recombination, presumably because recombination increases the efficacy of selection by uncoupling linked variants. Overall, heterozygosity decreased more slowly than expected, by an estimated 35% to 40% per generation instead of the expected 50%, perhaps reflecting pervasive associative overdominance. Finally, three lines exhibited marked decreases in genome size due to the purging of transposable elements. Genome loss was more likely to occur for lineages that began with larger genomes with more transposable elements and chromosomal knobs. These three lines purged an average of 398 Mb from their genomes, an amount equivalent to three Arabidopsis thaliana genomes per lineage, in only a few generations.
Collapse
Affiliation(s)
- Kyria Roessler
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA, USA
| | - Aline Muyle
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA, USA
| | | | | | | | | | - Danelle K Seymour
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA, USA
| | - John F Doebley
- Department of Genetics, University of Wisconsin, Madison, WI, USA
| | - Qingpo Liu
- The Key Laboratory for Quality Improvement of Agricultural Products of Zheijang Province, College of Agriculture and Food Sciences, Zhejiang A&F University, Lin'an, Hangzhou, China.
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA, USA.
| |
Collapse
|
7
|
Neumann P, Novák P, Hoštáková N, Macas J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob DNA 2019; 10:1. [PMID: 30622655 PMCID: PMC6317226 DOI: 10.1186/s13100-018-0144-1] [Citation(s) in RCA: 189] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/20/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Plant LTR-retrotransposons are classified into two superfamilies, Ty1/copia and Ty3/gypsy. They are further divided into an enormous number of families which are, due to the high diversity of their nucleotide sequences, usually specific to a single or a group of closely related species. Previous attempts to group these families into broader categories reflecting their phylogenetic relationships were limited either to analyzing a narrow range of plant species or to analyzing a small numbers of elements. Furthermore, there is no reference database that allows for similarity based classification of LTR-retrotransposons. RESULTS We have assembled a database of retrotransposon encoded polyprotein domains sequences extracted from 5410 Ty1/copia elements and 8453 Ty3/gypsy elements sampled from 80 species representing major groups of green plants (Viridiplantae). Phylogenetic analysis of the three most conserved polyprotein domains (RT, RH and INT) led to dividing Ty1/copia and Ty3/gypsy retrotransposons into 16 and 14 lineages respectively. We also characterized various features of LTR-retrotransposon sequences including additional polyprotein domains, extra open reading frames and primer binding sites, and found that the occurrence and/or type of these features correlates with phylogenies inferred from the three protein domains. CONCLUSIONS We have established an improved classification system applicable to LTR-retrotransposons from a wide range of plant species. This system reflects phylogenetic relationships as well as distinct sequence and structural features of the elements. A comprehensive database of retrotransposon protein domains (REXdb) that reflects this classification provides a reference for efficient and unified annotation of LTR-retrotransposons in plant genomes. Access to REXdb related tools is implemented in the RepeatExplorer web server (https://repeatexplorer-elixir.cerit-sc.cz/) or using a standalone version of REXdb that can be downloaded seaparately from RepeatExplorer web page (http://repeatexplorer.org/).
Collapse
Affiliation(s)
- Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Nina Hoštáková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| |
Collapse
|
8
|
Roessler K, Bousios A, Meca E, Gaut BS. Modeling Interactions between Transposable Elements and the Plant Epigenetic Response: A Surprising Reliance on Element Retention. Genome Biol Evol 2018; 10:803-815. [PMID: 29608716 PMCID: PMC5841382 DOI: 10.1093/gbe/evy043] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/20/2018] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) compose the majority of angiosperm DNA. Plants counteract TE activity by silencing them epigenetically. One form of epigenetic silencing requires 21-22 nt small interfering RNAs that act to degrade TE mRNA and may also trigger DNA methylation. DNA methylation is reinforced by a second mechanism, the RNA-dependent DNA methylation (RdDM) pathway. RdDM relies on 24 nt small interfering RNAs and ultimately establishes TEs in a quiescent state. These host factors interact at a systems level, but there have been no system level analyses of their interactions. Here, we define a deterministic model that represents the propagation of active TEs, aspects of the host response and the accumulation of silenced TEs. We describe general properties of the model and also fit it to biological data in order to explore two questions. The first is why two overlapping pathways are maintained, given that both are likely energetically expensive. Under our model, RdDM silenced TEs effectively even when the initiation of silencing was weak. This relationship implies that only a small amount of RNAi is needed to initiate TE silencing, but reinforcement by RdDM is necessary to efficiently counter TE propagation. Second, we investigated the reliance of the host response on rates of TE deletion. The model predicted that low levels of deletion lead to few active TEs, suggesting that silencing is most efficient when methylated TEs are retained in the genome, thereby providing one explanation for the large size of plant genomes.
Collapse
Affiliation(s)
- Kyria Roessler
- Department of Ecology and Evolutionary Biology, UC Irvine
| | | | - Esteban Meca
- Departamento de Agronomia, Universidad de Cordoba, Spain
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine
| |
Collapse
|
9
|
Wicker T, Schulman AH, Tanskanen J, Spannagl M, Twardziok S, Mascher M, Springer NM, Li Q, Waugh R, Li C, Zhang G, Stein N, Mayer KFX, Gundlach H. The repetitive landscape of the 5100 Mbp barley genome. Mob DNA 2017; 8:22. [PMID: 29270235 PMCID: PMC5738225 DOI: 10.1186/s13100-017-0102-3] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 11/22/2017] [Indexed: 01/07/2023] Open
Abstract
Background While transposable elements (TEs) comprise the bulk of plant genomic DNA, how they contribute to genome structure and organization is still poorly understood. Especially in large genomes where TEs make the majority of genomic DNA, it is still unclear whether TEs target specific chromosomal regions or whether they simply accumulate where they are best tolerated. Results Here, we present an analysis of the repetitive fraction of the 5100 Mb barley genome, the largest angiosperm genome to have a near-complete sequence assembly. Genes make only about 2% of the genome, while over 80% is derived from TEs. The TE fraction is composed of at least 350 different families. However, 50% of the genome is comprised of only 15 high-copy TE families, while all other TE families are present in moderate or low copy numbers. We found that the barley genome is highly compartmentalized with different types of TEs occupying different chromosomal “niches”, such as distal, interstitial, or proximal regions of chromosome arms. Furthermore, gene space represents its own distinct genomic compartment that is enriched in small non-autonomous DNA transposons, suggesting that these TEs specifically target promoters and downstream regions. Furthermore, their presence in gene promoters is associated with decreased methylation levels. Conclusions Our data show that TEs are major determinants of overall chromosome structure. We hypothesize that many of the the various chromosomal distribution patterns are the result of TE families targeting specific niches, rather than them accumulating where they have the least deleterious effects. Electronic supplementary material The online version of this article (10.1186/s13100-017-0102-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, CH-8008 Zurich, Switzerland
| | - Alan H Schulman
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Green Technology, Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Jaakko Tanskanen
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Green Technology, Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Manuel Spannagl
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Sven Twardziok
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, 1479 Gortner Avenue, Saint Paul, MN 55108 USA
| | - Qing Li
- Department of Plant and Microbial Biology, University of Minnesota, 1479 Gortner Avenue, Saint Paul, MN 55108 USA.,Present address: National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070 China
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK.,School of Life Sciences, University of Dundee, Dundee, UK
| | - Chengdao Li
- Western Barley Genetics Alliance/the State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Murdoch, WA6150 Australia.,Department of Primary Industry and Regional Development, Government of Western Australia, South Perth, WA6155 Australia
| | - Guoping Zhang
- College of Agriculture and Biotechnology, Wuhan, ZU China
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany
| | - Klaus F X Mayer
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany.,TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Heidrun Gundlach
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| |
Collapse
|
10
|
Bousios A, Gaut BS, Darzentas N. Considerations and complications of mapping small RNA high-throughput data to transposable elements. Mob DNA 2017; 8:3. [PMID: 28228849 PMCID: PMC5311732 DOI: 10.1186/s13100-017-0086-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 01/31/2017] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND High-throughput sequencing (HTS) has revolutionized the way in which epigenetic research is conducted. When coupled with fully-sequenced genomes, millions of small RNA (sRNA) reads are mapped to regions of interest and the results scrutinized for clues about epigenetic mechanisms. However, this approach requires careful consideration in regards to experimental design, especially when one investigates repetitive parts of genomes such as transposable elements (TEs), or when such genomes are large, as is often the case in plants. RESULTS Here, in an attempt to shed light on complications of mapping sRNAs to TEs, we focus on the 2,300 Mb maize genome, 85% of which is derived from TEs, and scrutinize methodological strategies that are commonly employed in TE studies. These include choices for the reference dataset, the normalization of multiply mapping sRNAs, and the selection among sRNA metrics. We further examine how these choices influence the relationship between sRNAs and the critical feature of TE age, and contrast their effect on low copy genomic regions and other popular HTS data. CONCLUSIONS Based on our analyses, we share a series of take-home messages that may help with the design, implementation, and interpretation of high-throughput TE epigenetic studies specifically, but our conclusions may also apply to any work that involves analysis of HTS data.
Collapse
Affiliation(s)
- Alexandros Bousios
- School of Life Sciences, University of Sussex, Brighton, East Sussex BN1 9RH UK
| | - Brandon S. Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697 USA
| | - Nikos Darzentas
- Central European Institute of Technology, Masaryk University, Brno, 62500 Czech Republic
| |
Collapse
|
11
|
Bousios A, Gaut BS. Mechanistic and evolutionary questions about epigenetic conflicts between transposable elements and their plant hosts. CURRENT OPINION IN PLANT BIOLOGY 2016; 30:123-33. [PMID: 26950253 DOI: 10.1016/j.pbi.2016.02.009] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Revised: 02/16/2016] [Accepted: 02/17/2016] [Indexed: 05/02/2023]
Abstract
Transposable elements (TEs) constitute the majority of plant genomes, but most are epigenetically inactivated by their host. Research over the last decade has elucidated many of the molecular components that are required for TE silencing. In contrast, the evolutionary dynamics between TEs and silencing pathways are less clear. Here, we discuss current information about these dynamics from both mechanistic and evolutionary perspectives. We highlight new evidence that palindromic sequences within TEs may act as signals for host recognition and that cis-regulatory regions of TEs may be sites of ongoing arms races with host defenses. We also discuss patterns of TE aging after they are silenced; while there is not yet a consensus, it appears that TEs are removed more rapidly near genes, such that older TE insertions tend to be farther from genes. We conclude by discussing the energetic costs for maintaining silencing pathways, which appear to be substantive. The maintenance of silencing pathways across many species suggests that epigenetic emergencies are frequent.
Collapse
Affiliation(s)
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
12
|
Bousios A, Diez CM, Takuno S, Bystry V, Darzentas N, Gaut BS. A role for palindromic structures in the cis-region of maize Sirevirus LTRs in transposable element evolution and host epigenetic response. Genome Res 2015; 26:226-37. [PMID: 26631490 PMCID: PMC4728375 DOI: 10.1101/gr.193763.115] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2015] [Accepted: 12/01/2015] [Indexed: 01/06/2023]
Abstract
Transposable elements (TEs) proliferate within the genome of their host, which responds by silencing them epigenetically. Much is known about the mechanisms of silencing in plants, particularly the role of siRNAs in guiding DNA methylation. In contrast, little is known about siRNA targeting patterns along the length of TEs, yet this information may provide crucial insights into the dynamics between hosts and TEs. By focusing on 6456 carefully annotated, full-length Sirevirus LTR retrotransposons in maize, we show that their silencing associates with underlying characteristics of the TE sequence and also uncover three features of the host–TE interaction. First, siRNA mapping varies among families and among elements, but particularly along the length of elements. Within the cis-regulatory portion of the LTRs, a complex palindrome-rich region acts as a hotspot of both siRNA matching and sequence evolution. These patterns are consistent across leaf, tassel, and immature ear libraries, but particularly emphasized for floral tissues and 21- to 22-nt siRNAs. Second, this region has the ability to form hairpins, making it a potential template for the production of miRNA-like, hairpin-derived small RNAs. Third, Sireviruses are targeted by siRNAs as a decreasing function of their age, but the oldest elements remain highly targeted, partially by siRNAs that cross-map to the youngest elements. We show that the targeting of older Sireviruses reflects their conserved palindromes. Altogether, we hypothesize that the palindromes aid the silencing of active elements and influence transposition potential, siRNA targeting levels, and ultimately the fate of an element within the genome.
Collapse
Affiliation(s)
- Alexandros Bousios
- School of Life Sciences, University of Sussex, Brighton BN1 9RH, United Kingdom; Institute of Applied Biosciences, Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece
| | - Concepcion M Diez
- Department of Agronomy, University of Cordoba, 14014 Cordoba, Spain; Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, California 92697, USA
| | - Shohei Takuno
- SOKENDAI (Graduate University for Advanced Studies), Hayama, Kanagawa 240-0193, Japan
| | - Vojtech Bystry
- Central European Institute of Technology, Masaryk University, 62500 Brno, Czech Republic
| | - Nikos Darzentas
- Central European Institute of Technology, Masaryk University, 62500 Brno, Czech Republic
| | - Brandon S Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, California 92697, USA
| |
Collapse
|
13
|
|
14
|
|
15
|
Cai Z, Liu H, He Q, Pu M, Chen J, Lai J, Li X, Jin W. Differential genome evolution and speciation of Coix lacryma-jobi L. and Coix aquatica Roxb. hybrid guangxi revealed by repetitive sequence analysis and fine karyotyping. BMC Genomics 2014; 15:1025. [PMID: 25425126 PMCID: PMC4256728 DOI: 10.1186/1471-2164-15-1025] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Accepted: 11/19/2014] [Indexed: 02/07/2023] Open
Abstract
Abstract Background Coix, Sorghum and Zea are closely related plant genera in the subtribe Maydeae. Coix comprises 9–11 species with different ploidy levels (2n = 10, 20, 30, and 40). The exclusively cultivated C. lacryma-jobi L. (2n = 20) is widely used in East and Southeast Asia for food and medicinal applications. Three fertile cytotypes (2n = 10, 20, and 40) have been reported for C. aquatica Roxb. One sterile cytotype (2n = 30) closely related to C. aquatica has been recently found in Guangxi of China. This putative hybrid has been named C. aquatica HG (Hybrid Guangxi). The genome composition and the evolutionary history of C. lacryma-jobi and C. aquatica HG are largely unclear. Results About 76% of the genome of C. lacryma-jobi and 73% of the genome of C. aquatica HG are repetitive DNA sequences as shown by low coverage genome sequencing followed by similarity-based cluster analysis. In addition, long terminal repeat (LTR) retrotransposable elements are dominant repetitive sequences in these two genomes, and the proportions of many repetitive sequences in whole genome varied greatly between the two species, indicating evolutionary divergence of them. We also found that a novel 102 bp variant of centromeric satellite repeat CentX and two other satellites only appeared in C. aquatica HG. The results from FISH analysis with repeat probe cocktails and the data from chromosomes pairing in meiosis metaphase showed that C. lacryma-jobi is likely a diploidized paleotetraploid species and C. aquatica HG is possibly a recently formed hybrid. Furthermore, C. lacryma-jobi and C. aquatica HG shared more co-existing repeat families and higher sequence similarity with Sorghum than with Zea. Conclusions The composition and abundance of repetitive sequences are divergent between the genomes of C. lacryma-jobi and C. aquatica HG. The results from fine karyotyping analysis and chromosome pairing suggested diploidization of C. lacryma-jobi during evolution and C. aquatica HG is a recently formed hybrid. The genome-wide comparison of repetitive sequences indicated that the repeats in Coix were more similar to those in Sorghum than to those in Zea, which is consistent with the phylogenetic relationship reported by previous work. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1025) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Weiwei Jin
- National Maize Improvement Center of China, Beijing Key Laboratory of Crop Genetic Improvement, Coordinated Research Center for Crop Biology, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
16
|
Diez CM, Meca E, Tenaillon MI, Gaut BS. Three groups of transposable elements with contrasting copy number dynamics and host responses in the maize (Zea mays ssp. mays) genome. PLoS Genet 2014; 10:e1004298. [PMID: 24743518 PMCID: PMC3990487 DOI: 10.1371/journal.pgen.1004298] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Accepted: 02/21/2014] [Indexed: 12/19/2022] Open
Abstract
Most angiosperm nuclear DNA is repetitive and derived from silenced transposable elements (TEs). TE silencing requires substantial resources from the plant host, including the production of small interfering RNAs (siRNAs). Thus, the interaction between TEs and siRNAs is a critical aspect of both the function and the evolution of plant genomes. Yet the co-evolutionary dynamics between these two entities remain poorly characterized. Here we studied the organization of TEs within the maize (Zea mays ssp mays) genome, documenting that TEs fall within three groups based on the class and copy numbers. These groups included DNA elements, low copy RNA elements and higher copy RNA elements. The three groups varied statistically in characteristics that included length, location, age, siRNA expression and 24∶22 nucleotide (nt) siRNA targeting ratios. In addition, the low copy retroelements encompassed a set of TEs that had previously been shown to decrease expression within a 24 nt siRNA biogenesis mutant (mop1). To investigate the evolutionary dynamics of the three groups, we estimated their abundance in two landraces, one with a genome similar in size to that of the maize reference and the other with a 30% larger genome. For all three accessions, we assessed TE abundance as well as 22 nt and 24 nt siRNA content within leaves. The high copy number retroelements are under targeted similarly by siRNAs among accessions, appear to be born of a rapid bust of activity, and may be currently transpositionally dead or limited. In contrast, the lower copy number group of retrolements are targeted more dynamically and have had a long and ongoing history of transposition in the maize genome. Because transposable elements (TEs) constitute most angiosperm nuclear DNA, the interaction between TEs and their host genome is a key component for understanding the function and evolution of plant genomes. The diversity of the host response has been studied a great deal, including the biogenesis of small interfering RNAs (siRNAs) that target TEs for epigenetic modifications. However, little is known about variation in TE content among closely related genomes and whether siRNA expression tracks this variation. To that end, we surveyed both the copy number and the siRNA targeting of more than 1500 distinct TE subfamilies in the B73 maize reference genome. These surveys indicated that TE subfamilies fall naturally into three distinctive groups based on their class and copy number, but these groups also differ with respect to their location in the genome, their age, their expression and their siRNA regulation. The presence and consistency of these TE groups was also assessed in two genetically distant maize landraces with contrasting genome sizes. The variation in siRNA targeting across different TE groups and families, as well as the lack of correlation between TE and siRNA abundances, argues for the existence of multiple mechanisms and strategies for TE silencing.
Collapse
Affiliation(s)
- Concepcion M. Diez
- Dept. of Ecology and Evolutionary Biology, UC Irvine, Irvine, California, United States of America
- Departamento de Agronomía, Universidad de Córdoba, Campus de Excelencia Internacional Agroalimentario, ceiA3, Cordoba, Spain
| | - Esteban Meca
- Department of Mathematics, UC Irvine, Irvine, California, United States of America
| | - Maud I. Tenaillon
- CNRS, UMR de Génétique Végétale, INRA/CNRS/Univ Paris-Sud/AgroParisTech, Ferme du Moulon, Gif-sur-Yvette, France
| | - Brandon S. Gaut
- Dept. of Ecology and Evolutionary Biology, UC Irvine, Irvine, California, United States of America
- * E-mail:
| |
Collapse
|
17
|
Kolano B, Bednara E, Weiss-Schneeweiss H. Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae). PLANT CELL REPORTS 2013; 32:1575-1588. [PMID: 23754338 PMCID: PMC3778962 DOI: 10.1007/s00299-013-1468-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Revised: 04/30/2013] [Accepted: 05/28/2013] [Indexed: 05/29/2023]
Abstract
High heterogeneity was observed among conserved domains of reverse transcriptase ( rt ) isolated from quinoa. Only one Ty1- copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51 %) and 39 % of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.
Collapse
Affiliation(s)
- Bozena Kolano
- Department of Plant Anatomy and Cytology, University of Silesia, Jagiellonska 28, 40-032, Katowice, Poland,
| | | | | |
Collapse
|
18
|
Bousios A, Darzentas N. Sirevirus LTR retrotransposons: phylogenetic misconceptions in the plant world. Mob DNA 2013; 4:9. [PMID: 23452336 PMCID: PMC3599292 DOI: 10.1186/1759-8753-4-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 01/22/2013] [Indexed: 01/19/2023] Open
Abstract
Sireviruses are an ancient and plant-specific LTR retrotransposon genus. They possess a unique genome structure that is characterized by a plethora of highly conserved sequence motifs in key domains of the non-coding genome, and often, by the presence of an envelope-like gene. Recently, their crucial role in the organization of the maize genome, where Sireviruses occupy approximately 21% of its nuclear content, was revealed, followed by an analysis of their distribution across the plant kingdom. It is now suggested that Sireviruses have been a major mediator of the evolution of many plant genomes. However, the name ‘Sirevirus’ has caused confusion in the scientific community in regards to their classification within the LTR retrotransposon order and their relationship with viruses - a situation that is not unique to Sireviruses, but also affects other LTR retrotransposon genera. Here, we clarify the phylogenetic position of Sireviruses as typical LTR retrotransposons of the Copia superfamily and explain that the confusion stems from the discrepancy in the categorization of LTR retrotransposons by the two main classification systems: the International Committee on the Taxonomy of Viruses (ICTV) system and the unified classification system for eukaryotic transposable elements. While the name ‘Sirevirus’ has been given by ICTV, we show that the transposable element system, which is more suitable for eukaryotic genome studies, lacks an appropriate taxonomic level for describing them. We urge for this inconsistency to be addressed. Finally, we provide data suggesting that of the three ICTV-proposed genera of the Pseudoviridae (that is, Copia) family, only Sireviruses form a monophyletic group, while the phylogenetic distinction between Pseudoviruses and Hemiviruses is unclear. We conclude that because of their ongoing important contribution to the classification of transposable elements, these schemes need to be frequently revisited and revised - as shown by the example of the Sirevirus LTR retrotransposon genus.
Collapse
Affiliation(s)
- Alexandros Bousios
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki 57001, Greece.
| | | |
Collapse
|
19
|
Lisch D. Regulation of transposable elements in maize. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:511-516. [PMID: 22824142 DOI: 10.1016/j.pbi.2012.07.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2012] [Accepted: 07/04/2012] [Indexed: 06/01/2023]
Abstract
Maize is a typical plant with respect to the proportion of its genome that is composed of transposable elements (TEs), but it is unusual in the number of well-characterized active TEs that it hosts. This has made it possible to examine in some detail the factors responsible for regulating the activity of these elements, particularly the means by which they are recognized and epigenetically silenced. That analysis has revealed that TE silencing is a complex process that involves careful distinctions of different developmental times and tissue types. The available evidence from maize and other species suggests that these distinctions are made in order to generate information in somatic tissues that can be used to induce or reinforce silencing in germinal tissues.
Collapse
Affiliation(s)
- Damon Lisch
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, United States.
| |
Collapse
|
20
|
Kapazoglou A, Engineer C, Drosou V, Kalloniati C, Tani E, Tsaballa A, Kouri ED, Ganopoulos I, Flemetakis E, Tsaftaris AS. The study of two barley type I-like MADS-box genes as potential targets of epigenetic regulation during seed development. BMC PLANT BIOLOGY 2012; 12:166. [PMID: 22985436 PMCID: PMC3499179 DOI: 10.1186/1471-2229-12-166] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2012] [Accepted: 08/30/2012] [Indexed: 05/09/2023]
Abstract
BACKGROUND MADS-box genes constitute a large family of transcription factors functioning as key regulators of many processes during plant vegetative and reproductive development. Type II MADS-box genes have been intensively investigated and are mostly involved in vegetative and flowering development. A growing number of studies of Type I MADS-box genes in Arabidopsis, have assigned crucial roles for these genes in gamete and seed development and have demonstrated that a number of Type I MADS-box genes are epigenetically regulated by DNA methylation and histone modifications. However, reports on agronomically important cereals such as barley and wheat are scarce. RESULTS Here we report the identification and characterization of two Type I-like MADS-box genes, from barley (Hordeum vulgare), a monocot cereal crop of high agronomic importance. Protein sequence and phylogenetic analysis showed that the putative proteins are related to Type I MADS-box proteins, and classified them in a distinct cereal clade. Significant differences in gene expression among seed developmental stages and between barley cultivars with varying seed size were revealed for both genes. One of these genes was shown to be induced by the seed development- and stress-related hormones ABA and JA whereas in situ hybridizations localized the other gene to specific endosperm sub-compartments. The genomic organization of the latter has high conservation with the cereal Type I-like MADS-box homologues and the chromosomal position of both genes is close to markers associated with seed quality traits. DNA methylation differences are present in the upstream and downstream regulatory regions of the barley Type I-like MADS-box genes in two different developmental stages and in response to ABA treatment which may be associated with gene expression differences. CONCLUSIONS Two barley MADS-box genes were studied that are related to Type I MADS-box genes. Differential expression in different seed developmental stages as well as in barley cultivars with different seed size was evidenced for both genes. The two barley Type I MADS-box genes were found to be induced by ABA and JA. DNA methylation differences in different seed developmental stages and after exogenous application of ABA is suggestive of epigenetic regulation of gene expression. The study of barley Type I-like MADS-box genes extends our investigations of gene regulation during endosperm and seed development in a monocot crop like barley.
Collapse
Affiliation(s)
- Aliki Kapazoglou
- Institute of Agrobiotechnology (INA), CERTH, Thermi-Thessaloniki, GR-57001, Greece
| | - Cawas Engineer
- Institute of Agrobiotechnology (INA), CERTH, Thermi-Thessaloniki, GR-57001, Greece
| | - Vicky Drosou
- Institute of Agrobiotechnology (INA), CERTH, Thermi-Thessaloniki, GR-57001, Greece
| | - Chrysanthi Kalloniati
- Department of Agricultural Biotechnology, Agricultural University of Athens, Iera Odos 75, Athens, GR-11855, Greece
| | - Eleni Tani
- Institute of Agrobiotechnology (INA), CERTH, Thermi-Thessaloniki, GR-57001, Greece
| | - Aphrodite Tsaballa
- Department of Genetics and Plant Breeding, Aristotle University of Thessaloniki, Thessaloniki, GR-54124, Greece
| | - Evangelia D Kouri
- Department of Agricultural Biotechnology, Agricultural University of Athens, Iera Odos 75, Athens, GR-11855, Greece
| | - Ioannis Ganopoulos
- Department of Genetics and Plant Breeding, Aristotle University of Thessaloniki, Thessaloniki, GR-54124, Greece
| | - Emmanouil Flemetakis
- Department of Agricultural Biotechnology, Agricultural University of Athens, Iera Odos 75, Athens, GR-11855, Greece
| | - Athanasios S Tsaftaris
- Institute of Agrobiotechnology (INA), CERTH, Thermi-Thessaloniki, GR-57001, Greece
- Department of Genetics and Plant Breeding, Aristotle University of Thessaloniki, Thessaloniki, GR-54124, Greece
| |
Collapse
|
21
|
Bousios A, Minga E, Kalitsou N, Pantermali M, Tsaballa A, Darzentas N. MASiVEdb: the Sirevirus Plant Retrotransposon Database. BMC Genomics 2012; 13:158. [PMID: 22545773 PMCID: PMC3414828 DOI: 10.1186/1471-2164-13-158] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Accepted: 04/30/2012] [Indexed: 11/10/2022] Open
Abstract
Background Sireviruses are an ancient genus of the Copia superfamily of LTR retrotransposons, and the only one that has exclusively proliferated within plant genomes. Based on experimental data and phylogenetic analyses, Sireviruses have successfully infiltrated many branches of the plant kingdom, extensively colonizing the genomes of grass species. Notably, it was recently shown that they have been a major force in the make-up and evolution of the maize genome, where they currently occupy ~21% of the nuclear content and ~90% of the Copia population. It is highly likely, therefore, that their life dynamics have been fundamental in the genome composition and organization of a plethora of plant hosts. To assist studies into their impact on plant genome evolution and also facilitate accurate identification and annotation of transposable elements in sequencing projects, we developed MASiVEdb (Mapping and Analysis of SireVirus Elements Database), a collective and systematic resource of Sireviruses in plants. Description Taking advantage of the increasing availability of plant genomic sequences, and using an updated version of MASiVE, an algorithm specifically designed to identify Sireviruses based on their highly conserved genome structure, we populated MASiVEdb (http://bat.infspire.org/databases/masivedb/) with data on 16,243 intact Sireviruses (total length >158Mb) discovered in 11 fully-sequenced plant genomes. MASiVEdb is unlike any other transposable element database, providing a multitude of highly curated and detailed information on a specific genus across its hosts, such as complete set of coordinates, insertion age, and an analytical breakdown of the structure and gene complement of each element. All data are readily available through basic and advanced query interfaces, batch retrieval, and downloadable files. A purpose-built system is also offered for detecting and visualizing similarity between user sequences and Sireviruses, as well as for coding domain discovery and phylogenetic analysis. Conclusion MASiVEdb is currently the most comprehensive directory of Sireviruses, and as such complements other efforts in cataloguing plant transposable elements and elucidating their role in host genome evolution. Such insights will gradually deepen, as we plan to further improve MASiVEdb by phylogenetically mapping Sireviruses into families, by including data on fragments and solo LTRs, and by incorporating elements from newly-released genomes.
Collapse
Affiliation(s)
- Alexandros Bousios
- Institute of Agrobiotechnology, Centre for Research and Technology Hellas, Thessaloniki, 57001, Greece.
| | | | | | | | | | | |
Collapse
|
22
|
Grandbastien MA, Casacuberta JM. Plant Endogenous Retroviruses? A Case of Mysterious ORFs. PLANT TRANSPOSABLE ELEMENTS 2012. [PMCID: PMC7123213 DOI: 10.1007/978-3-642-31842-9_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | - Josep M. Casacuberta
- , Centre de Recerca en Agrigenomica (CRAG), CSIC-RTA-UAB, Barcelona, 08193 Spain
| |
Collapse
|