1
|
Neumann P, Novák P, Hoštáková N, Macas J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob DNA 2019; 10:1. [PMID: 30622655 PMCID: PMC6317226 DOI: 10.1186/s13100-018-0144-1] [Citation(s) in RCA: 189] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/20/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Plant LTR-retrotransposons are classified into two superfamilies, Ty1/copia and Ty3/gypsy. They are further divided into an enormous number of families which are, due to the high diversity of their nucleotide sequences, usually specific to a single or a group of closely related species. Previous attempts to group these families into broader categories reflecting their phylogenetic relationships were limited either to analyzing a narrow range of plant species or to analyzing a small numbers of elements. Furthermore, there is no reference database that allows for similarity based classification of LTR-retrotransposons. RESULTS We have assembled a database of retrotransposon encoded polyprotein domains sequences extracted from 5410 Ty1/copia elements and 8453 Ty3/gypsy elements sampled from 80 species representing major groups of green plants (Viridiplantae). Phylogenetic analysis of the three most conserved polyprotein domains (RT, RH and INT) led to dividing Ty1/copia and Ty3/gypsy retrotransposons into 16 and 14 lineages respectively. We also characterized various features of LTR-retrotransposon sequences including additional polyprotein domains, extra open reading frames and primer binding sites, and found that the occurrence and/or type of these features correlates with phylogenies inferred from the three protein domains. CONCLUSIONS We have established an improved classification system applicable to LTR-retrotransposons from a wide range of plant species. This system reflects phylogenetic relationships as well as distinct sequence and structural features of the elements. A comprehensive database of retrotransposon protein domains (REXdb) that reflects this classification provides a reference for efficient and unified annotation of LTR-retrotransposons in plant genomes. Access to REXdb related tools is implemented in the RepeatExplorer web server (https://repeatexplorer-elixir.cerit-sc.cz/) or using a standalone version of REXdb that can be downloaded seaparately from RepeatExplorer web page (http://repeatexplorer.org/).
Collapse
Affiliation(s)
- Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Nina Hoštáková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| | - Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
| |
Collapse
|
2
|
Variation in Copy Number of Ty3/Gypsy Centromeric Retrotransposons in the Genomes of Thinopyrum intermedium and Its Diploid Progenitors. PLoS One 2016; 11:e0154241. [PMID: 27119343 PMCID: PMC4847875 DOI: 10.1371/journal.pone.0154241] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2015] [Accepted: 04/11/2016] [Indexed: 01/20/2023] Open
Abstract
Speciation and allopolyploidization in cereals may be accompanied by dramatic changes in abundance of centromeric repeated transposable elements. Here we demonstrate that the reverse transcriptase part of Ty3/gypsy centromeric retrotransposon (RT-CR) is highly conservative in the segmental hexaploid Thinopyrum intermedium (JrJvsSt) and its possible diploid progenitors Th. bessarabicum (Jb), Pseudoroegneria spicata (St) and Dasypyrum villosum (V) but the abundance of the repeats varied to a large extent. Fluorescence in situ hybridization (FISH) showed hybridization signals in centromeric region of all chromosomes in the studied species, although the intensity of the signals drastically differed. In Th. intermedium, the strongest signal of RT-CR probe was detected on the chromosomes of Jv, intermediate on Jr and faint on Js and St subgenome suggesting different abundance of RT-CR on the individual chromosomes rather than the sequence specificity of RT-CRs of the subgenomes. RT-CR quantification using real-time PCR revealed that its content per genome in Th. bessarabicum is ~ 2 times and P. spicata is ~ 1,5 times higher than in genome of D. villosum. The possible burst of Ty3/gypsy centromeric retrotransposon in Th. intermedium during allopolyploidization and its role in proper mitotic and meiotic chromosome behavior in a nascent allopolyploid is discussed.
Collapse
|
3
|
Guo Y, Singh PK, Levin HL. A long terminal repeat retrotransposon of Schizosaccharomyces japonicus integrates upstream of RNA pol III transcribed genes. Mob DNA 2015; 6:19. [PMID: 26457121 PMCID: PMC4600332 DOI: 10.1186/s13100-015-0048-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Accepted: 09/22/2015] [Indexed: 01/29/2023] Open
Abstract
Background Transposable elements (TEs) are common constituents of centromeres. However, it is not known what causes this relationship. Schizosaccharomyces japonicus contains 10 families of Long Terminal Repeat (LTR)-retrotransposons and these elements cluster in centromeres and telomeres. In the related yeast, Schizosaccharomyces pombe LTR-retrotransposons Tf1 and Tf2 are distributed in the promoter regions of RNA pol II transcribed genes. Sequence analysis of TEs indicates that Tj1 of S. japonicus is related to Tf1 and Tf2, and uses the same mechanism of self-primed reverse transcription. Thus, we wondered why these related retrotransposons localized in different regions of the genome. Results To characterize the integration behavior of Tj1 we expressed it in S. pombe. We found Tj1 was active and capable of generating de novo integration in the chromosomes of S. pombe. The expression of Tj1 is similar to Type C retroviruses in that a stop codon at the end of Gag must be present for efficient integration. 17 inserts were sequenced, 13 occurred within 12 bp upstream of tRNA genes and 3 occurred at other RNA pol III transcribed genes. The link between Tj1 integration and RNA pol III transcription is reminiscent of Ty3, an LTR-retrotransposon of Saccharomyces cerevisiae that interacts with TFIIIB and integrates upstream of tRNA genes. Conclusion The integration of Tj1 upstream of tRNA genes and the centromeric clustering of tRNA genes in S. japonicus demonstrate that the clustering of this TE in centromere sequences is due to a unique pattern of integration.
Collapse
Affiliation(s)
- Yabin Guo
- Present address: University of Texas Southwestern Medical Center, Dallas, Texas USA
| | - Parmit Kumar Singh
- Section on Eukaryotic Transposable Elements, Program in Cellular Regulation and Metabolism, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Building 18 T, room 106, Bethesda, MD 20892 USA
| | - Henry L Levin
- Section on Eukaryotic Transposable Elements, Program in Cellular Regulation and Metabolism, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Building 18 T, room 106, Bethesda, MD 20892 USA
| |
Collapse
|
4
|
Abstract
Centromeric retrotransposons (CRs) constitute a family of plant retroelements, some of which have the ability to target their insertion almost exclusively to the functional centromeres. Our exhaustive analysis of CR family members in four grass genomes revealed not only horizontal transfer (HT) of CR elements between the oryzoid and panicoid grass lineages but also their subsequent recombination with endogenous elements that in some cases created prolific recombinants in foxtail millet and sorghum. HT events are easily identifiable only in cases where host genome divergence significantly predates HT, thus documented HT events likely represent only a fraction of the total. If the more difficult to detect ancient HT events occurred at frequencies similar to those observable in present day grasses, the extant long terminal repeat retrotransposons represent the mosaic products of HT and recombination that are optimized for retrotransposition in their host genomes. This complicates not only phylogenetic analysis but also the establishment of a meaningful retrotransposon nomenclature, which we have nevertheless attempted to implement here. In contrast to the plant-centric naming convention used currently for CR elements, we classify elements primarily based on their phylogenetic relationships regardless of host plant, using the exhaustively studied maize elements assigned to six different subfamilies as a standard. The CR2 subfamily is the most widely distributed of the six CR subfamilies discovered in grass genomes to date and thus the most likely to play a functional role at grass centromeres.
Collapse
Affiliation(s)
- Anupma Sharma
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Mānoa
| | - Gernot G Presting
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Mānoa
| |
Collapse
|
5
|
Schulman AH. Retrotransposon replication in plants. Curr Opin Virol 2013; 3:604-14. [PMID: 24035277 DOI: 10.1016/j.coviro.2013.08.009] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 08/16/2013] [Accepted: 08/19/2013] [Indexed: 12/31/2022]
Abstract
Retrotransposons comprise the bulk of large plant genomes, replicating via an RNA intermediate whereby the original, integrated element remains in place. Of the two main orders, the LTR retrotransposons considerably outnumber the LINEs. LINEs integrate into target sites simultaneously with the RNA transcript being copied into cDNA by target-primed reverse transcription. LTR retrotransposon replication is basically equivalent to the intracellular phase of retroviral life cycles. The envelope gene giving extracellular mobility to retroviruses is in fact widespread in plants and their retrotransposons. Evolutionary analyses of the retrotransposons and retroviruses suggest that both form an ancient monophyletic group. The particular adaptations of LTR retrotransposons to plant life cycles enabling their success remain to be clarified.
Collapse
Affiliation(s)
- Alan H Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, P.O. Box 65, Helsinki FIN-00014, Finland; Biotechnology and Food Research, MTT Agrifood Research Finland, Jokioinen FIN-31600, Finland.
| |
Collapse
|
6
|
Sharma A, Wolfgruber TK, Presting GG. Tandem repeats derived from centromeric retrotransposons. BMC Genomics 2013; 14:142. [PMID: 23452340 PMCID: PMC3648361 DOI: 10.1186/1471-2164-14-142] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Accepted: 02/23/2013] [Indexed: 12/26/2022] Open
Abstract
Background Tandem repeats are ubiquitous and abundant in higher eukaryotic genomes and constitute, along with transposable elements, much of DNA underlying centromeres and other heterochromatic domains. In maize, centromeric satellite repeat (CentC) and centromeric retrotransposons (CR), a class of Ty3/gypsy retrotransposons, are enriched at centromeres. Some satellite repeats have homology to retrotransposons and several mechanisms have been proposed to explain the expansion, contraction as well as homogenization of tandem repeats. However, the origin and evolution of tandem repeat loci remain largely unknown. Results CRM1TR and CRM4TR are novel tandem repeats that we show to be entirely derived from CR elements belonging to two different subfamilies, CRM1 and CRM4. Although these tandem repeats clearly originated in at least two separate events, they are derived from similar regions of their respective parent element, namely the long terminal repeat (LTR) and untranslated region (UTR). The 5′ ends of the monomer repeat units of CRM1TR and CRM4TR map to different locations within their respective LTRs, while their 3′ ends map to the same relative position within a conserved region of their UTRs. Based on the insertion times of heterologous retrotransposons that have inserted into these tandem repeats, amplification of the repeats is estimated to have begun at least ~4 (CRM1TR) and ~1 (CRM4TR) million years ago. Distinct CRM1TR sequence variants occupy the two CRM1TR loci, indicating that there is little or no movement of repeats between loci, even though they are separated by only ~1.4 Mb. Conclusions The discovery of two novel retrotransposon derived tandem repeats supports the conclusions from earlier studies that retrotransposons can give rise to tandem repeats in eukaryotic genomes. Analysis of monomers from two different CRM1TR loci shows that gene conversion is the major cause of sequence variation. We propose that successive intrastrand deletions generated the initial repeat structure, and gene conversions increased the size of each tandem repeat locus.
Collapse
|
7
|
Kokošar J, Kordiš D. Genesis and regulatory wiring of retroelement-derived domesticated genes: a phylogenomic perspective. Mol Biol Evol 2013; 30:1015-31. [PMID: 23348003 PMCID: PMC3670739 DOI: 10.1093/molbev/mst014] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Molecular domestications of transposable elements have occurred repeatedly during the evolution of eukaryotes. Vertebrates, especially mammals, possess numerous single copy domesticated genes (DGs) that have originated from the intronless multicopy transposable elements. However, the origin and evolution of the retroelement-derived DGs (RDDGs) that originated from Metaviridae has been only partially elucidated, due to absence of genome data or to limited analysis of a single family of DGs. We traced the genesis and regulatory wiring of the Metaviridae-derived DGs through phylogenomic analysis, using whole-genome information from more than 90 chordate genomes. Phylogenomic analysis of these DGs in chordate genomes provided direct evidence that major diversification has occurred in the ancestor of placental mammals. Mammalian RDDGs have been shown to originate in several steps by independent domestication events and to diversify later by gene duplications. Analysis of syntenic loci has shown that diverse RDDGs and their chromosomal positions were fully established in the ancestor of placental mammals. By analysis of active Metaviridae lineages in amniotes, we have demonstrated that RDDGs originated from retroelement remains. The chromosomal gene movements of RDDGs were highly dynamic only in the ancestor of placental mammals. During the domestication process, de novo acquisition of regulatory regions is shown to be a prerequisite for the survival of the DGs. The origin and evolution of de novo acquired promoters and untranslated regions in diverse mammalian RDDGs have been explained by comparative analysis of orthologous gene loci. The origin of placental mammal-specific innovations and adaptations, such as placenta and newly evolved brain functions, was most probably connected to the regulatory wiring of DGs and their rapid fixation in the ancestor of placental mammals.
Collapse
Affiliation(s)
- Janez Kokošar
- Department of Molecular and Biomedical Sciences, Josef Stefan Institute, Ljubljana, Slovenia
| | | |
Collapse
|
8
|
Chromodomains read the arginine code of post-translational targeting. Nat Struct Mol Biol 2012; 19:260-3. [PMID: 22231402 DOI: 10.1038/nsmb.2196] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2011] [Accepted: 11/07/2011] [Indexed: 11/08/2022]
Abstract
Chromodomains typically recruit protein complexes to chromatin and read the epigenetic histone code by recognizing lysine methylation in histone tails. We report the crystal structure of the chloroplast signal recognition particle (cpSRP) core from Arabidopsis thaliana, with the cpSRP54 tail comprising an arginine-rich motif bound to the second chromodomain of cpSRP43. A twinned aromatic cage reads out two neighboring nonmethylated arginines and adapts chromodomains to a non-nuclear function in post-translational targeting.
Collapse
|
9
|
Extensive intron gain in the ancestor of placental mammals. Biol Direct 2011; 6:59. [PMID: 22112745 PMCID: PMC3257199 DOI: 10.1186/1745-6150-6-59] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Accepted: 11/23/2011] [Indexed: 01/29/2023] Open
Abstract
Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The findings of this comprehensive study slightly challenge the current view on the evolutionary stasis in intron dynamics during the last 100 - 200 My. Domesticated genes could constitute an excellent system on which to analyse the mechanisms of intron gain in placental mammals. Reviewers: this article was reviewed by Dan Graur, Eugene V. Koonin and Jürgen Brosius.
Collapse
|
10
|
Abstract
The chromatin organization modifier domain (chromodomain) was first identified as a motif associated with chromatin silencing in Drosophila. There is growing evidence that chromodomains are evolutionary conserved across different eukaryotic species to control diverse aspects of epigenetic regulation. Although originally reported as histone H3 methyllysine readers, the chromodomain functions have now expanded to recognition of other histone and non-histone partners as well as interaction with nucleic acids. Chromodomain binding to a diverse group of targets is mediated by a conserved substructure called the chromobox homology region. This motif can be used to predict methyllysine binding and distinguish chromodomains from related Tudor "Royal" family members. In this review, we discuss and classify various chromodomains according to their context, structure and the mechanism of target recognition.
Collapse
Affiliation(s)
- Bartlomiej J Blus
- Diabetes and Obesity Research Center, Sanford-Burnham Medical Research Institute, Orlando, FL, USA
| | | | | |
Collapse
|
11
|
Abstract
All life must survive their corresponding viruses. Thus antiviral systems are essential in all living organisms. Remnants of virus derived information are also found in all life forms but have historically been considered mostly as junk DNA. However, such virus derived information can strongly affect host susceptibility to viruses. In this review, I evaluate the role viruses have had in the origin and evolution of host antiviral systems. From Archaea through bacteria and from simple to complex eukaryotes I trace the viral components that became essential elements of antiviral immunity. I conclude with a reexamination of the 'Big Bang' theory for the emergence of the adaptive immune system in vertebrates by horizontal transfer and note how viruses could have and did provide crucial and coordinated features.
Collapse
|
12
|
Viral ancestors of antiviral systems. Viruses 2011; 3:1933-58. [PMID: 22069523 PMCID: PMC3205389 DOI: 10.3390/v3101933] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2011] [Revised: 10/01/2011] [Accepted: 10/10/2011] [Indexed: 02/06/2023] Open
Abstract
All life must survive their corresponding viruses. Thus antiviral systems are essential in all living organisms. Remnants of virus derived information are also found in all life forms but have historically been considered mostly as junk DNA. However, such virus derived information can strongly affect host susceptibility to viruses. In this review, I evaluate the role viruses have had in the origin and evolution of host antiviral systems. From Archaea through bacteria and from simple to complex eukaryotes I trace the viral components that became essential elements of antiviral immunity. I conclude with a reexamination of the ‘Big Bang’ theory for the emergence of the adaptive immune system in vertebrates by horizontal transfer and note how viruses could have and did provide crucial and coordinated features.
Collapse
|
13
|
Plant centromeric retrotransposons: a structural and cytogenetic perspective. Mob DNA 2011; 2:4. [PMID: 21371312 PMCID: PMC3059260 DOI: 10.1186/1759-8753-2-4] [Citation(s) in RCA: 138] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2010] [Accepted: 03/03/2011] [Indexed: 12/12/2022] Open
Abstract
Background The centromeric and pericentromeric regions of plant chromosomes are colonized by Ty3/gypsy retrotransposons, which, on the basis of their reverse transcriptase sequences, form the chromovirus CRM clade. Despite their potential importance for centromere evolution and function, they have remained poorly characterized. In this work, we aimed to carry out a comprehensive survey of CRM clade elements with an emphasis on their diversity, structure, chromosomal distribution and transcriptional activity. Results We have surveyed a set of 190 CRM elements belonging to 81 different retrotransposon families, derived from 33 host species and falling into 12 plant families. The sequences at the C-terminus of their integrases were unexpectedly heterogeneous, despite the understanding that they are responsible for targeting to the centromere. This variation allowed the division of the CRM clade into the three groups A, B and C, and the members of each differed considerably with respect to their chromosomal distribution. The differences in chromosomal distribution coincided with variation in the integrase C-terminus sequences possessing a putative targeting domain (PTD). A majority of the group A elements possess the CR motif and are concentrated in the centromeric region, while members of group C have the type II chromodomain and are dispersed throughout the genome. Although representatives of the group B lack a PTD of any type, they appeared to be localized preferentially in the centromeres of tested species. All tested elements were found to be transcriptionally active. Conclusions Comprehensive analysis of the CRM clade elements showed that genuinely centromeric retrotransposons represent only a fraction of the CRM clade (group A). These centromeric retrotransposons represent an active component of centromeres of a wide range of angiosperm species, implying that they play an important role in plant centromere evolution. In addition, their transcriptional activity is consistent with the notion that the transcription of centromeric retrotransposons has a role in normal centromere function.
Collapse
|
14
|
Lisch D, Slotkin RK. Strategies for silencing and escape: the ancient struggle between transposable elements and their hosts. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2011; 292:119-52. [PMID: 22078960 DOI: 10.1016/b978-0-12-386033-0.00003-7] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the past several years, there has been an explosion in our understanding of the mechanisms by which plant transposable elements (TEs) are epigenetically silenced and maintained in an inactive state over long periods of time. This highly efficient process results in vast numbers of inactive TEs; indeed, the majority of many plant genomes are composed of these quiescent elements. This observation has led to the rather static view that TEs represent an essentially inert portion of plant genomes. However, recent work has demonstrated that TE silencing is a highly dynamic process that often involves transcription of TEs at particular times and places during plant development. Plants appear to use transcripts from silenced TEs as an ongoing source of information concerning the mobile portion of the genome. In contrast to our understanding of silencing pathways, we know relatively little about the ways in which TEs evade silencing. However, vast differences in TE content between even closely related plant species suggest that they are often wildly successful at doing so. Here, we discuss TE activity in plants as the result of a constantly shifting balance between host strategies for TE silencing and TE strategies for escape and amplification.
Collapse
Affiliation(s)
- Damon Lisch
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| | | |
Collapse
|
15
|
Gottlieb AM, Poggio L. Genomic screening in dioecious "yerba mate" tree (Ilex paraguariensis A. St. Hill., Aquifoliaceae) through representational difference analysis. Genetica 2010; 138:567-78. [PMID: 20221672 DOI: 10.1007/s10709-010-9449-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 09/25/2009] [Indexed: 01/31/2023]
Abstract
The "yerba mate" tree, Ilex paraguariensis, is a functionally dioecious crop species with economic relevance in several South American countries. We report a genomic screening accomplished through representational difference analysis (RDA) in male and female I. paraguariensis trees. The aim of the present paper was to investigate the occurrence of sex-related genomic differences in order to develop an early gender detection molecular method that could help reducing energy inputs during the "yerba mate" processing and that could be suitable for breeding programs. An intra-experiment redundancy was detected via SSCP analysis and sequence characterization. Taking together both reciprocal RDA assays, fragments isolated can be discriminated into three main categories. The first category of fragments shows spurious affinities with available deposited sequences and could be considered as specific to I. paraguariensis. The second category comprises sequences identified as organellar or ribosomal plant DNA. Sequences grouped in the third category involve clones akin to conserved domains of retrotransposons (RNaseH, integrases and/or chromodomains) from at least two distinct lineages of Ty3/Gypsy retrotransposons and one from Ty1/Copia retroelements, which in addition are associated to sex determination regions of the Solanaceae, Caricaceae and Salicaceae. A contig sequence was assembled that codes for an integrase core domain and a chromodomain. The phylogenetic analysis of the so-called IPRE (for I. paraguariensis retroelement) integrase domain indicates that it belongs to the Del lineage of the Chromoviridae. This is the first report of mobile elements isolated and detected from the "yerba mate" tree. Although RDA derived fragments, so far tested, have been retrieved from both sexes with similar sequences, association to sex related regions cannot be completely discarded. Implications of present results are further discussed.
Collapse
Affiliation(s)
- Alexandra Marina Gottlieb
- Laboratorio de Citogenética y Evolución (LaCyE), Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Intendente Güiraldes y Costanera Norte s/n, 4to. Piso, Pabellón II, Ciudad Universitaria, C1428EHA, Ciudad Autónoma de Buenos Aires, Argentina.
| | | |
Collapse
|
16
|
Villarreal LP. The source of self: genetic parasites and the origin of adaptive immunity. Ann N Y Acad Sci 2009; 1178:194-232. [PMID: 19845639 DOI: 10.1111/j.1749-6632.2009.05020.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Stable colonization of the host by viruses (genetic parasites) can alter the systems of host identity and provide immunity against related viruses. To attain the needed stability, some viruses of prokaryotes (P1 phage) use a strategy called an addiction module. The linked protective and destructive gene functions of an addiction module insures both virus persistence but will also destroy cells that interrupt this module and thereby prevent infection by competitors. Previously, I have generalized this concept to also include persistent and lytic states of virus infection, which can be considered as a virus addiction module. Such states often involve defective viruses. In this report, I examine the origin of the adaptive immune system from the perspective of a virus addiction module. The likely role of both endogenous and exogenous retroviruses, DNA viruses, and their defective elements is considered in the origin of all the basal components of adaptive immunity (T-cell receptor, RAG-mediated gene rearrangement, clonal lymphocyte proliferation, antigen surface presentation, apoptosis, and education of immune cells). It is concluded that colonization by viruses and their defectives provides a more coherent explanation for the origin of adaptive immunity.
Collapse
Affiliation(s)
- Luis P Villarreal
- Center for Virus Research, Department of Molecular Biology and Biochemistry, University of California, Irvine, California 92697, USA.
| |
Collapse
|
17
|
Nested Ty3-gypsy retrotransposons of a single Beta procumbens centromere contain a putative chromodomain. Chromosome Res 2009; 17:379-96. [PMID: 19322668 DOI: 10.1007/s10577-009-9029-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 01/12/2009] [Accepted: 01/12/2009] [Indexed: 12/18/2022]
Abstract
LTR retrotransposons belong to a major group of DNA sequences that are often localized in plant centromeres. Using BAC inserts originating from the centromere of a monosomic wild beet (Beta procumbens) chromosome fragment in Beta vulgaris, two complete LTR retrotransposons were identified. Both elements, designated Beetle1 and Beetle2, possess a coding region with genes in the order characteristic for Ty3-gypsy retrotransposons. Beetle1 and Beetle2 have a chromodomain in the C-terminus of the integrase gene and are highly similar to the centromeric retrotransposons (CRs) of rice, maize, and barley. Both retroelements were localized in the centromeric region of B. procumbens chromosomes by fluorescence in-situ hybridization. They can therefore be classified as centromere-specific chromoviruses. PCR analysis using RNA as template indicated that Beetle1 and Beetle2 are transcriptionally active. On the basis of the sequence diversity between the LTR sequences, it was estimated that Beetle1 and Beetle2 transposed within the last 60,000 years and 130,000 years, respectively. The centromeric localization of Beetle1 and Beetle2 and their transcriptional activity combined with high sequence conservation within each family play an important structural role in the centromeres of B. procumbens chromosomes.
Collapse
|
18
|
Koonin EV, Wolf YI, Nagasaki K, Dolja VV. The Big Bang of picorna-like virus evolution antedates the radiation of eukaryotic supergroups. Nat Rev Microbiol 2008; 6:925-39. [PMID: 18997823 DOI: 10.1038/nrmicro2030] [Citation(s) in RCA: 196] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The recent discovery of RNA viruses in diverse unicellular eukaryotes and developments in evolutionary genomics have provided the means for addressing the origin of eukaryotic RNA viruses. The phylogenetic analyses of RNA polymerases and helicases presented in this Analysis article reveal close evolutionary relationships between RNA viruses infecting hosts from the Chromalveolate and Excavate supergroups and distinct families of picorna-like viruses of plants and animals. Thus, diversification of picorna-like viruses probably occurred in a 'Big Bang' concomitant with key events of eukaryogenesis. The origins of the conserved genes of picorna-like viruses are traced to likely ancestors including bacterial group II retroelements, the family of HtrA proteases and DNA bacteriophages.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA
| | | | | | | |
Collapse
|
19
|
Llorens C, Fares MA, Moya A. Relationships of gag-pol diversity between Ty3/Gypsy and Retroviridae LTR retroelements and the three kings hypothesis. BMC Evol Biol 2008; 8:276. [PMID: 18842133 PMCID: PMC2577118 DOI: 10.1186/1471-2148-8-276] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2008] [Accepted: 10/08/2008] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The origin of vertebrate retroviruses (Retroviridae) is yet to be thoroughly investigated, but due to their similarity and identical gag-pol (and env) genome structure, it is accepted that they evolve from Ty3/Gypsy LTR retroelements the retrotransposons and retroviruses of plants, fungi and animals. These 2 groups of LTR retroelements code for 3 proteins rarely studied due to the high variability - gag polyprotein, protease and GPY/F module. In relation to 3 previously proposed Retroviridae classes I, II and II, investigation of the above proteins conclusively uncovers important insights regarding the ancient history of Ty3/Gypsy and Retroviridae LTR retroelements. RESULTS We performed a comprehensive study of 120 non-redundant Ty3/Gypsy and Retroviridae LTR retroelements. Phylogenetic reconstruction inferred based on the concatenated analysis of the gag and pol polyproteins shows a robust phylogenetic signal regarding the clustering of OTUs. Evaluation of gag and pol polyproteins separately yields discordant information. While pol signal supports the traditional perspective (2 monophyletic groups), gag polyprotein describes an alternative scenario where each Retroviridae class can be distantly related with one or more Ty3/Gypsy lineages. We investigated more in depth this evidence through comparative analyses performed based on the gag polyprotein, the protease and the GPY/F module. Our results indicate that contrary to the traditional monophyletic view of the origin of vertebrate retroviruses, the Retroviridae class I is a molecular fossil, preserving features that were probably predominant among Ty3/Gypsy ancestors predating the split of plants, fungi and animals. In contrast, classes II and III maintain other phenotypes that emerged more recently during Ty3/Gypsy evolution. CONCLUSION The 3 Retroviridae classes I, II and III exhibit phenotypic differences that delineate a network never before reported between Ty3/Gypsy and Retroviridae LTR retroelements. This new scenario reveals how the diversity of vertebrate retroviruses is polyphyletically recurrent into the Ty3/Gypsy evolution, i.e. older than previously thought. The simplest hypothesis to explain this finding is that classes I, II and III trace back to at least 3 Ty3/Gypsy ancestors that emerged at different evolutionary times prior to protostomes-deuterostomes divergence. We have called this "the three kings hypothesis" concerning the origin of vertebrate retroviruses.
Collapse
Affiliation(s)
- Carlos Llorens
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Polígono de la coma S/N, Paterna, Valencia, Spain
- Biotechvana, Parc Cientific, Universitat de Valencia, Paterna, Lab 16D Polígono de la coma S/N, Paterna, Valencia, Spain
| | - Mario A Fares
- Department of Genetics, University of Dublín, Trinity Collage Dublín, Dublín 2, Ireland
| | - Andres Moya
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Polígono de la coma S/N, Paterna, Valencia, Spain
- CIBER de Epidemiología y Sal ud Pública (CIBERESP), Spain
| |
Collapse
|
20
|
Gao X, Hou Y, Ebina H, Levin HL, Voytas DF. Chromodomains direct integration of retrotransposons to heterochromatin. Genome Res 2008; 18:359-69. [PMID: 18256242 DOI: 10.1101/gr.7146408] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The enrichment of mobile genetic elements in heterochromatin may be due, in part, to targeted integration. The chromoviruses are Ty3/gypsy retrotransposons with chromodomains at their integrase C termini. Chromodomains are logical determinants for targeting to heterochromatin, because the chromodomain of heterochromatin protein 1 (HP1) typically recognizes histone H3 K9 methylation, an epigenetic mark characteristic of heterochromatin. We describe three groups of chromoviruses based on amino acid sequence relationships of their integrase C termini. Genome sequence analysis indicates that representative chromoviruses from each group are enriched in gene-poor regions of the genome relative to other retrotransposons, and when fused to fluorescent marker proteins, the chromodomains target proteins to specific subnuclear foci coincident with heterochromatin. The chromodomain of the fungal element, MAGGY, interacts with histone H3 dimethyl- and trimethyl-K9, and when the MAGGY chromodomain is fused to integrase of the Schizosaccharomyces pombe Tf1 retrotransposon, new Tf1 insertions are directed to sites of H3 K9 methylation. Repetitive sequences such as transposable elements trigger the RNAi pathway resulting in their epigenetic modification. Our results suggest a dynamic interplay between retrotransposons and heterochromatin, wherein mobile elements recognize heterochromatin at the time of integration and then perpetuate the heterochromatic mark by triggering epigenetic modification.
Collapse
Affiliation(s)
- Xiang Gao
- Department of Genetics, Development & Cell Biology, Iowa State University, Ames, Iowa 50011, USA
| | | | | | | | | |
Collapse
|
21
|
Centromeric retrotransposon lineages predate the maize/rice divergence and differ in abundance and activity. Mol Genet Genomics 2007; 279:133-47. [PMID: 18000683 DOI: 10.1007/s00438-007-0302-5] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2007] [Accepted: 10/21/2007] [Indexed: 10/22/2022]
Abstract
Centromeric retrotransposons (CR) are located almost exclusively at the centromeres of plant chromosomes. Analysis of the emerging Zea mays inbred B73 genome sequence revealed two novel subfamilies of CR elements of maize (CRM), bringing the total number of known CRM subfamilies to four. Orthologous subfamilies of each of these CRM subfamilies were discovered in the rice lineage, and the orthologous relationships were demonstrated with extensive phylogenetic analyses. The much higher number of CRs in maize versus Oryza sativa is due primarily to the recent expansion of the CRM1 subfamily in maize. At least one incomplete copy of a CRM1 homolog was found in O. sativa ssp. indica and O. officinalis, but no member of this subfamily could be detected in the finished O. sativa ssp. japonica genome, implying loss of this prolific subfamily in that subspecies. CRM2 and CRM3, as well as the corresponding rice subfamilies, have been recently active but are present in low numbers. CRM3 is a full-length element related to the non-autonomous CentA, which is the first described CRM. The oldest subfamily (CRM4), as well as its rice counterpart, appears to contain only inactive members that are not located in currently active centromeres. The abundance of active CR elements is correlated with chromosome size in the three plant genomes for which high quality genomic sequence is available, and the emerging picture of CR elements is one in which different subfamilies are active at different evolutionary times. We propose a model by which CR elements might influence chromosome and genome size.
Collapse
|
22
|
Neumann P, Yan H, Jiang J. The centromeric retrotransposons of rice are transcribed and differentially processed by RNA interference. Genetics 2007; 176:749-61. [PMID: 17409063 PMCID: PMC1894605 DOI: 10.1534/genetics.107.071902] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Retrotransposons consist of significant portions of many complex eukaryotic genomes and are often enriched in heterochromatin. The centromeric retrotransposon (CR) family in grass species is colonized in the centromeres and highly conserved among species that have been diverged for >50 MY. These unique characteristics have inspired scientists to speculate about the roles of CR elements in organization and function of centromeric chromatin. Here we report that the CRR (CR of rice) elements in rice are highly enriched in chromatin associated with H3K9me2, a hallmark for heterochromatin. CRR elements were transcribed in root, leaf, and panicle tissues, suggesting a constitutive transcription of this retrotransposon family. However, the overall transcription level was low and the CRR transcripts appeared to be derived from relatively few loci. The majority of the CRR transcripts had chimerical structures and contained only partial CRR sequences. We detected small RNAs (smRNAs) cognate to nonautonomous CRR1 (noaCRR1) and CRR1, but not CRR2 elements. This result was also confirmed by in silico analysis of rice smRNA sequences. These results suggest that different CRR subfamilies may play different roles in the RNAi-mediated pathway for formation and maintenance of centromeric heterochromatin.
Collapse
Affiliation(s)
| | | | - Jiming Jiang
- Corresponding author: Department of Horticulture, University of Wisconsin, 1575 Linden Dr., Madison, WI 53706. E-mail:
| |
Collapse
|
23
|
Kordis D. A genomic perspective on the chromodomain-containing retrotransposons: Chromoviruses. Gene 2005; 347:161-73. [PMID: 15777633 DOI: 10.1016/j.gene.2004.12.017] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2004] [Revised: 12/01/2004] [Accepted: 12/07/2004] [Indexed: 12/31/2022]
Abstract
Chromoviruses, chromodomain-containing retrotransposons, are the only Metaviridae (Ty3/gypsy group of retrotransposons) clade with a Eukaryota-wide distribution. They have a common evolutionary origin and are the most prolific and diverse Metaviridae clade. The fusion of a retrotransposon and a chromodomain, was most probably responsible for their extreme evolutionary success in Eukaryota. Analysis of the massive amount of genome sequence data for different eukaryotic lineages has provided an in depth insight into the diversity, evolution, neofunctionalization, high rate of genomic turnover and origin of chromoviruses in Eukaryota. This review attempts to summarise the unique aspects of chromoviruses from a genomic perspective.
Collapse
Affiliation(s)
- Dusan Kordis
- Department of Biochemistry and Molecular Biology, Jozef Stefan Institute, Jamova 39, 1001 Ljubljana, Slovenia.
| |
Collapse
|