1
|
Gao D. Introduction of Plant Transposon Annotation for Beginners. BIOLOGY 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that contribute large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires necessary expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also bring difficulties for genome-wide transposon discovery. This review summarizes the three major strategies for transposon detection including repeat-based, structure-based, and homology-based annotation, and introduces the transposon superfamilies identified in plants thus far, and some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review also discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
| |
Collapse
|
2
|
Goes CAG, dos Santos N, Rodrigues PHDM, Stornioli JHF, da Silva AB, dos Santos RZ, Vidal JAD, Silva DMZDA, Artoni RF, Foresti F, Hashimoto DT, Porto-Foresti F, Utsunomia R. The Satellite DNA Catalogues of Two Serrasalmidae (Teleostei, Characiformes): Conservation of General satDNA Features over 30 Million Years. Genes (Basel) 2022; 14:91. [PMID: 36672835 PMCID: PMC9859320 DOI: 10.3390/genes14010091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 12/08/2022] [Accepted: 12/24/2022] [Indexed: 12/31/2022] Open
Abstract
Satellite DNAs (satDNAs) are tandemly repeated sequences that are usually located on the heterochromatin, and the entire collection of satDNAs within a genome is called satellitome. Primarily, these sequences are not under selective pressure and evolve by concerted evolution, resulting in elevated rates of divergence between the satDNA profiles of reproductive isolated species/populations. Here, we characterized two additional satellitomes of Characiformes fish (Colossoma macropomum and Piaractus mesopotamicus) that diverged approximately 30 million years ago, while still retaining conserved karyotype features. The results we obtained indicated that several satDNAs (50% of satellite sequences in P. mesopotamicus and 43% in C. macropomum) show levels of conservation between the analyzed species, in the nucleotide and chromosomal levels. We propose that long-life cycles and few genomic changes could slow down rates of satDNA differentiation.
Collapse
Affiliation(s)
| | - Natalia dos Santos
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
| | | | - José Henrique Forte Stornioli
- Institute of Biological Sciences and Health, Federal Rural University of Rio de Janeiro, Seropédica 23890-000, RJ, Brazil
| | - Amanda Bueno da Silva
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
| | | | - Jhon Alex Dziechciarz Vidal
- Department of Structural, Molecular and Genetic Biology, State University of Ponta Grossa, Ponta Grossa 84030-900, PR, Brazil
- Department of Genetics and Evolution, Federal University of São Carlos, São Carlos 13565-905, SP, Brazil
| | | | - Roberto Ferreira Artoni
- Department of Structural, Molecular and Genetic Biology, State University of Ponta Grossa, Ponta Grossa 84030-900, PR, Brazil
- Department of Genetics and Evolution, Federal University of São Carlos, São Carlos 13565-905, SP, Brazil
| | - Fausto Foresti
- Department of Structural and Functional Biology, Institute of Biosciences, São Paulo State University, Botucatu 18618-970, SP, Brazil
| | - Diogo Teruo Hashimoto
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| | - Fábio Porto-Foresti
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| | - Ricardo Utsunomia
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
- Institute of Biological Sciences and Health, Federal Rural University of Rio de Janeiro, Seropédica 23890-000, RJ, Brazil
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| |
Collapse
|
3
|
Papolu PK, Ramakrishnan M, Mullasseri S, Kalendar R, Wei Q, Zou L, Ahmad Z, Vinod KK, Yang P, Zhou M. Retrotransposons: How the continuous evolutionary front shapes plant genomes for response to heat stress. FRONTIERS IN PLANT SCIENCE 2022; 13:1064847. [PMID: 36570931 PMCID: PMC9780303 DOI: 10.3389/fpls.2022.1064847] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 11/21/2022] [Indexed: 05/28/2023]
Abstract
Long terminal repeat retrotransposons (LTR retrotransposons) are the most abundant group of mobile genetic elements in eukaryotic genomes and are essential in organizing genomic architecture and phenotypic variations. The diverse families of retrotransposons are related to retroviruses. As retrotransposable elements are dispersed and ubiquitous, their "copy-out and paste-in" life cycle of replicative transposition leads to new genome insertions without the excision of the original element. The overall structure of retrotransposons and the domains responsible for the various phases of their replication is highly conserved in all eukaryotes. The two major superfamilies of LTR retrotransposons, Ty1/Copia and Ty3/Gypsy, are distinguished and dispersed across the chromosomes of higher plants. Members of these superfamilies can increase in copy number and are often activated by various biotic and abiotic stresses due to retrotransposition bursts. LTR retrotransposons are important drivers of species diversity and exhibit great variety in structure, size, and mechanisms of transposition, making them important putative actors in genome evolution. Additionally, LTR retrotransposons influence the gene expression patterns of adjacent genes by modulating potential small interfering RNA (siRNA) and RNA-directed DNA methylation (RdDM) pathways. Furthermore, comparative and evolutionary analysis of the most important crop genome sequences and advanced technologies have elucidated the epigenetics and structural and functional modifications driven by LTR retrotransposon during speciation. However, mechanistic insights into LTR retrotransposons remain obscure in plant development due to a lack of advancement in high throughput technologies. In this review, we focus on the key role of LTR retrotransposons response in plants during heat stress, the role of centromeric LTR retrotransposons, and the role of LTR retrotransposon markers in genome expression and evolution.
Collapse
Affiliation(s)
- Pradeep K. Papolu
- State Key Laboratory of Subtropical Silviculture, Bamboo Industry Institute, Zhejiang A&F University, Hangzhou, Zhejiang, China
| | - Muthusamy Ramakrishnan
- State Key Laboratory of Subtropical Silviculture, Bamboo Industry Institute, Zhejiang A&F University, Hangzhou, Zhejiang, China
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Sileesh Mullasseri
- Department of Zoology, St. Albert’s College (Autonomous), Kochi, Kerala, India
| | - Ruslan Kalendar
- Helsinki Institute of Life Science HiLIFE, Biocenter 3, University of Helsinki, Helsinki, Finland
- National Laboratory Astana, Nazarbayev University, Astana, Kazakhstan
| | - Qiang Wei
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Long−Hai Zou
- State Key Laboratory of Subtropical Silviculture, Bamboo Industry Institute, Zhejiang A&F University, Hangzhou, Zhejiang, China
| | - Zishan Ahmad
- Co-Innovation Center for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, College of Biology and the Environment, Nanjing Forestry University, Nanjing, Jiangsu, China
| | | | - Ping Yang
- State Key Laboratory of Subtropical Silviculture, Bamboo Industry Institute, Zhejiang A&F University, Hangzhou, Zhejiang, China
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-Efficiency Utilization, Zhejiang A&F University, Hangzhou, Zhejiang, China
| | - Mingbing Zhou
- State Key Laboratory of Subtropical Silviculture, Bamboo Industry Institute, Zhejiang A&F University, Hangzhou, Zhejiang, China
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-Efficiency Utilization, Zhejiang A&F University, Hangzhou, Zhejiang, China
| |
Collapse
|
4
|
The first long-read nuclear genome assembly of Oryza australiensis, a wild rice from northern Australia. Sci Rep 2022; 12:10823. [PMID: 35752642 PMCID: PMC9233661 DOI: 10.1038/s41598-022-14893-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 06/14/2022] [Indexed: 11/17/2022] Open
Abstract
Oryza australiensis is a wild rice native to monsoonal northern Australia. The International Oryza Map Alignment Project emphasises its significance as the sole representative of the EE genome clade. Assembly of the O. australiensis genome has previously been challenging due to its high Long Terminal Repeat (LTR) retrotransposon (RT) content. Oxford Nanopore long reads were combined with Illumina short reads to generate a high-quality ~ 858 Mbp genome assembly within 850 contigs with 46× long read coverage. Reference-guided scaffolding increased genome contiguity, placing 88.2% of contigs into 12 pseudomolecules. After alignment to the Oryza sativa cv. Nipponbare genome, we observed several structural variations. PacBio Iso-Seq data were generated for five distinct tissues to improve the functional annotation of 34,587 protein-coding genes and 42,329 transcripts. We also report SNV numbers for three additional O. australiensis genotypes based on Illumina re-sequencing. Although genetic similarity reflected geographical separation, the density of SNVs also correlated with our previous report on variations in salinity tolerance. This genome re-confirms the genetic remoteness of the O. australiensis lineage within the O. officinalis genome complex. Assembly of a high-quality genome for O. australiensis provides an important resource for the discovery of critical genes involved in development and stress tolerance.
Collapse
|
5
|
Gao D, Caspersen AM, Hu G, Bockelman HE, Chen X. A Novel Mutator-Like Transposable Elements With Unusual Structure and Recent Transpositions in Barley ( Hordeum vulgare). FRONTIERS IN PLANT SCIENCE 2022; 13:904619. [PMID: 35677233 PMCID: PMC9168764 DOI: 10.3389/fpls.2022.904619] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 05/03/2022] [Indexed: 06/15/2023]
Abstract
Mutator-like transposable elements (MULEs) represent a unique superfamily of DNA transposons as they can capture host genes and cause higher frequency of mutations in some eukaryotes. Despite their essential roles in plant evolution and functional genomics, MULEs are not fully understood yet in many important crops including barley (Hordeum vulgare). In this study, we analyzed the barley genome and identified a new mutator transposon Hvu_Abermu. This transposon is present at extremely high copy number in barley and shows unusual structure as it contains three open reading frames (ORFs) including one ORF (ORF1) encoding mutator transposase protein and one ORF (ORFR) showing opposite transcriptional orientation. We identified homologous sequences of Hvu_Abermu in both monocots and dicots and grouped them into a large mutator family named Abermu. Abermu transposons from different species share significant sequence identity, but they exhibit distinct sequence structures. Unlike the transposase proteins which are highly conserved between Abermu transposons from different organisms, the ORFR-encoded proteins are quite different from distant species. Phylogenetic analysis indicated that Abermu transposons shared closer evolutionary relationships with the maize MuDR transposon than other reported MULEs. We also found phylogenetic incongruence for the Abermu transposons identified in rice and its wild species implying the possibility of horizontal transfer of transposon. Further comparison indicated that over 200 barley genes contain Abermu-related sequences. We analyzed the barley pan genomes and detected polymorphic Hvu_Abermu transposons between the sequenced 23 wild and cultivated barley genomes. Our efforts identified a novel mutator transposon and revealed its recent transposition activity, which may help to develop genetic tools for barley and other crops.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, United States
| | - Ann M. Caspersen
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, United States
| | - Gongshe Hu
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, United States
| | - Harold E. Bockelman
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, United States
| | - Xianming Chen
- Wheat Health, Genetics, and Quality Research Unit, USDA-ARS, Pullman, WA, United States
| |
Collapse
|
6
|
Gao D, Nascimento EFMB, Leal-Bertioli SCM, Abernathy B, Jackson SA, Araujo ACG, Bertioli DJ. TAR30, a homolog of the canonical plant TTTAGGG telomeric repeat, is enriched in the proximal chromosome regions of peanut (Arachis hypogaea L.). Chromosome Res 2022; 30:77-90. [DOI: 10.1007/s10577-022-09684-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 01/07/2022] [Accepted: 01/11/2022] [Indexed: 11/03/2022]
|
7
|
Gao D, Chu Y, Xia H, Xu C, Heyduk K, Abernathy B, Ozias-Akins P, Leebens-Mack JH, Jackson SA. Horizontal Transfer of Non-LTR Retrotransposons from Arthropods to Flowering Plants. Mol Biol Evol 2019; 35:354-364. [PMID: 29069493 PMCID: PMC5850137 DOI: 10.1093/molbev/msx275] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Even though lateral movements of transposons across families and even phyla within multicellular eukaryotic kingdoms have been found, little is known about transposon transfer between the kingdoms Animalia and Plantae. We discovered a novel non-LTR retrotransposon, AdLINE3, in a wild peanut species. Sequence comparisons and phylogenetic analyses indicated that AdLINE3 is a member of the RTE clade, originally identified in a nematode and rarely reported in plants. We identified RTE elements in 82 plants, spanning angiosperms to algae, including recently active elements in some flowering plants. RTE elements in flowering plants were likely derived from a single family we refer to as An-RTE. Interestingly, An-RTEs show significant DNA sequence identity with non-LTR retroelements from 42 animals belonging to four phyla. Moreover, the sequence identity of RTEs between two arthropods and two plants was higher than that of homologous genes. Phylogenetic and evolutionary analyses of RTEs from both animals and plants suggest that the An-RTE family was likely transferred horizontally into angiosperms from an ancient aphid(s) or ancestral arthropod(s). Notably, some An-RTEs were recruited as coding sequences of functional genes participating in metabolic or other biochemical processes in plants. This is the first potential example of horizontal transfer of transposons between animals and flowering plants. Our findings help to understand exchanges of genetic material between the kingdom Animalia and Plantae and suggest arthropods likely impacted on plant genome evolution.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | - Ye Chu
- Department of Horticulture, University of Georgia, Tifton, GA
| | - Han Xia
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA.,Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Jinan, Shandong, China
| | - Chunming Xu
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | - Karolina Heyduk
- Department of Plant Biology, University of Georgia, Athens, GA
| | - Brian Abernathy
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | | | | | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| |
Collapse
|
8
|
Wang N, Dawe RK. Centromere Size and Its Relationship to Haploid Formation in Plants. MOLECULAR PLANT 2018; 11:398-406. [PMID: 29277426 DOI: 10.1016/j.molp.2017.12.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Revised: 12/07/2017] [Accepted: 12/12/2017] [Indexed: 05/18/2023]
Abstract
Wide species crosses often result in uniparental genome elimination and visible failures in centromere function. Crosses involving lines with mutated forms of the CENH3 histone variant that organizes the centromere/kinetochore interface have been shown to have similar effects, inducing haploids at high frequencies. Here, we propose a simple centromere size model that endeavors to explain both observations. It is based on the idea of a quantitative centromere architecture where each centromere in an individual is the same size, and the average size is dictated by a natural equilibrium between bound and unbound CENH3 (and its chaperones or binding proteins). While centromere size is determined by the cellular milieu, centromere positions are heritable and defined by the interactions of a small set of proteins that bind to both DNA and CENH3. Lines with defective or mutated CENH3 have a lower loading capacity and support smaller centromeres. In cases where a line with small or defective centromeres is crossed to a line with larger or normal centromeres, the smaller/defective centromeres are selectively degraded or not maintained, resulting in chromosome loss from the small-centromere parent. The model is testable and generalizable, and helps to explain the counterintuitive observation that inducer lines do not induce haploids when crossed to themselves.
Collapse
Affiliation(s)
- Na Wang
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - R Kelly Dawe
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA; Department of Genetics, University of Georgia, Athens, GA 30602, USA.
| |
Collapse
|
9
|
Gao D, Li Y, Kim KD, Abernathy B, Jackson SA. Landscape and evolutionary dynamics of terminal repeat retrotransposons in miniature in plant genomes. Genome Biol 2016; 17:7. [PMID: 26781660 PMCID: PMC4717578 DOI: 10.1186/s13059-015-0867-y] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 12/29/2015] [Indexed: 12/05/2022] Open
Abstract
Background Terminal repeat retrotransposons in miniature (TRIMs) are a unique group of small long terminal repeat retrotransposons that are difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes, and their evolutionary and biological significance as well as their transposition mechanisms are poorly understood. Results Using a combination of de novo and homology-based methods, we annotate TRIMs in 48 plant genome sequences, spanning land plants to algae. The TRIMs are grouped into 156 families including 145 that were previously undefined. Notably, we identify the first TRIMs in a lycophyte and non-vascular plants. The majority of the TRIM families are highly conserved and shared within and between plant families. Unlike other long terminal repeat retrotransposons, TRIMs are enriched in or near genes; they are also targeted by sRNAs between 21 and 24 nucleotides in length, and are frequently found in CG body-methylated genes. Importantly, we also identify putative autonomous retrotransposons and very recent transpositions of a TRIM element in Oryza sativa. Conclusions We perform the most comprehensive analysis of TRIM transposons thus far and report that TRIMs are ubiquitous across plant genomes. Our results show that TRIMs are more frequently associated with large and CG body-methylated genes that have undergone strong purifying selection. Our findings also indicate that TRIMs are likely derived from internal deletions of large long terminal repeat retrotransposons. Finally, our data and methodology are important resources for the characterization and evolutionary and genomic studies of long terminal repeat retrotransposons in other genomes. Electronic supplementary material The online version of this article (doi:10.1186/s13059-015-0867-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602, USA.
| | - Yupeng Li
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602, USA.
| | - Kyung Do Kim
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602, USA.
| | - Brian Abernathy
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602, USA.
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602, USA.
| |
Collapse
|
10
|
Gao D, Jiang N, Wing RA, Jiang J, Jackson SA. Transposons play an important role in the evolution and diversification of centromeres among closely related species. FRONTIERS IN PLANT SCIENCE 2015; 6:216. [PMID: 25904926 PMCID: PMC4387472 DOI: 10.3389/fpls.2015.00216] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2015] [Accepted: 03/17/2015] [Indexed: 05/18/2023]
Abstract
Centromeres are important chromosomal regions necessary for eukaryotic cell segregation and replication. Due to high amounts of tandem repeats and transposons, centromeres have been difficult to sequence in most multicellular organisms, thus their sequence structure and evolution are poorly understood. In this study, we analyzed transposons in the centromere 8 (Cen8) from the African cultivated rice (O. glaberrima) and two subspecies of the Asian cultivated rice (O. sativa), indica and japonica. We detected much higher transposon contents (>69%) in centromere regions than in the whole genomes of O. sativa ssp. japonica and O. glaberrima (~35%). We compared the three Cen8s and identified numerous recent insertions of transposons that were frequently organized into multiple-layer nested blocks, similar to nested transposons in maize. Except for the Hopi retrotransposon, all LTR retrotransposons were shared but exhibit different abundances amongst the three Cen8s. Even though a majority of the transposons were located in intergenic regions, some gene-related transposons were found and may be involved in gene diversification. Chromatin immunoprecipitated (ChIP) data analysis revealed that 165 families from both Class I and Class II transposons were found in CENH3-associated chromatin sequences. These results indicate essential roles for transposons in centromeres and that the rapid divergence of the Cen8 sequences between the two cultivated rice species was primarily caused by recent transposon insertions.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of GeorgiaAthens, GA, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State UniversityEast Lansing, MI, USA
| | - Rod A. Wing
- Department of Plant Sciences, Arizona Genome Institute, University of ArizonaTucson, AZ, USA
| | - Jiming Jiang
- Department of Horticulture, University of Wisconsin-MadisonMadison, WI, USA
| | - Scott A. Jackson
- Center for Applied Genetic Technologies, University of GeorgiaAthens, GA, USA
- *Correspondence: Scott A. Jackson, Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Rd, Athens, GA 30602, USA
| |
Collapse
|
11
|
Yi C, Zhang W, Dai X, Li X, Gong Z, Zhou Y, Liang G, Gu M. Identification and diversity of functional centromere satellites in the wild rice species Oryza brachyantha. Chromosome Res 2014; 21:725-37. [PMID: 24077888 DOI: 10.1007/s10577-013-9374-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Accepted: 07/05/2013] [Indexed: 11/28/2022]
Abstract
The centromere is a key chromosomal component for sister chromatid cohesion and is the site for kinetochore assembly and spindle fiber attachment, allowing each sister chromatid to faithfully segregate to each daughter cell during cell division. It is not clear what types of sequences act as functional centromeres and how centromere sequences are organized in Oryza brachyantha, an FF genome species. In this study, we found that the three classes of centromere-specific CentO-F satellites (CentO-F1, CentO-F2, and CentOF3) in O. brachyantha share no homology with the CentO satellites in Oryza sativa. The three classes of CentO-F satellites are all located within the chromosomal regions to which the spindle fibers attach and are characterized by megabase tandem arrays that are flanked by centromere-specific retrotransposons, CRR-F, in the O. brachyantha centromeres. Although these CentO-F satellites are quantitatively variable among 12 O. brachyantha centromeres, immunostaining with an antibody specific to CENH3 indicates that they are colocated with CENH3 in functional centromere regions. Our results demonstrate that the three classes of CentO-F satellites may be the major components of functional centromeres in O. brachyantha.
Collapse
|
12
|
Zhang H, Koblížková A, Wang K, Gong Z, Oliveira L, Torres GA, Wu Y, Zhang W, Novák P, Buell CR, Macas J, Jiang J. Boom-Bust Turnovers of Megabase-Sized Centromeric DNA in Solanum Species: Rapid Evolution of DNA Sequences Associated with Centromeres. THE PLANT CELL 2014; 26:1436-1447. [PMID: 24728646 PMCID: PMC4036563 DOI: 10.1105/tpc.114.123877] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2014] [Revised: 03/14/2014] [Accepted: 03/26/2014] [Indexed: 05/18/2023]
Abstract
Centromeres are composed of long arrays of satellite repeats in most multicellular eukaryotes investigated to date. The satellite repeat-based centromeres are believed to have evolved from "neocentromeres" that originally contained only single- or low-copy sequences. However, the emergence and evolution of the satellite repeats in centromeres has been elusive. Potato (Solanum tuberosum) provides a model system for studying centromere evolution because each of its 12 centromeres contains distinct DNA sequences, allowing comparative analysis of homoeologous centromeres from related species. We conducted genome-wide analysis of the centromeric sequences in Solanum verrucosum, a wild species closely related to potato. Unambiguous homoeologous centromeric sequences were detected in only a single centromere (Cen9) between the two species. Four centromeres (Cen2, Cen4, Cen7, and Cen10) in S. verrucosum contained distinct satellite repeats that were amplified from retrotransposon-related sequences. Strikingly, the same four centromeres in potato contain either different satellite repeats (Cen2 and Cen7) or exclusively single- and low-copy sequences (Cen4 and Cen10). Our sequence comparison of five homoeologous centromeres in two Solanum species reveals rapid divergence of centromeric sequences among closely related species. We propose that centromeric satellite repeats undergo boom-bust cycles before a favorable repeat is fixed in the population.
Collapse
Affiliation(s)
- Haiqin Zhang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706 Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan 611130, People's Republic of China
| | - Andrea Koblížková
- Institute of Plant Molecular Biology, Biology Centre ASCR, Ceske Budejovice CZ-37005, Czech Republic
| | - Kai Wang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706
| | - Zhiyun Gong
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706 Key Laboratory of Crop Genetics and Physiology of Jiangsu Province/Key Laboratory of Plant Functional Genomics of Ministry of Education, Yangzhou University, Yangzhou 225009, People's Republic of China
| | - Ludmila Oliveira
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706 Departmento de Biologia, Universidade Federal de Lavras, Lavras MG 37200, Brazil
| | - Giovana A Torres
- Departmento de Biologia, Universidade Federal de Lavras, Lavras MG 37200, Brazil
| | - Yufeng Wu
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706
| | - Wenli Zhang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706
| | - Petr Novák
- Institute of Plant Molecular Biology, Biology Centre ASCR, Ceske Budejovice CZ-37005, Czech Republic
| | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
| | - Jiří Macas
- Institute of Plant Molecular Biology, Biology Centre ASCR, Ceske Budejovice CZ-37005, Czech Republic
| | - Jiming Jiang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706
| |
Collapse
|
13
|
Gao D, Abernathy B, Rohksar D, Schmutz J, Jackson SA. Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris). FRONTIERS IN PLANT SCIENCE 2014; 5:339. [PMID: 25071814 PMCID: PMC4093653 DOI: 10.3389/fpls.2014.00339] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Accepted: 06/25/2014] [Indexed: 05/21/2023]
Abstract
Common bean (Phaseolus vulgaris) is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs) are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs) were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF) termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3'LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. These transposon data provide a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of GeorgiaAthens, GA, USA
| | - Brian Abernathy
- Center for Applied Genetic Technologies, University of GeorgiaAthens, GA, USA
| | - Daniel Rohksar
- US Department of Energy Joint Genome InstituteWalnut Creek, CA, USA
| | - Jeremy Schmutz
- US Department of Energy Joint Genome InstituteWalnut Creek, CA, USA
- HudsonAlpha Institute of BiotechnologyHuntsville, AL, USA
| | - Scott A. Jackson
- Center for Applied Genetic Technologies, University of GeorgiaAthens, GA, USA
- *Correspondence: Scott A. Jackson, Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA 30602, USA e-mail:
| |
Collapse
|
14
|
Bennetzen JL, Wang H. The contributions of transposable elements to the structure, function, and evolution of plant genomes. ANNUAL REVIEW OF PLANT BIOLOGY 2014; 65:505-30. [PMID: 24579996 DOI: 10.1146/annurev-arplant-050213-035811] [Citation(s) in RCA: 304] [Impact Index Per Article: 30.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Transposable elements (TEs) are the key players in generating genomic novelty by a combination of the chromosome rearrangements they cause and the genes that come under their regulatory sway. Genome size, gene content, gene order, centromere function, and numerous other aspects of nuclear biology are driven by TE activity. Although the origins and attitudes of TEs have the hallmarks of selfish DNA, there are numerous cases where TE components have been co-opted by the host to create new genes or modify gene regulation. In particular, epigenetic regulation has been transformed from a process to silence invading TEs and viruses into a key strategy for regulating plant genes. Most, perhaps all, of this epigenetic regulation is derived from TE insertions near genes or TE-encoded factors that act in trans. Enormous pools of genome data and new technologies for reverse genetics will lead to a powerful new era of TE analysis in plants.
Collapse
Affiliation(s)
- Jeffrey L Bennetzen
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | | |
Collapse
|
15
|
Qi LL, Wu JJ, Friebe B, Qian C, Gu YQ, Fu DL, Gill BS. Sequence organization and evolutionary dynamics of Brachypodium-specific centromere retrotransposons. Chromosome Res 2013; 21:507-21. [PMID: 23955173 DOI: 10.1007/s10577-013-9378-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 07/30/2013] [Accepted: 07/30/2013] [Indexed: 12/18/2022]
Abstract
Brachypodium distachyon is a wild annual grass belonging to the Pooideae, more closely related to wheat, barley, and forage grasses than rice and maize. As an experimental model, the completed genome sequence of B. distachyon provides a unique opportunity to study centromere evolution during the speciation of grasses. Centromeric satellite sequences have been identified in B. distachyon, but little is known about centromeric retrotransposons in this species. In the present study, bacterial artificial chromosome (BAC)-fluorescence in situ hybridization was conducted in maize, rice, barley, wheat, and rye using B. distachyon (Bd) centromere-specific BAC clones. Eight Bd centromeric BAC clones gave no detectable fluorescence in situ hybridization (FISH) signals on the chromosomes of rice and maize, and three of them also did not yield any FISH signals in barley, wheat, and rye. In addition, four of five Triticeae centromeric BAC clones did not hybridize to the B. distachyon centromeres, implying certain unique features of Brachypodium centromeres. Analysis of Brachypodium centromeric BAC sequences identified a long terminal repeat (LTR)-centromere retrotransposon of B. distachyon (CRBd1). This element was found in high copy number accounting for 1.6 % of the B. distachyon genome, and is enriched in Brachypodium centromeric regions. CRBd1 accumulated in active centromeres, but was lost from inactive ones. The LTR of CRBd1 appears to be specific to B. distachyon centromeres. These results reveal different evolutionary events of this retrotransposon family across grass species.
Collapse
Affiliation(s)
- L L Qi
- Northern Crop Science Laboratory, USDA-ARS, 1605 Albrecht Blvd N, Fargo, ND 58102-2765, USA.
| | | | | | | | | | | | | |
Collapse
|
16
|
Gao D, Jimenez-Lopez JC, Iwata A, Gill N, Jackson SA. Functional and structural divergence of an unusual LTR retrotransposon family in plants. PLoS One 2012; 7:e48595. [PMID: 23119066 PMCID: PMC3485330 DOI: 10.1371/journal.pone.0048595] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 09/28/2012] [Indexed: 12/24/2022] Open
Abstract
Retrotransposons with long terminal repeats (LTRs) more than 3 kb are not frequent in most eukaryotic genomes. Rice LTR retrotransposon, Retrosat2, has LTRs greater than 3.2 kb and two open reading frames (ORF): ORF1 encodes enzymes for retrotransposition whereas no function can be assigned to ORF0 as it is not found in any other organism. A variety of experimental and in silico approaches were used to determine the origin of Retrosat2 and putative function of ORF0. Our data show that not only is Retrosat2 highly abundant in the Oryza genus, it may yet be active in rice. Homologs of Retrosat2 were identified in maize, sorghum, Arabidopsis and other plant genomes suggesting that the Retrosat2 family is of ancient origin. Several putatively cis-acting elements, some multicopy, that regulate retrotransposon replication or responsiveness to environmental factors were found in the LTRs of Retrosat2. Unlike the ORF1, the ORF0 sequences from Retrosat2 and homologs are divergent at the sequence level, 3D-structures and predicted biological functions. In contrast to other retrotransposon families, Retrosat2 and its homologs are dispersed throughout genomes and not concentrated in the specific chromosomal regions, such as centromeres. The genomic distribution of Retrosat2 homologs varies across species which likely reflects the differing evolutionary trajectories of this retrotransposon family across diverse species.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| | - Jose C. Jimenez-Lopez
- Department of Biochemistry, Cell & Molecular Biology of Plants, Estacion Experimental del Zaidin, High Council for Scientific Research, Granada, Spain
| | - Aiko Iwata
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| | - Navdeep Gill
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
| | - Scott A. Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| |
Collapse
|
17
|
Montiel EE, Cabrero J, Camacho JPM, López-León MD. Gypsy, RTE and Mariner transposable elements populate Eyprepocnemis plorans genome. Genetica 2012; 140:365-74. [PMID: 23073915 DOI: 10.1007/s10709-012-9686-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2012] [Accepted: 10/08/2012] [Indexed: 12/12/2022]
Abstract
We analyze here the presence and abundance of three types of transposable elements (TEs), i.e. Gypsy, RTE and Mariner, in the genome of the grasshopper Eyprepocnemis plorans. PCR experiments allowed amplification, cloning and sequencing of these elements (EploGypI, EploRTE5, EploMar20) from the E. plorans genome. Fluorescent in situ hybridization (FISH) showed that all three elements are restricted to euchromatic regions, thus being absent from the pericentromeric region of all A chromosomes, which contain a satellite DNA (satDNA) and ribosomal DNA (rDNA), and being very scarce in B chromosomes mostly made up of these two types of repetitive DNA. FISH suggested that EploGypI is the most abundant and EploMar20 is the least abundant, with EploRTE5 showing intermediate abundance. An estimation of copy number, by means of quantitative PCR, showed that EploGypI is, by far, the most abundant element, followed by EploRTE5 and EploMar20, in consistency with FISH results. RNA isolation and PCR experiments on complementary DNA (cDNA) showed the presence of transcripts for the three TE elements. The implications of the preferential location of these TE elements into euchromatin, the significance of TE abundance in the giant genome of this species, and a possible relationship between TEs and B chromosome mutability, are discussed.
Collapse
Affiliation(s)
- Eugenia E Montiel
- Departamento de Genética, Universidad de Granada, 18071 Granada, Spain
| | | | | | | |
Collapse
|
18
|
Yang L, Koo DH, Li Y, Zhang X, Luan F, Havey MJ, Jiang J, Weng Y. Chromosome rearrangements during domestication of cucumber as revealed by high-density genetic mapping and draft genome assembly. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2012; 71:895-906. [PMID: 22487099 DOI: 10.1111/j.1365-313x.2012.05017.x] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Cucumber, Cucumis sativus L. is the only taxon with 2n = 2x = 14 chromosomes in the genus Cucumis. It consists of two cross-compatible botanical varieties: the cultivated C. sativus var. sativus and the wild C. sativus var. hardwickii. There is no consensus on the evolutionary relationship between the two taxa. Whole-genome sequencing of the cucumber genome provides a new opportunity to advance our understanding of chromosome evolution and the domestication history of cucumber. In this study, a high-density genetic map for cultivated cucumber was developed that contained 735 marker loci in seven linkage groups spanning 707.8 cM. Integration of genetic and physical maps resulted in a chromosome-level draft genome assembly comprising 193 Mbp, or 53% of the 367 Mbp cucumber genome. Strategically selected markers from the genetic map and draft genome assembly were employed to screen for fosmid clones for use as probes in comparative fluorescence in situ hybridization analysis of pachytene chromosomes to investigate genetic differentiation between wild and cultivated cucumbers. Significant differences in the amount and distribution of heterochromatins, as well as chromosomal rearrangements, were uncovered between the two taxa. In particular, six inversions, five paracentric and one pericentric, were revealed in chromosomes 4, 5 and 7. Comparison of the order of fosmid loci on chromosome 7 of cultivated and wild cucumbers, and the syntenic melon chromosome I suggested that the paracentric inversion in this chromosome occurred during domestication of cucumber. The results support the sub-species status of these two cucumber taxa, and suggest that C. sativus var. hardwickii is the progenitor of cultivated cucumber.
Collapse
Affiliation(s)
- Luming Yang
- Horticulture Department, University of Wisconsin, Madison, WI 53706, USA
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Steinbauerová V, Neumann P, Novák P, Macas J. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons. Genetica 2012; 139:1543-55. [PMID: 22544262 DOI: 10.1007/s10709-012-9654-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/16/2012] [Indexed: 01/21/2023]
Abstract
Long terminal repeat (LTR) retrotransposons make up substantial parts of most higher plant genomes where they accumulate due to their replicative mode of transposition. Although the transposition is facilitated by proteins encoded within the gag-pol region which is common to all autonomous elements, some LTR retrotransposons were found to potentially carry an additional protein coding capacity represented by extra open reading frames located upstream or downstream of gag-pol. In this study, we performed a comprehensive in silico survey and comparative analysis of these extra open reading frames (ORFs) in the group of Ty3/gypsy LTR retrotransposons as the first step towards our understanding of their origin and function. We found that extra ORFs occur in all three major lineages of plant Ty3/gypsy elements, being the most frequent in the Tat lineage where most (77 %) of identified elements contained extra ORFs. This lineage was also characterized by the highest diversity of extra ORF arrangement (position and orientation) within the elements. On the other hand, all of these ORFs could be classified into only two broad groups based on their mutual similarities or the presence of short conserved motifs in their inferred protein sequences. In the Athila lineage, the extra ORFs were confined to the element 3' regions but they displayed much higher sequence diversity compared to those found in Tat. In the lineage of Chromoviruses the extra ORFs were relatively rare, occurring only in 5' regions of a group of elements present in a single plant family (Poaceae). In all three lineages, most extra ORFs lacked sequence similarities to characterized gene sequences or functional protein domains, except for two Athila-like elements with similarities to LOGL4 gene and part of the Chromoviruses extra ORFs that displayed partial similarity to histone H3 gene. Thus, in these cases the extra ORFs most likely originated by transduction or recombination of cellular gene sequences. In addition, the protein domain which is otherwise associated with DNA transposons have been detected in part of the Tat-like extra ORFs, pointing to their origin from an insertion event of a mobile element.
Collapse
Affiliation(s)
- Veronika Steinbauerová
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branišovská 31, Ceske Budejovice, Czech Republic
| | | | | | | |
Collapse
|
20
|
Gao D, Chen J, Chen M, Meyers BC, Jackson S. A highly conserved, small LTR retrotransposon that preferentially targets genes in grass genomes. PLoS One 2012; 7:e32010. [PMID: 22359654 PMCID: PMC3281118 DOI: 10.1371/journal.pone.0032010] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Accepted: 01/18/2012] [Indexed: 12/31/2022] Open
Abstract
LTR retrotransposons are often the most abundant components of plant genomes and can impact gene and genome evolution. Most reported LTR retrotransposons are large elements (>4 kb) and are most often found in heterochromatic (gene poor) regions. We report the smallest LTR retrotransposon found to date, only 292 bp. The element is found in rice, maize, sorghum and other grass genomes, which indicates that it was present in the ancestor of grass species, at least 50-80 MYA. Estimated insertion times, comparisons between sequenced rice lines, and mRNA data indicate that this element may still be active in some genomes. Unlike other LTR retrotransposons, the small LTR retrotransposons (SMARTs) are distributed throughout the genomes and are often located within or near genes with insertion patterns similar to MITEs (miniature inverted repeat transposable elements). Our data suggests that insertions of SMARTs into or near genes can, in a few instances, alter both gene structures and gene expression. Further evidence for a role in regulating gene expression, SMART-specific small RNAs (sRNAs) were identified that may be involved in gene regulation. Thus, SMARTs may have played an important role in genome evolution and genic innovation and may provide a valuable tool for gene tagging systems in grass.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies and Institute for Plant Breeding Genetics and Genomics, University of Georgia, Athens, Georgia, United States of America
| | - Jinfeng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Mingsheng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Blake C. Meyers
- Department of Plant and Soil Sciences, and Delaware Biotechnology Institute, University of Delaware, Newark, Delaware, United States of America
| | - Scott Jackson
- Center for Applied Genetic Technologies and Institute for Plant Breeding Genetics and Genomics, University of Georgia, Athens, Georgia, United States of America
- * E-mail:
| |
Collapse
|
21
|
Fan C, Walling JG, Zhang J, Hirsch CD, Jiang J, Wing RA. Conservation and purifying selection of transcribed genes located in a rice centromere. THE PLANT CELL 2011; 23:2821-30. [PMID: 21856794 PMCID: PMC3180794 DOI: 10.1105/tpc.111.085605] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Recombination is strongly suppressed in centromeric regions. In chromosomal regions with suppressed recombination, deleterious mutations can easily accumulate and cause degeneration of genes and genomes. Surprisingly, the centromere of chromosome8 (Cen8) of rice (Oryza sativa) contains several transcribed genes. However, it remains unclear as to what selective forces drive the evolution and existence of transcribed genes in Cen8. Sequencing of orthologous Cen8 regions from two additional Oryza species, Oryza glaberrima and Oryza brachyantha, which diverged from O. sativa 1 and 10 million years ago, respectively, revealed a set of seven transcribed Cen8 genes conserved across all three species. Chromatin immunoprecipitation analysis with the centromere-specific histone CENH3 confirmed that the sequenced orthologous regions are part of the functional centromere. All seven Cen8 genes have undergone purifying selection, representing a striking phenomenon of active gene survival within a recombination-free zone over a long evolutionary time. The coding sequences of the Cen8 genes showed sequence divergence and mutation rates that were significantly reduced from those of genes located on the chromosome arms. This suggests that Oryza has a mechanism to maintain the fidelity and functionality of Cen8 genes, even when embedded in a sea of repetitive sequences and transposable elements.
Collapse
MESH Headings
- Base Sequence
- Centromere/genetics
- Chromatin Immunoprecipitation
- Chromosomes, Plant/genetics
- DNA Transposable Elements
- DNA, Plant/genetics
- Evolution, Molecular
- Genes, Plant/genetics
- Genetic Variation/genetics
- Genome, Plant/genetics
- Molecular Sequence Data
- Mutation Rate
- Oryza/classification
- Oryza/genetics
- Polymorphism, Single Nucleotide
- Repetitive Sequences, Nucleic Acid
- Selection, Genetic
- Sequence Analysis, DNA
- Transcription, Genetic
Collapse
Affiliation(s)
- Chuanzhu Fan
- Arizona Genomics Institute, School of Plant Sciences, Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Jason G. Walling
- Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706
| | - Jianwei Zhang
- Arizona Genomics Institute, School of Plant Sciences, Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Cory D. Hirsch
- Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706
| | - Jiming Jiang
- Department of Horticulture, University of Wisconsin, Madison, Wisconsin 53706
| | - Rod A. Wing
- Arizona Genomics Institute, School of Plant Sciences, Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
- Address correspondence to
| |
Collapse
|
22
|
Llorens C, Futami R, Covelli L, Domínguez-Escribá L, Viu JM, Tamarit D, Aguilar-Rodríguez J, Vicente-Ripolles M, Fuster G, Bernet GP, Maumus F, Munoz-Pomer A, Sempere JM, Latorre A, Moya A. The Gypsy Database (GyDB) of mobile genetic elements: release 2.0. Nucleic Acids Res 2011; 39:D70-4. [PMID: 21036865 PMCID: PMC3013669 DOI: 10.1093/nar/gkq1061] [Citation(s) in RCA: 236] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
This article introduces the second release of the Gypsy Database of Mobile Genetic Elements (GyDB 2.0): a research project devoted to the evolutionary dynamics of viruses and transposable elements based on their phylogenetic classification (per lineage and protein domain). The Gypsy Database (GyDB) is a long-term project that is continuously progressing, and that owing to the high molecular diversity of mobile elements requires to be completed in several stages. GyDB 2.0 has been powered with a wiki to allow other researchers participate in the project. The current database stage and scope are long terminal repeats (LTR) retroelements and relatives. GyDB 2.0 is an update based on the analysis of Ty3/Gypsy, Retroviridae, Ty1/Copia and Bel/Pao LTR retroelements and the Caulimoviridae pararetroviruses of plants. Among other features, in terms of the aforementioned topics, this update adds: (i) a variety of descriptions and reviews distributed in multiple web pages; (ii) protein-based phylogenies, where phylogenetic levels are assigned to distinct classified elements; (iii) a collection of multiple alignments, lineage-specific hidden Markov models and consensus sequences, called GyDB collection; (iv) updated RefSeq databases and BLAST and HMM servers to facilitate sequence characterization of new LTR retroelement and caulimovirus queries; and (v) a bibliographic server. GyDB 2.0 is available at http://gydb.org.
Collapse
Affiliation(s)
- Carlos Llorens
- Biotechvana, Parc Científic, Universitat de València, Calle Catedrático José Beltrán 2, 46980 Paterna, València, Spain.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
The compact Brachypodium genome conserves centromeric regions of a common ancestor with wheat and rice. Funct Integr Genomics 2010; 10:477-92. [DOI: 10.1007/s10142-010-0190-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2010] [Revised: 08/20/2010] [Accepted: 08/24/2010] [Indexed: 12/19/2022]
|
24
|
Du J, Tian Z, Hans CS, Laten HM, Cannon SB, Jackson SA, Shoemaker RC, Ma J. Evolutionary conservation, diversity and specificity of LTR-retrotransposons in flowering plants: insights from genome-wide analysis and multi-specific comparison. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010; 63:584-98. [PMID: 20525006 DOI: 10.1111/j.1365-313x.2010.04263.x] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The availability of complete or nearly complete genome sequences from several plant species permits detailed discovery and cross-species comparison of transposable elements (TEs) at the whole genome level. We initially investigated 510 long terminal repeat-retrotransposon (LTR-RT) families comprising 32370 elements in soybean (Glycine max (L.) Merr.). Approximately 87% of these elements were located in recombination-suppressed pericentromeric regions, where the ratio (1.26) of solo LTRs to intact elements (S/I) is significantly lower than that of chromosome arms (1.62). Further analysis revealed a significant positive correlation between S/I and LTR sizes, indicating that larger LTRs facilitate solo LTR formation. Phylogenetic analysis revealed seven Copia and five Gypsy evolutionary lineages that were present before the divergence of eudicot and monocot species, but the scales and timeframes within which they proliferated vary dramatically across families, lineages and species, and notably, a Copia lineage has been lost in soybean. Analysis of the physical association of LTR-RTs with centromere satellite repeats identified two putative centromere retrotransposon (CR) families of soybean, which were grouped into the CR (e.g. CRR and CRM) lineage found in grasses, indicating that the 'functional specification' of CR pre-dates the bifurcation of eudicots and monocots. However, a number of families of the CR lineage are not concentrated in centromeres, suggesting that their CR roles may now be defunct. Our data also suggest that the envelope-like genes in the putative Copia retrovirus-like family are probably derived from the Gypsy retrovirus-like lineage, and thus we propose the hypothesis of a single ancient origin of envelope-like genes in flowering plants.
Collapse
Affiliation(s)
- Jianchang Du
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Ammiraju JS, Song X, Luo M, Sisneros N, Angelova A, Kudrna D, Kim H, Yu Y, Goicoechea JL, Lorieux M, Kurata N, Brar D, Ware D, Jackson S, Wing RA. The Oryza BAC resource: a genus-wide and genome scale tool for exploring rice genome evolution and leveraging useful genetic diversity from wild relatives. BREEDING SCIENCE 2010; 60:536-543. [PMID: 0 DOI: 10.1270/jsbbs.60.536] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Affiliation(s)
- Jetty S.S. Ammiraju
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - Xiang Song
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - Meizhong Luo
- College of Life Sciences and Technology, Huazhong Agricultural University
| | - Nicholas Sisneros
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - Angelina Angelova
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - David Kudrna
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - HyeRan Kim
- Plant Genomics Institute, Chungnam National University
| | - Yeisoo Yu
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - Jose Luis Goicoechea
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| | - Mathias Lorieux
- Agrobiodiversity and Biotechnology Project, International Center for Tropical Agriculture (CIAT)
| | | | - Darshan Brar
- Department of Plant Breeding and Genetics, International Rice Research Institute (IRRI)
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor
- USDA-ARS NAA Plant, Soil and Nutrition Laboratory Research Unit
| | | | - Rod A. Wing
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona
| |
Collapse
|
26
|
Wu J, Fujisawa M, Tian Z, Yamagata H, Kamiya K, Shibata M, Hosokawa S, Ito Y, Hamada M, Katagiri S, Kurita K, Yamamoto M, Kikuta A, Machita K, Karasawa W, Kanamori H, Namiki N, Mizuno H, Ma J, Sasaki T, Matsumoto T. Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2009; 60:805-19. [PMID: 19702669 DOI: 10.1111/j.1365-313x.2009.04002.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.
Collapse
Affiliation(s)
- Jianzhong Wu
- Plant Genome Research Unit, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|