1
|
Grishko EO, Borodin PM. Structure and evolution of metapolycentromeres. Vavilovskii Zhurnal Genet Selektsii 2024; 28:592-601. [PMID: 39440311 PMCID: PMC11492452 DOI: 10.18699/vjgb-24-66] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 07/15/2024] [Accepted: 07/18/2024] [Indexed: 10/25/2024] Open
Abstract
Metapolycentromeres consist of multiple sequential domains of centromeric chromatin associated with a centromere-specific variant of histone H3 (CENP-A), functioning collectively as a single centromere. To date, they have been revealed in nine flowering plant, five insect and six vertebrate species. In this paper, we focus on their structure and possible mechanisms of emergence and evolution. The metapolycentromeres may vary in the number of centromeric domains and in their genetic content and epigenetic modifications. However, these variations do not seem to affect their function. The emergence of metapolycentromeres has been attributed to multiple Robertsonian translocations and segmental duplications. Conditions of genomic instability, such as interspecific hybridization and malignant neoplasms, are suggested as triggers for the de novo emergence of metapolycentromeres. Addressing the "centromere paradox" - the rapid evolution of centromeric DNA and proteins despite their conserved cellular function - we explore the centromere drive hypothesis as a plausible explanation for the dynamic evolution of centromeres in general, and in particular the emergence of metapolycentromeres and holocentromeres. Apparently, metapolycentromeres are more common across different species than it was believed until recently. Indeed, a systematic review of the available cytogenetic publications allowed us to identify 27 candidate species with metapolycentromeres. Тhe list of the already established and newly revealed candidate species thus spans 27 species of flowering plants and eight species of gymnosperm plants, five species of insects, and seven species of vertebrates. This indicates an erratic phylogenetic distribution of the species with metapolycentromeres and may suggest an independent emergence of the metapolycentromeres in the course of evolution. However, the current catalog of species with identified and likely metapolycentromeres remains too short to draw reliable conclusions about their evolution, particularly in the absence of knowledge about related species without metapolycentromeres for comparative analysis. More studies are necessary to shed light on the mechanisms of metapolycentromere formation and evolution.
Collapse
Affiliation(s)
- E O Grishko
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - P M Borodin
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
2
|
Lv Y, Liu C, Li X, Wang Y, He H, He W, Chen W, Yang L, Dai X, Cao X, Yu X, Liu J, Zhang B, Wei H, Zhang H, Qian H, Shi C, Leng Y, Liu X, Guo M, Wang X, Zhang Z, Wang T, Zhang B, Xu Q, Cui Y, Zhang Q, Yuan Q, Jahan N, Ma J, Zheng X, Zhou Y, Qian Q, Guo L, Shang L. A centromere map based on super pan-genome highlights the structure and function of rice centromeres. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2024; 66:196-207. [PMID: 38158885 DOI: 10.1111/jipb.13607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 12/28/2023] [Indexed: 01/03/2024]
Abstract
Rice (Oryza sativa) is a significant crop worldwide with a genome shaped by various evolutionary factors. Rice centromeres are crucial for chromosome segregation, and contain some unreported genes. Due to the diverse and complex centromere region, a comprehensive understanding of rice centromere structure and function at the population level is needed. We constructed a high-quality centromere map based on the rice super pan-genome consisting of a 251-accession panel comprising both cultivated and wild species of Asian and African rice. We showed that rice centromeres have diverse satellite repeat CentO, which vary across chromosomes and subpopulations, reflecting their distinct evolutionary patterns. We also revealed that long terminal repeats (LTRs), especially young Gypsy-type LTRs, are abundant in the peripheral CentO-enriched regions and drive rice centromere expansion and evolution. Furthermore, high-quality genome assembly and complete telomere-to-telomere (T2T) reference genome enable us to obtain more centromeric genome information despite mapping and cloning of centromere genes being challenging. We investigated the association between structural variations and gene expression in the rice centromere. A centromere gene, OsMAB, which positively regulates rice tiller number, was further confirmed by expression quantitative trait loci, haplotype analysis and clustered regularly interspaced palindromic repeats (CRISPR)/CRISPR-associated protein 9 methods. By revealing the new insights into the evolutionary patterns and biological roles of rice centromeres, our finding will facilitate future research on centromere biology and crop improvement.
Collapse
Affiliation(s)
- Yang Lv
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Congcong Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xiaoxia Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yueying Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Huiying He
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Wenchuang He
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Wu Chen
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Longbo Yang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xiaofan Dai
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xinglan Cao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xiaoman Yu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Jiajia Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Bin Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Hua Wei
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Hong Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Hongge Qian
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Chuanlin Shi
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yue Leng
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xiangpei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Mingliang Guo
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Xianmeng Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Zhipeng Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Tianyi Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Bintao Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Qiang Xu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yan Cui
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Qianqian Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Qiaoling Yuan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Noushin Jahan
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Jie Ma
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Xiaoming Zheng
- Yazhouwan National Laboratory, No. 8 Huanjin Road, Yazhou District, Sanya City, 572024, China
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yongfeng Zhou
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Qian Qian
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
- Yazhouwan National Laboratory, No. 8 Huanjin Road, Yazhou District, Sanya City, 572024, China
| | - Longbiao Guo
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, 310006, China
| | - Lianguang Shang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
- Yazhouwan National Laboratory, No. 8 Huanjin Road, Yazhou District, Sanya City, 572024, China
| |
Collapse
|
3
|
Yang LL, Zhang XY, Wang LY, Li YG, Li XT, Yang Y, Su Q, Chen N, Zhang YL, Li N, Deng CL, Li SF, Gao WJ. Lineage-specific amplification and epigenetic regulation of LTR-retrotransposons contribute to the structure, evolution, and function of Fabaceae species. BMC Genomics 2023; 24:423. [PMID: 37501164 PMCID: PMC10373317 DOI: 10.1186/s12864-023-09530-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/22/2023] [Indexed: 07/29/2023] Open
Abstract
BACKGROUND Long terminal repeat (LTR)-retrotransposons (LTR-RTs) are ubiquitous and make up the majority of nearly all sequenced plant genomes, whereas their pivotal roles in genome evolution, gene expression regulation as well as their epigenetic regulation are still not well understood, especially in a large number of closely related species. RESULTS Here, we analyzed the abundance and dynamic evolution of LTR-RTs in 54 species from an economically and agronomically important family, Fabaceae, and also selected two representative species for further analysis in expression of associated genes, transcriptional activity and DNA methylation patterns of LTR-RTs. Annotation results revealed highly varied proportions of LTR-RTs in these genomes (5.1%~68.4%) and their correlation with genome size was highly positive, and they were significantly contributed to the variance in genome size through species-specific unique amplifications. Almost all of the intact LTR-RTs were inserted into the genomes 4 Mya (million years ago), and more than 50% of them were inserted in the last 0.5 million years, suggesting that recent amplifications of LTR-RTs were an important force driving genome evolution. In addition, expression levels of genes with intronic, promoter, and downstream LTR-RT insertions of Glycine max and Vigna radiata, two agronomically important crops in Fabaceae, showed that the LTR-RTs located in promoter or downstream regions suppressed associated gene expression. However, the LTR-RTs within introns promoted gene expression or had no contribution to gene expression. Additionally, shorter and younger LTR-RTs maintained higher mobility and transpositional potential. Compared with the transcriptionally silent LTR-RTs, the active elements showed significantly lower DNA methylation levels in all three contexts. The distributions of transcriptionally active and silent LTR-RT methylation varied across different lineages due to the position of LTR-RTs located or potentially epigenetic regulation. CONCLUSION Lineage-specific amplification patterns were observed and higher methylation level may repress the activity of LTR-RTs, further influence evolution in Fabaceae species. This study offers valuable clues into the evolution, function, transcriptional activity and epigenetic regulation of LTR-RTs in Fabaceae genomes.
Collapse
Affiliation(s)
- Long-Long Yang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Xin-Yu Zhang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Li-Ying Wang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yan-Ge Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Xiao-Ting Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yi Yang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Qing Su
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Ning Chen
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yu-Lan Zhang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Ning Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Chuan-Liang Deng
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China.
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China.
| |
Collapse
|
4
|
Zhang F, Chen F, Schwarzacher T, Heslop-Harrison JS, Teng N. The nature and genomic landscape of repetitive DNA classes in Chrysanthemum nankingense shows recent genomic changes. ANNALS OF BOTANY 2023; 131:215-228. [PMID: 35639931 PMCID: PMC9904347 DOI: 10.1093/aob/mcac066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 05/24/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND AIMS Tandemly repeated DNA and transposable elements represent most of the DNA in higher plant genomes. High-throughput sequencing allows a survey of the DNA in a genome, but whole-genome assembly can miss a substantial fraction of highly repeated sequence motifs. Chrysanthemum nankingense (2n = 2x = 18; genome size = 3.07 Gb; Asteraceae), a diploid reference for the many auto- and allopolyploids in the genus, was considered as an ancestral species and serves as an ornamental plant and high-value food. We aimed to characterize the major repetitive DNA motifs, understand their structure and identify key features that are shaped by genome and sequence evolution. METHODS Graph-based clustering with RepeatExplorer was used to identify and classify repetitive motifs in 2.14 millions of 250-bp paired-end Illumina reads from total genomic DNA of C. nankingense. Independently, the frequency of all canonical motifs k-bases long was counted in the raw read data and abundant k-mers (16, 21, 32, 64 and 128) were extracted and assembled to generate longer contigs for repetitive motif identification. For comparison, long terminal repeat retrotransposons were checked in the published C. nankingense reference genome. Fluorescent in situ hybridization was performed to show the chromosomal distribution of the main types of repetitive motifs. KEY RESULTS Apart from rDNA (0.86 % of the total genome), a few microsatellites (0.16 %), and telomeric sequences, no highly abundant tandem repeats were identified. There were many transposable elements: 40 % of the genome had sequences with recognizable domains related to transposable elements. Long terminal repeat retrotransposons showed widespread distribution over chromosomes, although different sequence families had characteristic features such as abundance at or exclusion from centromeric or subtelomeric regions. Another group of very abundant repetitive motifs, including those most identified as low-complexity sequences (9.07 %) in the genome, showed no similarity to known sequence motifs or tandemly repeated elements. CONCLUSIONS The Chrysanthemum genome has an unusual structure with a very low proportion of tandemly repeated sequences (~1.02 %) in the genome, and a high proportion of low-complexity sequences, most likely degenerated remains of transposable elements. Identifying the presence, nature and genomic organization of major genome fractions enables inference of the evolutionary history of sequences, including degeneration and loss, critical to understanding biodiversity and diversification processes in the genomes of diploid and polyploid Chrysanthemum, Asteraceae and plants more widely.
Collapse
Affiliation(s)
- Fengjiao Zhang
- Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, Key Laboratory of Biology of Ornamental Plants in East China, National Forestry and Grassland Administration, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences (Nanjing Botanical Garden Mem. Sun Yat-Sen), Nanjing, 210014, China
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
| | - Fadi Chen
- Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, Key Laboratory of Biology of Ornamental Plants in East China, National Forestry and Grassland Administration, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Trude Schwarzacher
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | | | | |
Collapse
|
5
|
Macas J, Ávila Robledillo L, Kreplak J, Novák P, Koblížková A, Vrbová I, Burstin J, Neumann P. Assembly of the 81.6 Mb centromere of pea chromosome 6 elucidates the structure and evolution of metapolycentric chromosomes. PLoS Genet 2023; 19:e1010633. [PMID: 36735726 PMCID: PMC10027222 DOI: 10.1371/journal.pgen.1010633] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/20/2023] [Accepted: 01/23/2023] [Indexed: 02/04/2023] Open
Abstract
Centromeres in the legume genera Pisum and Lathyrus exhibit unique morphological characteristics, including extended primary constrictions and multiple separate domains of centromeric chromatin. These so-called metapolycentromeres resemble an intermediate form between monocentric and holocentric types, and therefore provide a great opportunity for studying the transitions between different types of centromere organizations. However, because of the exceedingly large and highly repetitive nature of metapolycentromeres, highly contiguous assemblies needed for these studies are lacking. Here, we report on the assembly and analysis of a 177.6 Mb region of pea (Pisum sativum) chromosome 6, including the 81.6 Mb centromere region (CEN6) and adjacent chromosome arms. Genes, DNA methylation profiles, and most of the repeats were uniformly distributed within the centromere, and their densities in CEN6 and chromosome arms were similar. The exception was an accumulation of satellite DNA in CEN6, where it formed multiple arrays up to 2 Mb in length. Centromeric chromatin, characterized by the presence of the CENH3 protein, was predominantly associated with arrays of three different satellite repeats; however, five other satellites present in CEN6 lacked CENH3. The presence of CENH3 chromatin was found to determine the spatial distribution of the respective satellites during the cell cycle. Finally, oligo-FISH painting experiments, performed using probes specifically designed to label the genomic regions corresponding to CEN6 in Pisum, Lathyrus, and Vicia species, revealed that metapolycentromeres evolved via the expansion of centromeric chromatin into neighboring chromosomal regions and the accumulation of novel satellite repeats. However, in some of these species, centromere evolution also involved chromosomal translocations and centromere repositioning.
Collapse
Affiliation(s)
- Jiří Macas
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| | - Laura Ávila Robledillo
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| | - Jonathan Kreplak
- Agroécologie, AgroSup Dijon, INRA, Univ. Bourgogne, Univ. Bourgogne Franche-Comté, Dijon, France
| | - Petr Novák
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| | - Andrea Koblížková
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| | - Iva Vrbová
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| | - Judith Burstin
- Agroécologie, AgroSup Dijon, INRA, Univ. Bourgogne, Univ. Bourgogne Franche-Comté, Dijon, France
| | - Pavel Neumann
- Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, Branišovská 31, České Budějovice, Czech Republic
| |
Collapse
|
6
|
Silva BSML, Picorelli ACR, Kuhn GCS. In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group. Genes (Basel) 2023; 14:300. [PMID: 36833227 PMCID: PMC9957191 DOI: 10.3390/genes14020300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 01/13/2023] [Accepted: 01/16/2023] [Indexed: 01/24/2023] Open
Abstract
Satellite DNA (satDNA) is a class of tandemly repeated non-protein coding DNA sequences which can be found in abundance in eukaryotic genomes. They can be functional, impact the genomic architecture in many ways, and their rapid evolution has consequences for species diversification. We took advantage of the recent availability of sequenced genomes from 23 Drosophila species from the montium group to study their satDNA landscape. For this purpose, we used publicly available whole-genome sequencing Illumina reads and the TAREAN (tandem repeat analyzer) pipeline. We provide the characterization of 101 non-homologous satDNA families in this group, 93 of which are described here for the first time. Their repeat units vary in size from 4 bp to 1897 bp, but most satDNAs show repeat units < 100 bp long and, among them, repeats ≤ 10 bp are the most frequent ones. The genomic contribution of the satDNAs ranges from ~1.4% to 21.6%. There is no significant correlation between satDNA content and genome sizes in the 23 species. We also found that at least one satDNA originated from an expansion of the central tandem repeats (CTRs) present inside a Helitron transposon. Finally, some satDNAs may be useful as taxonomic markers for the identification of species or subgroups within the group.
Collapse
Affiliation(s)
| | | | - Gustavo C. S. Kuhn
- Department of Genetics, Ecology and Evolution, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| |
Collapse
|
7
|
Zhuang Y, Wang X, Li X, Hu J, Fan L, Landis JB, Cannon SB, Grimwood J, Schmutz J, Jackson SA, Doyle JJ, Zhang XS, Zhang D, Ma J. Phylogenomics of the genus Glycine sheds light on polyploid evolution and life-strategy transition. NATURE PLANTS 2022; 8:233-244. [PMID: 35288665 DOI: 10.1038/s41477-022-01102-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 01/29/2022] [Indexed: 06/14/2023]
Abstract
Polyploidy and life-strategy transitions between annuality and perenniality often occur in flowering plants. However, the evolutionary propensities of polyploids and the genetic bases of such transitions remain elusive. We assembled chromosome-level genomes of representative perennial species across the genus Glycine including five diploids and a young allopolyploid, and constructed a Glycine super-pangenome framework by integrating 26 annual soybean genomes. These perennial diploids exhibit greater genome stability and possess fewer centromere repeats than the annuals. Biased subgenomic fractionation occurred in the allopolyploid, primarily by accumulation of small deletions in gene clusters through illegitimate recombination, which was associated with pre-existing local subgenomic differentiation. Two genes annotated to modulate vegetative-reproductive phase transition and lateral shoot outgrowth were postulated as candidates underlying the perenniality-annuality transition. Our study provides insights into polyploid genome evolution and lays a foundation for unleashing genetic potential from the perennial gene pool for soybean improvement.
Collapse
Affiliation(s)
- Yongbin Zhuang
- College of Agriculture, and State Key Laboratory of Crop Biology, Shangdong Agricultural University, Tai'an, China
| | - Xutong Wang
- Department of Agronomy, and Center for Plant Biology, Purdue University, West Lafayette, IN, USA
| | - Xianchong Li
- College of Agriculture, and State Key Laboratory of Crop Biology, Shangdong Agricultural University, Tai'an, China
| | - Junmei Hu
- College of Agriculture, and State Key Laboratory of Crop Biology, Shangdong Agricultural University, Tai'an, China
| | - Lichuan Fan
- College of Agriculture, and State Key Laboratory of Crop Biology, Shangdong Agricultural University, Tai'an, China
| | - Jacob B Landis
- School of Integrative Plant Science Plant Biology Section, Cornell University, Ithaca, NY, USA
| | - Steven B Cannon
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA, USA
| | - Jeffrey J Doyle
- School of Integrative Plant Science Plant Biology Section, Cornell University, Ithaca, NY, USA
| | - Xian Sheng Zhang
- College of Life Sciences, and State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China
| | - Dajian Zhang
- College of Agriculture, and State Key Laboratory of Crop Biology, Shangdong Agricultural University, Tai'an, China.
| | - Jianxin Ma
- Department of Agronomy, and Center for Plant Biology, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
8
|
Schmidt N, Seibt KM, Weber B, Schwarzacher T, Schmidt T, Heitkam T. Broken, silent, and in hiding: tamed endogenous pararetroviruses escape elimination from the genome of sugar beet (Beta vulgaris). ANNALS OF BOTANY 2021; 128:281-299. [PMID: 33729490 PMCID: PMC8389469 DOI: 10.1093/aob/mcab042] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 03/16/2021] [Indexed: 05/13/2023]
Abstract
BACKGROUND AND AIMS Endogenous pararetroviruses (EPRVs) are widespread components of plant genomes that originated from episomal DNA viruses of the Caulimoviridae family. Due to fragmentation and rearrangements, most EPRVs have lost their ability to replicate through reverse transcription and to initiate viral infection. Similar to the closely related retrotransposons, extant EPRVs were retained and often amplified in plant genomes for several million years. Here, we characterize the complete genomic EPRV fraction of the crop sugar beet (Beta vulgaris, Amaranthaceae) to understand how they shaped the beet genome and to suggest explanations for their absent virulence. METHODS Using next- and third-generation sequencing data and genome assembly, we reconstructed full-length in silico representatives for the three host-specific EPRVs (beetEPRVs) in the B. vulgaris genome. Focusing on the endogenous caulimovirid beetEPRV3, we investigated its chromosomal localization, abundance and distribution by fluorescent in situ and Southern hybridization. KEY RESULTS Full-length beetEPRVs range between 7.5 and 10.7 kb in size, are heterogeneous in structure and sequence, and occupy about 0.3 % of the beet genome. Although all three beetEPRVs were assigned to the florendoviruses, they showed variably arranged protein-coding domains, different fragmentation, and preferences for diverse sequence contexts. We observed small RNAs that specifically target the individual beetEPRVs, indicating stringent epigenetic suppression. BeetEPRV3 sequences occur along all sugar beet chromosomes, preferentially in the vicinity of each other and are associated with heterochromatic, centromeric and intercalary satellite DNAs. BeetEPRV3 members also exist in genomes of related wild species, indicating an initial beetEPRV3 integration 13.4-7.2 million years ago. CONCLUSIONS Our study in beet illustrates the variability of EPRV structure and sequence in a single host genome. Evidence of sequence fragmentation and epigenetic silencing implies possible plant strategies to cope with long-term persistence of EPRVs, including amplification, fixation in the heterochromatin, and containment of EPRV virulence.
Collapse
Affiliation(s)
- Nicola Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Beatrice Weber
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Trude Schwarzacher
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Xingke Road 723, Tianhe District, Guangzhou, PR China
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
9
|
Genome wide annotation and characterization of young, intact long terminal repeat retrotransposons (In-LTR-RTs) of seven legume species. Genetica 2020; 148:253-268. [PMID: 32949338 DOI: 10.1007/s10709-020-00103-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 09/02/2020] [Indexed: 10/23/2022]
Abstract
Availability of genome sequence of different legume species has provided an opportunity to characterize the abundance, distribution, and divergence of canonical intact long terminal retrotransposons (In-LTR-RT) superfamilies. Among seven legume species, Arachis ipaensis (Aip) showed the highest number of full-length canonical In-LTR-RTs (3325), followed by Glycine max (Gma, 2328), Vigna angularis (Van, 1625), Arachis durensis (Adu, 1348), Lotus japonicus (Lja, 1294), Medicago truncatula (Mtr, 788), and Circer arietinum (Car, 124). Divergence time analysis demonstrated that the amplification timeframe of LTR-RTs dramatically varied in different families. The average insertion time of Copia element varied from 0.51 (Van) to 1.37 million years ago (Mya) (Adu, and Aip), whereas that of Gypsy was between 0.22 (Mtr) and 1.82 Mya (Adu). Bayesian phylogenetic tree analysis suggested that the 1397 and 1917 reverse transcriptase (RT) domains of Copia and Gypsy families of the seven legume species were clustered into 7 and 14 major groups, respectively. The highest proportion (approximately 94.79-100%) of transposable element (TE)-associated genes assigned to pathways was mapped to metabolism-related pathways in all species. The results enabled the structural understanding of full-length In-LTR-RTs and will be valuable resource for the further study of the impact of TEs on gene structure and expression in legume species.
Collapse
|
10
|
Pinosio S, Marroni F, Zuccolo A, Vitulo N, Mariette S, Sonnante G, Aravanopoulos FA, Ganopoulos I, Palasciano M, Vidotto M, Magris G, Iezzoni A, Vendramin GG, Morgante M. A draft genome of sweet cherry (Prunus avium L.) reveals genome-wide and local effects of domestication. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:1420-1432. [PMID: 32391598 DOI: 10.1111/tpj.14809] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/24/2020] [Accepted: 05/01/2020] [Indexed: 05/26/2023]
Abstract
Sweet cherry (Prunus avium L.) trees are both economically important fruit crops but also important components of natural forest ecosystems in Europe, Asia and Africa. Wild and domesticated trees currently coexist in the same geographic areas with important questions arising on their historical relationships. Little is known about the effects of the domestication process on the evolution of the sweet cherry genome. We assembled and annotated the genome of the cultivated variety "Big Star*" and assessed the genetic diversity among 97 sweet cherry accessions representing three different stages in the domestication and breeding process (wild trees, landraces and modern varieties). The genetic diversity analysis revealed significant genome-wide losses of variation among the three stages and supports a clear distinction between wild and domesticated trees, with only limited gene flow being detected between wild trees and domesticated landraces. We identified 11 domestication sweeps and five breeding sweeps covering, respectively, 11.0 and 2.4 Mb of the P. avium genome. A considerable fraction of the domestication sweeps overlaps with those detected in the related species, Prunus persica (peach), indicating that artificial selection during domestication may have acted independently on the same regions and genes in the two species. We detected 104 candidate genes in sweep regions involved in different processes, such as the determination of fruit texture, the regulation of flowering and fruit ripening and the resistance to pathogens. The signatures of selection identified will enable future evolutionary studies and provide a valuable resource for genetic improvement and conservation programs in sweet cherry.
Collapse
Affiliation(s)
- Sara Pinosio
- Institute of Biosciences and Bioresources (IBBR), National Research Council, Via Madonna del Piano 10, Sesto Fiorentino, 50019, Italy
- Istituto di Genomica Applicata (IGA), Via Jacopo Linussio 51, Udine, 33100, Italy
| | - Fabio Marroni
- Istituto di Genomica Applicata (IGA), Via Jacopo Linussio 51, Udine, 33100, Italy
- Dipartimento di Scienze Agro-alimentari Ambientali e Animali (DI4A), Università di Udine, via delle Scienze 206, Udine, 33100, Italy
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, 56124, Italy
| | - Nicola Vitulo
- Dipartimento di Biotecnologie, Università degli Studi di Verona, Strada Le Grazie 15, Verona, 37134, Italy
| | - Stephanie Mariette
- BIOGECO, INRA, University of Bordeaux, route d'Arcachon 69, Cestas, 33612, France
| | - Gabriella Sonnante
- Institute of Biosciences and Bioresources (IBBR), National Research Council, via Amendola 165/A, Bari, 70126, Italy
| | - Filippos A Aravanopoulos
- Faculty of Forestry and Natural Environment, Laboratory of Forest Genetics and Tree Breeding, Aristotle University of Thessaloniki, Thessaloníki, 54124, Greece
| | - Ioannis Ganopoulos
- Institute of Plant Breeding and Genetic Resources, Hellenic Agricultural Organization-DEMETER, Thermi, 57001, Greece
| | - Marino Palasciano
- Dipartimento di Scienze del Suolo, Università degli Studi di Bari Aldo Moro, della Pianta e degli Alimenti, Piazza Umberto I, Bari, 70121, Italy
| | - Michele Vidotto
- Istituto di Genomica Applicata (IGA), Via Jacopo Linussio 51, Udine, 33100, Italy
| | - Gabriele Magris
- Istituto di Genomica Applicata (IGA), Via Jacopo Linussio 51, Udine, 33100, Italy
- Dipartimento di Scienze Agro-alimentari Ambientali e Animali (DI4A), Università di Udine, via delle Scienze 206, Udine, 33100, Italy
| | - Amy Iezzoni
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824-1325, USA
| | - Giovanni G Vendramin
- Institute of Biosciences and Bioresources (IBBR), National Research Council, Via Madonna del Piano 10, Sesto Fiorentino, 50019, Italy
| | - Michele Morgante
- Istituto di Genomica Applicata (IGA), Via Jacopo Linussio 51, Udine, 33100, Italy
- Dipartimento di Scienze Agro-alimentari Ambientali e Animali (DI4A), Università di Udine, via delle Scienze 206, Udine, 33100, Italy
| |
Collapse
|
11
|
Perumal S, James B, Tang L, Kagale S, Robinson SJ, Yang TJ, Parkin IAP. Characterization of B-Genome Specific High Copy hAT MITE Families in Brassica nigra Genome. FRONTIERS IN PLANT SCIENCE 2020; 11:1104. [PMID: 32793262 PMCID: PMC7385995 DOI: 10.3389/fpls.2020.01104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 07/06/2020] [Indexed: 06/11/2023]
Abstract
Miniature inverted-repeat transposable elements (MITEs) are non-autonomous class II transposons which have been shown to influence genome evolution. Brassica nigra L. (B-genome) is one of three Brassica diploids cultivated primarily as an oil crop, which harbors novel alleles important for breeding. Two new high copy hAT MITE families (BniHAT-1 and BniHAT-2) from the B-genome were characterized and their prevalence assessed in the genomes of the related diploids, rapa L. (A) and Brassica oleracea L. (C). Both novel MITE families were present at high copy numbers in the B-genome with 434 and 331 copies of BniHAT-1 and BniHAT-2, respectively. Yet less than 20 elements were identified in the genome assemblies of the A, and C -genomes, supporting B-genome specific proliferation of these MITE families. Although apparently randomly distributed across the genome, 68 and 70% of the B-genome MITEs were present within 2 kb flanking regions of annotated genes suggesting they might influence gene expression and/or function. In addition, MITE derived microRNAs and transcription factor binding sites suggested a putative role in gene regulation. Age of insertion analysis revealed that the major proliferation of these elements occurred during 2-3 million years ago. Additionally, site-specific polymorphism analyses showed that 44% MITEs were undergoing active amplification into the B-genome. Overall, this study provides a comprehensive analysis of two high copy MITE families, which were specifically amplified in the B-genome, suggesting a potential role in shaping the Brassica B-genome.
Collapse
Affiliation(s)
| | - Brian James
- Agriculture and Agri-Food Canada, Saskatoon, SK, Canada
| | - Lily Tang
- Agriculture and Agri-Food Canada, Saskatoon, SK, Canada
| | | | | | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, and Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea
| | | |
Collapse
|
12
|
Vondrak T, Ávila Robledillo L, Novák P, Koblížková A, Neumann P, Macas J. Characterization of repeat arrays in ultra-long nanopore reads reveals frequent origin of satellite DNA from retrotransposon-derived tandem repeats. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:484-500. [PMID: 31559657 PMCID: PMC7004042 DOI: 10.1111/tpj.14546] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 09/09/2019] [Accepted: 09/12/2019] [Indexed: 05/21/2023]
Abstract
Amplification of monomer sequences into long contiguous arrays is the main feature distinguishing satellite DNA from other tandem repeats, yet it is also the main obstacle in its investigation because these arrays are in principle difficult to assemble. Here we explore an alternative, assembly-free approach that utilizes ultra-long Oxford Nanopore reads to infer the length distribution of satellite repeat arrays, their association with other repeats and the prevailing sequence periodicities. Using the satellite DNA-rich legume plant Lathyrus sativus as a model, we demonstrated this approach by analyzing 11 major satellite repeats using a set of nanopore reads ranging from 30 to over 200 kb in length and representing 0.73× genome coverage. We found surprising differences between the analyzed repeats because only two of them were predominantly organized in long arrays typical for satellite DNA. The remaining nine satellites were found to be derived from short tandem arrays located within LTR-retrotransposons that occasionally expanded in length. While the corresponding LTR-retrotransposons were dispersed across the genome, this array expansion occurred mainly in the primary constrictions of the L. sativus chromosomes, which suggests that these genome regions are favourable for satellite DNA accumulation.
Collapse
Affiliation(s)
- Tihana Vondrak
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Laura Ávila Robledillo
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Petr Novák
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Andrea Koblížková
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Pavel Neumann
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Jiří Macas
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| |
Collapse
|
13
|
Bolsheva NL, Melnikova NV, Kirov IV, Dmitriev AA, Krasnov GS, Amosova АV, Samatadze TE, Yurkevich OY, Zoshchuk SA, Kudryavtseva AV, Muravenko OV. Characterization of repeated DNA sequences in genomes of blue-flowered flax. BMC Evol Biol 2019; 19:49. [PMID: 30813893 PMCID: PMC6391757 DOI: 10.1186/s12862-019-1375-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Members of different sections of the genus Linum are characterized by wide variability in size, morphology and number of chromosomes in karyotypes. Since such variability is determined mainly by the amount and composition of repeated sequences, we conducted a comparative study of the repeatomes of species from four sections forming a clade of blue-flowered flax. Based on the results of high-throughput genome sequencing performed in this study as well as available WGS data, bioinformatic analyses of repeated sequences from 12 flax samples were carried out using a graph-based clustering method. RESULTS It was found that the genomes of closely related species, which have a similar karyotype structure, are also similar in the repeatome composition. In contrast, the repeatomes of karyologically distinct species differed significantly, and no similar tandem-organized repeats have been identified in their genomes. At the same time, many common mobile element families have been identified in genomes of all species, among them, Athila Ty3/gypsy LTR retrotransposon was the most abundant. The 30-chromosome members of the sect. Linum (including the cultivated species L. usitatissimum) differed significantly from other studied species by a great number of satellite DNA families as well as their relative content in genomes. CONCLUSIONS The evolution of studied flax species was accompanied by waves of amplification of satellite DNAs and LTR retrotransposons. The observed inverse correlation between the total contents of dispersed repeats and satellite DNAs allowed to suggest a relationship between both classes of repeating sequences. Significant interspecific differences in satellite DNA sets indicated a high rate of evolution of this genomic fraction. The phylogenetic relationships between the investigated flax species, obtained by comparison of the repeatomes, agreed with the results of previous molecular phylogenetic studies.
Collapse
Affiliation(s)
- Nadezhda L. Bolsheva
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Nataliya V. Melnikova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Ilya V. Kirov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Alexey A. Dmitriev
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - George S. Krasnov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Аlexandra V. Amosova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Tatiana E. Samatadze
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Olga Yu. Yurkevich
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | | | - Anna V. Kudryavtseva
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Olga V. Muravenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
14
|
Dennenmoser S, Sedlazeck FJ, Schatz MC, Altmüller J, Zytnicki M, Nolte AW. Genome‐wide patterns of transposon proliferation in an evolutionary young hybrid fish. Mol Ecol 2019; 28:1491-1505. [DOI: 10.1111/mec.14969] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 10/15/2018] [Accepted: 10/23/2018] [Indexed: 01/19/2023]
Affiliation(s)
- Stefan Dennenmoser
- Institute for Biology and Environmental Sciences Carl von Ossietzky University Oldenburg Oldenburg Germany
| | | | - Michael C. Schatz
- Cold Spring Harbor Laboratory Cold Spring Harbor New York
- Departments of Computer Science and Biology Johns Hopkins University Baltimore Maryland
| | - Janine Altmüller
- Cologne Center for Genomics, and Institute of Human Genetics University of Cologne Cologne Germany
| | | | - Arne W. Nolte
- Institute for Biology and Environmental Sciences Carl von Ossietzky University Oldenburg Oldenburg Germany
| |
Collapse
|
15
|
Liao Y, Zhang X, Li B, Liu T, Chen J, Bai Z, Wang M, Shi J, Walling JG, Wing RA, Jiang J, Chen M. Comparison of Oryza sativa and Oryza brachyantha Genomes Reveals Selection-Driven Gene Escape from the Centromeric Regions. THE PLANT CELL 2018; 30:1729-1744. [PMID: 29967288 PMCID: PMC6139686 DOI: 10.1105/tpc.18.00163] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 05/23/2018] [Accepted: 06/28/2018] [Indexed: 05/03/2023]
Abstract
Centromeres are dynamic chromosomal regions, and the genetic and epigenetic environment of the centromere is often regarded as oppressive to protein-coding genes. Here, we used comparative genomic and phylogenomic approaches to study the evolution of centromeres and centromere-linked genes in the genus Oryza We report a 12.4-Mb high-quality BAC-based pericentromeric assembly for Oryza brachyantha, which diverged from cultivated rice (Oryza sativa) ∼15 million years ago. The synteny analyses reveal seven medium (>50 kb) pericentric inversions in O. sativa and 10 in O. brachyantha Of these inversions, three resulted in centromere movement (Chr1, Chr7, and Chr9). Additionally, we identified a potential centromere-repositioning event, in which the ancestral centromere on chromosome 12 in O. brachyantha jumped ∼400 kb away, possibly mediated by a duplicated transposition event (>28 kb). More strikingly, we observed an excess of syntenic gene loss at and near the centromeric regions (P < 2.2 × 10-16). Most (33/47) of the missing genes moved to other genomic regions; therefore such excess could be explained by the selective loss of the copy in or near centromeric regions after gene duplication. The pattern of gene loss immediately adjacent to centromeric regions suggests centromere chromatin dynamics (e.g., spreading or microrepositioning) may drive such gene loss.
Collapse
Affiliation(s)
- Yi Liao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xuemei Zhang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Bo Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Tieyan Liu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jinfeng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zetao Bai
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Meijiao Wang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinfeng Shi
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jason G Walling
- USDA-ARS-MWA-Cereal Crops Research Unit, Madison, Wisconsin 53726
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences, BIO5 Institute, University of Arizona, Tucson, Arizona 85721
| | - Jiming Jiang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706
- Department of Plant Biology, Department of Horticulture, Michigan State University, East Lansing, Michigan 48824
| | - Mingsheng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
16
|
Ávila Robledillo L, Koblížková A, Novák P, Böttinger K, Vrbová I, Neumann P, Schubert I, Macas J. Satellite DNA in Vicia faba is characterized by remarkable diversity in its sequence composition, association with centromeres, and replication timing. Sci Rep 2018; 8:5838. [PMID: 29643436 PMCID: PMC5895790 DOI: 10.1038/s41598-018-24196-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2017] [Accepted: 03/28/2018] [Indexed: 11/17/2022] Open
Abstract
Satellite DNA, a class of repetitive sequences forming long arrays of tandemly repeated units, represents substantial portions of many plant genomes yet remains poorly characterized due to various methodological obstacles. Here we show that the genome of the field bean (Vicia faba, 2n = 12), a long-established model for cytogenetic studies in plants, contains a diverse set of satellite repeats, most of which remained concealed until their present investigation. Using next-generation sequencing combined with novel bioinformatics tools, we reconstructed consensus sequences of 23 novel satellite repeats representing 0.008–2.700% of the genome and mapped their distribution on chromosomes. We found that in addition to typical satellites with monomers hundreds of nucleotides long, V. faba contains a large number of satellite repeats with unusually long monomers (687–2033 bp), which are predominantly localized in pericentromeric regions. Using chromatin immunoprecipitation with CenH3 antibody, we revealed an extraordinary diversity of centromeric satellites, consisting of seven repeats with chromosome-specific distribution. We also found that in spite of their different nucleotide sequences, all centromeric repeats are replicated during mid-S phase, while most other satellites are replicated in the first part of late S phase, followed by a single family of FokI repeats representing the latest replicating chromatin.
Collapse
Affiliation(s)
- Laura Ávila Robledillo
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic.,University of South Bohemia, Faculty of Science, České Budějovice, 37005, Czech Republic
| | - Andrea Koblížková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic
| | - Katharina Böttinger
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic.,University of South Bohemia, Faculty of Science, České Budějovice, 37005, Czech Republic
| | - Iva Vrbová
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic
| | - Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic
| | - Ingo Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Stadt Seeland, Germany
| | - Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, 37005, Czech Republic.
| |
Collapse
|
17
|
Zhou M, Liang L, Hänninen H. A transposition-active Phyllostachys edulis long terminal repeat (LTR) retrotransposon. JOURNAL OF PLANT RESEARCH 2018; 131:203-210. [PMID: 29110195 DOI: 10.1007/s10265-017-0983-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 09/04/2017] [Indexed: 06/07/2023]
Abstract
Due to infrequent sexual reproduction, moso bamboo breeding by hybridization is extremely technically difficult. Insertional mutagenesis based on endogenous active transposons may thus serve as an alternative method to create new germplasm of moso bamboo. In the present study, using LTR-STRUC, a full-length intact long terminal repeat (LTR) retrotransposon was identified in the moso bamboo genome and was named PHRE2 (Phyllostachys edulis retrotransposon 2). The 5' and 3' LTR sequences of PHRE2 were highly (98.39%) similar. PHRE2 contains all domains necessary for transposition such as gag, pr, rt, rh, and int. The coding frames of these essential domains were complete and had no apparent mutations. In addition, PHRE2 possessed a prime binding site (PBS), a polypurine tract (PPT), and two typical sequences of LTR retrotransposons. A genome-wide scan showed that the moso bamboo genome has only one full-length sequence of PHYRE2. After its transfer to Arabidopsis thaliana, an increase in PHRE2 copy number occurred in the T3 plants compared to in the T2 plants. After moso bamboo seedlings were grown in tissue culture or treated by irradiation or plant hormones, the copy number of PHRE2 significantly increased. These findings indicate that PHRE2 has the capacity for transposition, which can be induced by environmental conditions.
Collapse
Affiliation(s)
- Mingbing Zhou
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Linan, 311300, Zhejiang, People's Republic of China.
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-efficiency Utilization, Zhejiang A & F University, Linan, 311300, Zhejiang, People's Republic of China.
| | - Linlin Liang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Linan, 311300, Zhejiang, People's Republic of China
| | - Heikki Hänninen
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Linan, 311300, Zhejiang, People's Republic of China
| |
Collapse
|
18
|
Liu W, Xiao Z, Fan C, Jiang N, Meng X, Xiang X. Cloning and Characterization of a Flavonol Synthase Gene From Litchi chinensis and Its Variation Among Litchi Cultivars With Different Fruit Maturation Periods. FRONTIERS IN PLANT SCIENCE 2018; 9:567. [PMID: 29922308 PMCID: PMC5996885 DOI: 10.3389/fpls.2018.00567] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Accepted: 04/11/2018] [Indexed: 05/20/2023]
Abstract
Litchi (Litchi chinensis) is an important subtropical fruit tree with high commercial value. However, the short and centralized fruit maturation period of litchi cultivars represents a bottleneck for litchi production. Therefore, the development of novel cultivars with extremely early fruit maturation period is critical. Previously, we showed that the genotypes of extremely early-maturing (EEM), early-maturing (EM), and middle-to-late-maturing (MLM) cultivars at a specific locus SNP51 (substitution type C/T) were consistent with their respective genetic background at the whole-genome level; a homozygous C/C genotype at SNP51 systematically differentiated EEM cultivars from others. The litchi gene on which SNP51 was located was annotated as flavonol synthase (FLS), which catalyzes the formation of flavonols. Here, we further elucidate the variation of the FLS gene from L. chinensis (LcFLS) among EEM, EM, and MLM cultivars. EEM cultivars with a homozygous C/C genotype at SNP51 all contained the same 2,199-bp sequence of the LcFLS gene. For MLM cultivars with a homozygous T/T genotype at SNP51, the sequence lengths of the LcFLS gene were 2,202-2,222 bp. EM cultivars with heterozygous C/T genotypes at SNP51 contained two different alleles of the LcFLS gene: a 2,199-bp sequence identical to that in EEM cultivars and a 2,205-bp sequence identical to that in MLM cultivar 'Heiye.' Moreover, the coding regions of LcFLS genes of other MLM cultivars were almost identical to that of 'Heiye.' Therefore, the LcFLS gene coding region may be used as a source of diagnostic SNP markers to discriminate or identify genotypes with the EEM trait. The expression pattern of the LcFLS gene and accumulation pattern of flavonol from EEM, EM, and MLM cultivars were analyzed and compared using quantitative real-time PCR (qRT-PCR) and high-performance liquid chromatography (HPLC) for mature leaves, flower buds, and fruits, 15, 30, 45, and 60 days after anthesis. Flavonol content and LcFLS gene expression levels were positively correlated in all three cultivars: both decreased from the EEM to MLM cultivars, with moderate levels in the EM cultivars. LcFLS gene function could be further analyzed to elucidate its correlation with phenotype variation among litchi cultivars with different fruit maturation periods.
Collapse
Affiliation(s)
- Wei Liu
- Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China
| | - Zhidan Xiao
- Guangdong Provincial Key Laboratory of Biotechnology for Plant Development, School of Life Sciences, South China Normal University, Guangzhou, China
| | - Chao Fan
- Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China
| | - Nonghui Jiang
- Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China
| | - Xiangchun Meng
- Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China
| | - Xu Xiang
- Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China
- *Correspondence: Xu Xiang,
| |
Collapse
|
19
|
Xu Z, Liu J, Ni W, Peng Z, Guo Y, Ye W, Huang F, Zhang X, Xu P, Guo Q, Shen X, Du J. GrTEdb: the first web-based database of transposable elements in cotton (Gossypium raimondii). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:3084694. [PMID: 28365739 PMCID: PMC5467567 DOI: 10.1093/database/bax013] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 01/13/2017] [Indexed: 11/21/2022]
Abstract
Although several diploid and tetroploid Gossypium species genomes have been sequenced, the well annotated web-based transposable elements (TEs) database is lacking. To better understand the roles of TEs in structural, functional and evolutionary dynamics of the cotton genome, a comprehensive, specific, and user-friendly web-based database, Gossypium raimondii transposable elements database (GrTEdb), was constructed. A total of 14 332 TEs were structurally annotated and clearly categorized in G. raimondii genome, and these elements have been classified into seven distinct superfamilies based on the order of protein-coding domains, structures and/or sequence similarity, including 2929 Copia-like elements, 10 368 Gypsy-like elements, 299 L1, 12 Mutators, 435 PIF-Harbingers, 275 CACTAs and 14 Helitrons. Meanwhile, the web-based sequence browsing, searching, downloading and blast tool were implemented to help users easily and effectively to annotate the TEs or TE fragments in genomic sequences from G. raimondii and other closely related Gossypium species. GrTEdb provides resources and information related with TEs in G. raimondii, and will facilitate gene and genome analyses within or across Gossypium species, evaluating the impact of TEs on their host genomes, and investigating the potential interaction between TEs and protein-coding genes in Gossypium species. Database URL: http://www.grtedb.org/
Collapse
Affiliation(s)
- Zhenzhen Xu
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Jing Liu
- Provincial Key Laboratory of Agrobiology, The Institute of Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Wanchao Ni
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Zhen Peng
- Provincial Key Laboratory of Agrobiology, The Institute of Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Yue Guo
- Provincial Key Laboratory of Agrobiology, The Institute of Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Wuwei Ye
- State Key Laboratory of Cotton Biology, The Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, China
| | - Fang Huang
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Xianggui Zhang
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Peng Xu
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Qi Guo
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Xinlian Shen
- Key Laboratory of Cotton and Rapeseed (Nanjing), The Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Jianchang Du
- Provincial Key Laboratory of Agrobiology, The Institute of Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| |
Collapse
|
20
|
Dennenmoser S, Sedlazeck FJ, Iwaszkiewicz E, Li X, Altmüller J, Nolte AW. Copy number increases of transposable elements and protein-coding genes in an invasive fish of hybrid origin. Mol Ecol 2017; 26:4712-4724. [PMID: 28390096 PMCID: PMC5638112 DOI: 10.1111/mec.14134] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Revised: 03/23/2017] [Accepted: 03/27/2017] [Indexed: 12/25/2022]
Abstract
Evolutionary dynamics of structural genetic variation in lineages of hybrid origin is not well explored, although structural mutations may increase in controlled hybrid crosses. We therefore tested whether structural variants accumulate in a fish of recent hybrid origin, invasive Cottus, relative to both parental species Cottus rhenanus and Cottus perifretum. Copy-number variation in exons of 10,979 genes was assessed using comparative genome hybridization arrays. Twelve genes showed significantly higher copy numbers in invasive Cottus compared to both parents. This coincided with increased expression for three genes related to vision, detoxification and muscle development, suggesting possible gene dosage effects. Copy number increases of putative transposons were assessed by comparative mapping of genomic DNA reads against a de novo assembly of 1,005 repetitive elements. In contrast to exons, copy number increases of repetitive elements were common (20.7%) in invasive Cottus, whereas decrease was very rare (0.01%). Among the increased repetitive elements, 53.8% occurred at higher numbers in C. perifretum compared to C. rhenanus, while only 1.4% were more abundant in C. rhenanus. This implies a biased mutational process that amplifies genetic material from one ancestor. To assess the frequency of de novo mutations through hybridization, we screened 64 laboratory-bred F2 offspring between the parental species for copy-number changes at five candidate loci. We found no evidence for new structural variants, indicating that they are too rare to be detected given our sampling scheme. Instead, they must have accumulated over more generations than we observed in a controlled cross.
Collapse
Affiliation(s)
- Stefan Dennenmoser
- Department for Evolutionary GeneticsMax‐Planck Institute for Evolutionary BiologyPlönGermany
- Institute for BiologyCarl von Ossietzky University OldenburgOldenburgGermany
| | | | - Elzbieta Iwaszkiewicz
- Department for Evolutionary GeneticsMax‐Planck Institute for Evolutionary BiologyPlönGermany
| | - Xiang‐Yi Li
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
| | - Janine Altmüller
- Cologne Center for Genomics, and Institute of Human GeneticsUniversity of CologneCologneGermany
| | - Arne W. Nolte
- Department for Evolutionary GeneticsMax‐Planck Institute for Evolutionary BiologyPlönGermany
- Institute for BiologyCarl von Ossietzky University OldenburgOldenburgGermany
| |
Collapse
|
21
|
Zhou M, Tao G, Pi P, Zhu Y, Bai Y, Meng X. Genome-wide characterization and evolution analysis of miniature inverted-repeat transposable elements (MITEs) in moso bamboo (Phyllostachys heterocycla). PLANTA 2016; 244:775-787. [PMID: 27160169 DOI: 10.1007/s00425-016-2544-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 05/01/2016] [Indexed: 06/05/2023]
Abstract
Moso bamboo MITEs were genome-wide identified first time, and data shows that MITEs contribute to the genomic diversity and differentiation of bamboo. Miniature inverted-repeat transposable elements (MITEs) are widespread in animals and plants. There are a large number of transposable elements in moso bamboo (Phyllostachys heterocycla var. pubescens) genome, but the genome-wide information of moso bamboo MITEs is not known yet. Here we identified 362 MITE families with a total of 489,592 MITE-related sequences, accounting for 4.74 % of the moso bamboo genome. The 362 MITE families are clustered into six known and one unknown super-families. Our analysis indicated that moso bamboo MITEs preferred to reside in or near the genes that might be involved in regulation of host gene expression. Of the seven super-families, three might undergo major expansion event twice, respectively, during 8-11 million years ago (mya) ago and 22-28 mya ago; two might experience a long expansion period from 6 to 13 mya. Almost 1/3 small RNAs might be derived from the MITE sequences. Some MITE families generate small RNAs mainly from the terminals, while others predominantly from the central region. Given the high copy number of MITEs, many siRNAs and miRNAs derived from MITE sequences and the preferential insertion of MITE into gene regions, MITEs may contribute to the genomic diversity and differentiation of bamboo.
Collapse
Affiliation(s)
- Mingbing Zhou
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China.
| | - Guiyun Tao
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China
| | - Peiyao Pi
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China
| | - Yihang Zhu
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China
| | - Youhuang Bai
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China
| | - Xianwen Meng
- The Nurturing Station for the State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, LinAn, 311300, Zhejiang Province, People's Republic of China
| |
Collapse
|
22
|
Huang YC, Lee CC, Kao CY, Chang NC, Lin CC, Shoemaker D, Wang J. Evolution of long centromeres in fire ants. BMC Evol Biol 2016; 16:189. [PMID: 27628313 PMCID: PMC5024525 DOI: 10.1186/s12862-016-0760-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Accepted: 09/05/2016] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Centromeres are essential for accurate chromosome segregation, yet sequence conservation is low even among closely related species. Centromere drive predicts rapid turnover because some centromeric sequences may compete better than others during female meiosis. In addition to sequence composition, longer centromeres may have a transmission advantage. RESULTS We report the first observations of extremely long centromeres, covering on average 34 % of the chromosomes, in the red imported fire ant Solenopsis invicta. By comparison, cytological examination of Solenopsis geminata revealed typical small centromeric constrictions. Bioinformatics and molecular analyses identified CenSol, the major centromeric satellite DNA repeat. We found that CenSol sequences are very similar between the two species but the CenSol copy number in S. invicta is much greater than that in S. geminata. In addition, centromere expansion in S. invicta is not correlated with the duplication of CenH3. Comparative analyses revealed that several closely related fire ant species also possess long centromeres. CONCLUSIONS Our results are consistent with a model of simple runaway centromere expansion due to centromere drive. We suggest expanded centromeres may be more prevalent in hymenopteran insects, which use haplodiploid sex determination, than previously considered.
Collapse
Affiliation(s)
- Yu-Ching Huang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Chih-Chi Lee
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Chia-Yi Kao
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ni-Chen Chang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Laboratory of Insect Social Evolution, The Rockefeller University, New York, NY USA
| | - Chung-Chi Lin
- Department of Biology, National Changhua University of Education, Changhua, Taiwan
| | - DeWayne Shoemaker
- United States Department of Agriculture, Agricultural Research Service, Gainesville, FL USA
- Entomology and Plant Pathology Department, University of Tennessee, Knoxville, TN USA
| | - John Wang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
23
|
De Novo Centromere Formation and Centromeric Sequence Expansion in Wheat and its Wide Hybrids. PLoS Genet 2016; 12:e1005997. [PMID: 27110907 PMCID: PMC4844185 DOI: 10.1371/journal.pgen.1005997] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Accepted: 03/28/2016] [Indexed: 11/19/2022] Open
Abstract
Centromeres typically contain tandem repeat sequences, but centromere function does not necessarily depend on these sequences. We identified functional centromeres with significant quantitative changes in the centromeric retrotransposons of wheat (CRW) contents in wheat aneuploids (Triticum aestivum) and the offspring of wheat wide hybrids. The CRW signals were strongly reduced or essentially lost in some wheat ditelosomic lines and in the addition lines from the wide hybrids. The total loss of the CRW sequences but the presence of CENH3 in these lines suggests that the centromeres were formed de novo. In wheat and its wide hybrids, which carry large complex genomes or no sequenced genome, we performed CENH3-ChIP-dot-blot methods alone or in combination with CENH3-ChIP-seq and identified the ectopic genomic sequences present at the new centromeres. In adcdition, the transcription of the identified DNA sequences was remarkably increased at the new centromere, suggesting that the transcription of the corresponding sequences may be associated with de novo centromere formation. Stable alien chromosomes with two and three regions containing CRW sequences induced by centromere breakage were observed in the wheat-Th. elongatum hybrid derivatives, but only one was a functional centromere. In wheat-rye (Secale cereale) hybrids, the rye centromere-specific sequences spread along the chromosome arms and may have caused centromere expansion. Frequent and significant quantitative alterations in the centromere sequence via chromosomal rearrangement have been systematically described in wheat wide hybridizations, which may affect the retention or loss of the alien chromosomes in the hybrids. Thus, the centromere behavior in wide crosses likely has an important impact on the generation of biodiversity, which ultimately has implications for speciation. Chromosomal rearrangements during the formation of wheat aneuploids and their wide hybrids caused reduction, elimination or expansion of the centromeric retrotransposon sequences and the formation of multiple centromeres. Centromere function was not affected by centromeric sequence elimination, which was revealed by the de novo formation of centromeres on the rearranged chromosomes. Several retrotransposon-like elements near the former centromeres were embedded in the newly formed centromeres, and there were no obvious changes in six histone modifications between normal and new centromeres. The DNA sequences associated with the new centromeres are transcribed at a higher level after centromere formation. Chromosomes containing the neocentromeres can be stably transferred to the next generation. Chromosomes carrying two- or three-locus centromeres are unstable, which induces the formation of novel chromosomes through centromere breakage in wheat-Th. elongatum hybrid derivatives. The centromere-specific sequences on dicentric chromosomes are expanded to the chromosome arms in wheat-rye hybrids, and these sequences may function as a part of the active centromere to cause chromosome breakage in the next generation. Centromere variation and activity in wheat aneuploids and its wide hybrids may be associated with chromosome stability, rearrangements, and novel chromosome formations.
Collapse
|
24
|
Hall AB, Papathanos PA, Sharma A, Cheng C, Akbari OS, Assour L, Bergman NH, Cagnetti A, Crisanti A, Dottorini T, Fiorentini E, Galizi R, Hnath J, Jiang X, Koren S, Nolan T, Radune D, Sharakhova MV, Steele A, Timoshevskiy VA, Windbichler N, Zhang S, Hahn MW, Phillippy AM, Emrich SJ, Sharakhov IV, Tu ZJ, Besansky NJ. Radical remodeling of the Y chromosome in a recent radiation of malaria mosquitoes. Proc Natl Acad Sci U S A 2016; 113:E2114-23. [PMID: 27035980 PMCID: PMC4839409 DOI: 10.1073/pnas.1525164113] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Y chromosomes control essential male functions in many species, including sex determination and fertility. However, because of obstacles posed by repeat-rich heterochromatin, knowledge of Y chromosome sequences is limited to a handful of model organisms, constraining our understanding of Y biology across the tree of life. Here, we leverage long single-molecule sequencing to determine the content and structure of the nonrecombining Y chromosome of the primary African malaria mosquito, Anopheles gambiae We find that the An. gambiae Y consists almost entirely of a few massively amplified, tandemly arrayed repeats, some of which can recombine with similar repeats on the X chromosome. Sex-specific genome resequencing in a recent species radiation, the An. gambiae complex, revealed rapid sequence turnover within An. gambiae and among species. Exploiting 52 sex-specific An. gambiae RNA-Seq datasets representing all developmental stages, we identified a small repertoire of Y-linked genes that lack X gametologs and are not Y-linked in any other species except An. gambiae, with the notable exception of YG2, a candidate male-determining gene. YG2 is the only gene conserved and exclusive to the Y in all species examined, yet sequence similarity to YG2 is not detectable in the genome of a more distant mosquito relative, suggesting rapid evolution of Y chromosome genes in this highly dynamic genus of malaria vectors. The extensive characterization of the An. gambiae Y provides a long-awaited foundation for studying male mosquito biology, and will inform novel mosquito control strategies based on the manipulation of Y chromosomes.
Collapse
Affiliation(s)
- Andrew Brantley Hall
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Philippos-Aris Papathanos
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy; Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Atashi Sharma
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Changde Cheng
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - Omar S Akbari
- Department of Entomology, Riverside Center for Disease Vector Research, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521
| | - Lauren Assour
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Nicholas H Bergman
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Alessia Cagnetti
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy
| | - Andrea Crisanti
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy; Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Tania Dottorini
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Elisa Fiorentini
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Roberto Galizi
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Jonathan Hnath
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Xiaofang Jiang
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892
| | - Tony Nolan
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Diane Radune
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Maria V Sharakhova
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Laboratory of Evolutionary Cytogenetics, Tomsk State University, Tomsk 634050, Russia
| | - Aaron Steele
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Vladimir A Timoshevskiy
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Nikolai Windbichler
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Simo Zhang
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405
| | - Matthew W Hahn
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405; Department of Biology, Indiana University, Bloomington, IN 47405
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892
| | - Scott J Emrich
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Igor V Sharakhov
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Laboratory of Evolutionary Cytogenetics, Tomsk State University, Tomsk 634050, Russia;
| | - Zhijian Jake Tu
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Nora J Besansky
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556;
| |
Collapse
|
25
|
Craddock EM, Gall JG, Jonas M. Hawaiian Drosophila genomes: size variation and evolutionary expansions. Genetica 2016; 144:107-24. [PMID: 26790663 DOI: 10.1007/s10709-016-9882-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Accepted: 01/09/2016] [Indexed: 01/24/2023]
Abstract
This paper reports genome sizes of one Hawaiian Scaptomyza and 16 endemic Hawaiian Drosophila species that include five members of the antopocerus species group, one member of the modified mouthpart group, and ten members of the picture wing clade. Genome size expansions have occurred independently multiple times among Hawaiian Drosophila lineages, and have resulted in an over 2.3-fold range of genome sizes among species, with the largest observed in Drosophila cyrtoloma (1C = 0.41 pg). We find evidence that these repeated genome size expansions were likely driven by the addition of significant amounts of heterochromatin and satellite DNA. For example, our data reveal that the addition of seven heterochromatic chromosome arms to the ancestral haploid karyotype, and a remarkable proportion of ~70 % satellite DNA, account for the greatly expanded size of the D. cyrtoloma genome. Moreover, the genomes of 13/17 Hawaiian picture wing species are composed of substantial proportions (22-70 %) of detectable satellites (all but one of which are AT-rich). Our results suggest that in this tightly knit group of recently evolved species, genomes have expanded, in large part, via evolutionary amplifications of satellite DNA sequences in centric and pericentric domains (especially of the X and dot chromosomes), which have resulted in longer acrocentric chromosomes or metacentrics with an added heterochromatic chromosome arm. We discuss possible evolutionary mechanisms that may have shaped these patterns, including rapid fixation of novel expanded genomes during founder-effect speciation.
Collapse
Affiliation(s)
- Elysse M Craddock
- Natural Sciences Building, Purchase College, State University of New York, 735 Anderson Hill Road, Purchase, NY, 10577, USA.
| | - Joseph G Gall
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, USA
| | - Mark Jonas
- Natural Sciences Building, Purchase College, State University of New York, 735 Anderson Hill Road, Purchase, NY, 10577, USA
| |
Collapse
|
26
|
Wolfgruber TK, Nakashima MM, Schneider KL, Sharma A, Xie Z, Albert PS, Xu R, Bilinski P, Dawe RK, Ross-Ibarra J, Birchler JA, Presting GG. High Quality Maize Centromere 10 Sequence Reveals Evidence of Frequent Recombination Events. FRONTIERS IN PLANT SCIENCE 2016; 7:308. [PMID: 27047500 PMCID: PMC4806543 DOI: 10.3389/fpls.2016.00308] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 02/27/2016] [Indexed: 05/02/2023]
Abstract
The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here, we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 × 10(-6) and 5 × 10(-5) for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb from the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length CR from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. In many cases examined here, DSB repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to efficiently repair frequent DSBs in centromeres.
Collapse
Affiliation(s)
- Thomas K. Wolfgruber
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Megan M. Nakashima
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Kevin L. Schneider
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Anupma Sharma
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Zidian Xie
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Patrice S. Albert
- Division of Biological Sciences, University of MissouriColumbia, MO, USA
| | - Ronghui Xu
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
| | - Paul Bilinski
- Department of Plant Sciences, University of California DavisDavis, CA, USA
| | - R. Kelly Dawe
- Department of Plant Biology, University of GeorgiaAthens, GA, USA
| | | | - James A. Birchler
- Division of Biological Sciences, University of MissouriColumbia, MO, USA
| | - Gernot G. Presting
- Department of Molecular Biosciences and Bioengineering, University of Hawaíi at MānoaHonolulu, HI, USA
- *Correspondence: Gernot G. Presting
| |
Collapse
|
27
|
Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin. Proc Natl Acad Sci U S A 2015; 112:13633-8. [PMID: 26489653 DOI: 10.1073/pnas.1512255112] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Holocentric chromosomes lack a primary constriction, in contrast to monocentrics. They form kinetochores distributed along almost the entire poleward surface of the chromatids, to which spindle fibers attach. No centromere-specific DNA sequence has been found for any holocentric organism studied so far. It was proposed that centromeric repeats, typical for many monocentric species, could not occur in holocentrics, most likely because of differences in the centromere organization. Here we show that the holokinetic centromeres of the Cyperaceae Rhynchospora pubera are highly enriched by a centromeric histone H3 variant-interacting centromere-specific satellite family designated "Tyba" and by centromeric retrotransposons (i.e., CRRh) occurring as genome-wide interspersed arrays. Centromeric arrays vary in length from 3 to 16 kb and are intermingled with gene-coding sequences and transposable elements. We show that holocentromeres of metaphase chromosomes are composed of multiple centromeric units rather than possessing a diffuse organization, thus favoring the polycentric model. A cell-cycle-dependent shuffling of multiple centromeric units results in the formation of functional (poly)centromeres during mitosis. The genome-wide distribution of centromeric repeat arrays interspersing the euchromatin provides a previously unidentified type of centromeric chromatin organization among eukaryotes. Thus, different types of holocentromeres exist in different species, namely with and without centromeric repetitive sequences.
Collapse
|
28
|
Garrido-Ramos MA. Satellite DNA in Plants: More than Just Rubbish. Cytogenet Genome Res 2015; 146:153-170. [PMID: 26202574 DOI: 10.1159/000437008] [Citation(s) in RCA: 111] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/20/2015] [Indexed: 11/19/2022] Open
Abstract
For decades, satellite DNAs have been the hidden part of genomes. Initially considered as junk DNA, there is currently an increasing appreciation of the functional significance of satellite DNA repeats and of their sequences. Satellite DNA families accumulate in the heterochromatin in different parts of the eukaryotic chromosomes, mainly in pericentromeric and subtelomeric regions, but they also span the functional centromere. Tandem repeat sequences may spread from subtelomeric to interstitial loci, leading to the formation of chromosome-specific loci or to the accumulation in equilocal sites in different chromosomes. They also appear as the main components of the heterochromatin in the sex-specific region of sex chromosomes. Satellite DNA, required for chromosome organization, also plays a role in pairing and segregation. Some satellite repeats are transcribed and can participate in the formation and maintenance of heterochromatin structure and in the modulation of gene expression. In addition to the identification of the different satellite DNA families, their characteristics and location, we are interested in determining their impact on the genomes, by identifying the mechanisms leading to their appearance and amplification as well as in understanding how they change over time, the factors affecting these changes, and the influence exerted by the evolutionary history of the organisms. On the other hand, satellite DNA sequences are rapidly evolving sequences that may cause reproductive barriers between organisms and promote speciation. The accumulation of experimental data collected in recent years and the emergence of new approaches based on next-generation sequencing and high-throughput genome analysis are opening new perspectives that are changing our understanding of satellite DNA. This review examines recent data to provide a timely update on the overall information gathered about this part of the genome, focusing on the advances in the knowledge of its origin, its evolution, and its potential functional roles.
Collapse
|
29
|
Feliciello I, Akrap I, Brajković J, Zlatar I, Ugarković Đ. Satellite DNA as a driver of population divergence in the red flour beetle Tribolium castaneum. Genome Biol Evol 2014; 7:228-39. [PMID: 25527837 PMCID: PMC4316633 DOI: 10.1093/gbe/evu280] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Tandemly repeated satellite DNAs are among most rapidly evolving sequences in eukaryotic genome, usually differing significantly among closely related species. By inducing changes in heterochromatin and/or centromere, satellite DNAs are expected to drive population and species divergence. However, despite high evolutionary dynamics, divergence of satellite DNA profiles at the level of natural population which precedes and possibly triggers speciation process is not readily detected. Here, we characterize minor TCAST2 satellite DNA of the red flour beetle Tribolium castaneum and follow its dynamics among wild-type strains originating from diverse geographic locations. The investigation revealed presence of three distinct subfamilies of TCAST2 satellite DNA which differ in monomer size, genome organization, and subfamily specific mutations. Subfamilies Tcast2a and Tcast2b are tandemly arranged within pericentromeric heterochromatin whereas Tcast2c is preferentially dispersed within euchromatin of all chromosomes. Among strains, TCAST2 subfamilies are conserved in sequence but exhibit a significant content variability. This results in overrepresentation or almost complete absence of particular subfamily in some strains and enables discrimination between strains. It is proposed that homologous recombination, probably stimulated by environmental stress, is responsible for the emergence of TCAST2 satellite subfamilies, their copy number variation and dispersion within genome. The results represent the first evidence for the existence of population-specific satellite DNA profiles. Partial organization of TCAST2 satellite DNA in the form of single repeats dispersed within euchromatin additionally contributes to the genome divergence at the population level.
Collapse
Affiliation(s)
- Isidoro Feliciello
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia Laboratory of Experimental Biology, Department of Clinical Medicine and Surgery, University of Naples Federico II, Italy
| | - Ivana Akrap
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Josip Brajković
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Ivo Zlatar
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Đurđica Ugarković
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
30
|
Bilinski P, Distor K, Gutierrez-Lopez J, Mendoza GM, Shi J, Dawe RK, Ross-Ibarra J. Diversity and evolution of centromere repeats in the maize genome. Chromosoma 2014; 124:57-65. [PMID: 25190528 DOI: 10.1007/s00412-014-0483-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2014] [Revised: 07/21/2014] [Accepted: 08/11/2014] [Indexed: 10/24/2022]
Abstract
Centromere repeats are found in most eukaryotes and play a critical role in kinetochore formation. Though centromere repeats exhibit considerable diversity both within and among species, little is understood about the mechanisms that drive centromere repeat evolution. Here, we use maize as a model to investigate how a complex history involving polyploidy, fractionation, and recent domestication has impacted the diversity of the maize centromeric repeat CentC. We first validate the existence of long tandem arrays of repeats in maize and other taxa in the genus Zea. Although we find considerable sequence diversity among CentC copies genome-wide, genetic similarity among repeats is highest within these arrays, suggesting that tandem duplications are the primary mechanism for the generation of new copies. Nonetheless, clustering analyses identify similar sequences among distant repeats, and simulations suggest that this pattern may be due to homoplasious mutation. Although the two ancestral subgenomes of maize have contributed nearly equal numbers of centromeres, our analysis shows that the majority of all CentC repeats derive from one of the parental genomes, with an even stronger bias when examining the largest assembled contiguous clusters. Finally, by comparing maize with its wild progenitor teosinte, we find that the abundance of CentC likely decreased after domestication, while the pericentromeric repeat Cent4 has drastically increased.
Collapse
Affiliation(s)
- Paul Bilinski
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, USA
| | | | | | | | | | | | | |
Collapse
|
31
|
Mehrotra S, Goel S, Raina SN, Rajpal VR. Significance of satellite DNA revealed by conservation of a widespread repeat DNA sequence among angiosperms. Appl Biochem Biotechnol 2014; 173:1790-801. [PMID: 24907041 DOI: 10.1007/s12010-014-0966-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 05/16/2014] [Indexed: 10/25/2022]
Abstract
The analysis of plant genome structure and evolution requires comprehensive characterization of repetitive sequences that make up the majority of plant nuclear DNA. In the present study, we analyzed the nature of pCtKpnI-I and pCtKpnI-II tandem repeated sequences, reported earlier in Carthamus tinctorius. Interestingly, homolog of pCtKpnI-I repeat sequence was also found to be present in widely divergent families of angiosperms. pCtKpnI-I showed high sequence similarity but low copy number among various taxa of different families of angiosperms analyzed. In comparison, pCtKpnI-II was specific to the genus Carthamus and was not present in any other taxa analyzed. The molecular structure of pCtKpnI-I was analyzed in various unrelated taxa of angiosperms to decipher the evolutionary conserved nature of the sequence and its possible functional role.
Collapse
Affiliation(s)
- Shweta Mehrotra
- Department of Botany, University of Delhi, Delhi, 110007, India,
| | | | | | | |
Collapse
|
32
|
Centromere identity from the DNA point of view. Chromosoma 2014; 123:313-25. [PMID: 24763964 PMCID: PMC4107277 DOI: 10.1007/s00412-014-0462-0] [Citation(s) in RCA: 136] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Revised: 03/28/2014] [Accepted: 04/01/2014] [Indexed: 02/05/2023]
Abstract
The centromere is a chromosomal locus responsible for the faithful segregation of genetic material during cell division. It has become evident that centromeres can be established literally on any DNA sequence, and the possible synergy between DNA sequences and the most prominent centromere identifiers, protein components, and epigenetic marks remains uncertain. However, some evolutionary preferences seem to exist, and long-term established centromeres are frequently formed on long arrays of satellite DNAs and/or transposable elements. Recent progress in understanding functional centromere sequences is based largely on the high-resolution DNA mapping of sequences that interact with the centromere-specific histone H3 variant, the most reliable marker of active centromeres. In addition, sequence assembly and mapping of large repetitive centromeric regions, as well as comparative genome analyses offer insight into their complex organization and evolution. The rapidly advancing field of transcription in centromere regions highlights the functional importance of centromeric transcripts. Here, we comprehensively review the current state of knowledge on the composition and functionality of DNA sequences underlying active centromeres and discuss their contribution to the functioning of different centromere types in higher eukaryotes.
Collapse
|
33
|
Richard MMS, Chen NWG, Thareau V, Pflieger S, Blanchet S, Pedrosa-Harand A, Iwata A, Chavarro C, Jackson SA, Geffroy V. The Subtelomeric khipu Satellite Repeat from Phaseolus vulgaris: Lessons Learned from the Genome Analysis of the Andean Genotype G19833. FRONTIERS IN PLANT SCIENCE 2013; 4:109. [PMID: 24137164 PMCID: PMC3797529 DOI: 10.3389/fpls.2013.00109] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Accepted: 04/09/2013] [Indexed: 05/10/2023]
Abstract
Subtelomeric regions in eukaryotic organisms are known for harboring species-specific tandemly repeated satellite sequences. However, studies on the molecular organization and evolution of subtelomeric repeats are scarce, especially in plants. Khipu is a satellite DNA of 528-bp repeat unit, specific of the Phaseolus genus, with a subtelomeric distribution in common bean, P. vulgaris. To investigate the genomic organization and the evolution of khipu, we performed genome-wide analysis on the complete genome sequence of the common bean genotype G19833. We identified 2,460 khipu units located at most distal ends of the sequenced regions. Khipu units are arranged in discrete blocks of 2-55 copies and are heterogeneously distributed among the different chromosome ends of G19833 (from 0 to 555 khipus units per chromosome arm). Phylogenetically related khipu units are spread between numerous chromosome ends, suggesting frequent exchanges between non-homologous subtelomeres. However, most subclades contain numerous khipu units from only one or few chromosome ends indicating that local duplication is also driving khipu expansion. Unexpectedly, we also identified 81 khipu units located at centromeres. All the centromeric khipu units belong to a single divergent clade also comprised of a few units from several subtelomeres, suggesting that a few sequence exchanges between centromeres and subtelomeres took place in the common bean genome. The divergence and low copy number of these centromeric units from the subtelomeric units could explain why they were not detected by FISH (Fluorescence in situ Hybridization) although it can not be excluded that these centromeric units may have resulted from errors in the pseudomolecule assembly. Altogether our data highlight extensive sequence exchanges in subtelomeres between non-homologous chromosomes in common bean and confirm that subtelomeres represent one of the most dynamic and rapidly evolving regions in eukaryotic genomes.
Collapse
Affiliation(s)
- Manon M. S. Richard
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
| | - Nicolas W. G. Chen
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
| | - Vincent Thareau
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
| | - Stéphanie Pflieger
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
- Université Paris Diderot, Sorbonne Paris CitéParis, France
| | - Sophie Blanchet
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Universidade Federal de Pernambuco, Rua Nelson Chaves s/nRecife, Pernambuco, Brazil
| | - Aiko Iwata
- Center for Applied Genetic Technologies, Institute for Plant Breeding, Genetics, and Genomics, University of GeorgiaAthens, GA, USA
| | - Carolina Chavarro
- Center for Applied Genetic Technologies, Institute for Plant Breeding, Genetics, and Genomics, University of GeorgiaAthens, GA, USA
| | - Scott A. Jackson
- Center for Applied Genetic Technologies, Institute for Plant Breeding, Genetics, and Genomics, University of GeorgiaAthens, GA, USA
| | - Valérie Geffroy
- UMR-CNRS 8618, Saclay Plant Sciences, Institut de Biologie des Plantes, Université Paris SudOrsay Cedex, France
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche AgronomiqueGif-sur-Yvette, France
| |
Collapse
|
34
|
Iwata A, Tek AL, Richard MMS, Abernathy B, Fonsêca A, Schmutz J, Chen NWG, Thareau V, Magdelenat G, Li Y, Murata M, Pedrosa-Harand A, Geffroy V, Nagaki K, Jackson SA. Identification and characterization of functional centromeres of the common bean. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 76:47-60. [PMID: 23795942 DOI: 10.1111/tpj.12269] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2013] [Revised: 06/15/2013] [Accepted: 06/20/2013] [Indexed: 05/07/2023]
Abstract
In higher eukaryotes, centromeres are typically composed of megabase-sized arrays of satellite repeats that evolve rapidly and homogenize within a species' genome. Despite the importance of centromeres, our knowledge is limited to a few model species. We conducted a comprehensive analysis of common bean (Phaseolus vulgaris) centromeric satellite DNA using genomic data, fluorescence in situ hybridization (FISH), immunofluorescence and chromatin immunoprecipitation (ChIP). Two unrelated centromere-specific satellite repeats, CentPv1 and CentPv2, and the common bean centromere-specific histone H3 (PvCENH3) were identified. FISH showed that CentPv1 and CentPv2 are predominantly located at subsets of eight and three centromeres, respectively. Immunofluorescence- and ChIP-based assays demonstrated the functional significance of CentPv1 and CentPv2 at centromeres. Genomic analysis revealed several interesting features of CentPv1 and CentPv2: (i) CentPv1 is organized into an higher-order repeat structure, named Nazca, of 528 bp, whereas CentPv2 is composed of tandemly organized monomers; (ii) CentPv1 and CentPv2 have undergone chromosome-specific homogenization; and (iii) CentPv1 and CentPv2 are not likely to be commingled in the genome. These findings suggest that two distinct sets of centromere sequences have evolved independently within the common bean genome, and provide insight into centromere satellite evolution.
Collapse
Affiliation(s)
- Aiko Iwata
- Center for Applied Genetic Technologies and Institute for Plant Breeding Genetics, and Genomics, University of Georgia, Athens, GA, 30602, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Umemori J, Mori A, Ichiyanagi K, Uno T, Koide T. Identification of both copy number variation-type and constant-type core elements in a large segmental duplication region of the mouse genome. BMC Genomics 2013; 14:455. [PMID: 23834397 PMCID: PMC3722088 DOI: 10.1186/1471-2164-14-455] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2012] [Accepted: 07/05/2013] [Indexed: 11/14/2022] Open
Abstract
Background Copy number variation (CNV), an important source of diversity in genomic structure, is frequently found in clusters called CNV regions (CNVRs). CNVRs are strongly associated with segmental duplications (SDs), but the composition of these complex repetitive structures remains unclear. Results We conducted self-comparative-plot analysis of all mouse chromosomes using the high-speed and large-scale-homology search algorithm SHEAP. For eight chromosomes, we identified various types of large SD as tartan-checked patterns within the self-comparative plots. A complex arrangement of diagonal split lines in the self-comparative-plots indicated the presence of large homologous repetitive sequences. We focused on one SD on chromosome 13 (SD13M), and developed SHEPHERD, a stepwise ab initio method, to extract longer repetitive elements and to characterize repetitive structures in this region. Analysis using SHEPHERD showed the existence of 60 core elements, which were expected to be the basic units that form SDs within the repetitive structure of SD13M. The demonstration that sequences homologous to the core elements (>70% homology) covered approximately 90% of the SD13M region indicated that our method can characterize the repetitive structure of SD13M effectively. Core elements were composed largely of fragmented repeats of a previously identified type, such as long interspersed nuclear elements (LINEs), together with partial genic regions. Comparative genome hybridization array analysis showed that whereas 42 core elements were components of CNVR that varied among mouse strains, 8 did not vary among strains (constant type), and the status of the others could not be determined. The CNV-type core elements contained significantly larger proportions of long terminal repeat (LTR) types of retrotransposon than the constant-type core elements, which had no CNV. The higher divergence rates observed in the CNV-type core elements than in the constant type indicate that the CNV-type core elements have a longer evolutionary history than constant-type core elements in SD13M. Conclusions Our methodology for the identification of repetitive core sequences simplifies characterization of the structures of large SDs and detailed analysis of CNV. The results of detailed structural and quantitative analyses in this study might help to elucidate the biological role of one of the SDs on chromosome 13.
Collapse
Affiliation(s)
- Juzoh Umemori
- Mouse Genomics Resource Laboratory, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | | | | | | | | |
Collapse
|
36
|
Yin H, Liu J, Xu Y, Liu X, Zhang S, Ma J, Du J. TARE1, a mutated Copia-like LTR retrotransposon followed by recent massive amplification in tomato. PLoS One 2013; 8:e68587. [PMID: 23861922 PMCID: PMC3701649 DOI: 10.1371/journal.pone.0068587] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Accepted: 06/05/2013] [Indexed: 11/19/2022] Open
Abstract
Long terminal repeat retrotransposons (LTR-RTs) are the major DNA components in flowering plants. Most LTR-RTs contain dinucleotides ‘TG’ and ‘CA’ at the ends of the two LTRs. Here we report the structure, evolution, and propensity of a tomato atypical retrotransposon element (TARE1) with both LTRs starting as ‘TA’. This family is also characterized by high copy numbers (354 copies), short LTR size (194 bp), extremely low ratio of solo LTRs to intact elements (0.05∶1), recent insertion (most within 0.75∼1.75 million years, Mys), and enrichment in pericentromeric region. The majority (83%) of the TARE1 elements are shared between S. lycopersicum and its wild relative S. pimpinellifolium, but none of them are found in potato. In the present study, we used shared LTR-RTs as molecular markers and estimated the divergence time between S. lycopersicum and S. pimpinellifolium to be <0.5 Mys. Phylogenetic analysis showed that the TARE1 elements, together with two closely related families, TARE2 and TGRE1, have formed a sub-lineage belonging to a Copia-like Ale lineage. Although TARE1 and TARE2 shared similar structural characteristics, the timing, scale, and activity of their amplification were found to be substantially different. We further propose a model wherein a single mutation from ‘G’ to ‘A’ in 3′ LTR followed by amplification is responsible for the origin of TARE1, thus providing evidence that the proliferation of a spontaneous mutation can be mediated by the amplification of LTR-RTs at the level of RNA.
Collapse
Affiliation(s)
- Hao Yin
- Bioinformatics Group, Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- Center of Pear Engineering Technology Research, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Jing Liu
- Bioinformatics Group, Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Yingxiu Xu
- Bioinformatics Group, Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Xing Liu
- Bioinformatics Group, Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- Center of Pear Engineering Technology Research, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Shaoling Zhang
- Center of Pear Engineering Technology Research, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, Indiana, United States of America
| | - Jianchang Du
- Bioinformatics Group, Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- * E-mail:
| |
Collapse
|
37
|
Sharma A, Wolfgruber TK, Presting GG. Tandem repeats derived from centromeric retrotransposons. BMC Genomics 2013; 14:142. [PMID: 23452340 PMCID: PMC3648361 DOI: 10.1186/1471-2164-14-142] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Accepted: 02/23/2013] [Indexed: 12/26/2022] Open
Abstract
Background Tandem repeats are ubiquitous and abundant in higher eukaryotic genomes and constitute, along with transposable elements, much of DNA underlying centromeres and other heterochromatic domains. In maize, centromeric satellite repeat (CentC) and centromeric retrotransposons (CR), a class of Ty3/gypsy retrotransposons, are enriched at centromeres. Some satellite repeats have homology to retrotransposons and several mechanisms have been proposed to explain the expansion, contraction as well as homogenization of tandem repeats. However, the origin and evolution of tandem repeat loci remain largely unknown. Results CRM1TR and CRM4TR are novel tandem repeats that we show to be entirely derived from CR elements belonging to two different subfamilies, CRM1 and CRM4. Although these tandem repeats clearly originated in at least two separate events, they are derived from similar regions of their respective parent element, namely the long terminal repeat (LTR) and untranslated region (UTR). The 5′ ends of the monomer repeat units of CRM1TR and CRM4TR map to different locations within their respective LTRs, while their 3′ ends map to the same relative position within a conserved region of their UTRs. Based on the insertion times of heterologous retrotransposons that have inserted into these tandem repeats, amplification of the repeats is estimated to have begun at least ~4 (CRM1TR) and ~1 (CRM4TR) million years ago. Distinct CRM1TR sequence variants occupy the two CRM1TR loci, indicating that there is little or no movement of repeats between loci, even though they are separated by only ~1.4 Mb. Conclusions The discovery of two novel retrotransposon derived tandem repeats supports the conclusions from earlier studies that retrotransposons can give rise to tandem repeats in eukaryotic genomes. Analysis of monomers from two different CRM1TR loci shows that gene conversion is the major cause of sequence variation. We propose that successive intrastrand deletions generated the initial repeat structure, and gene conversions increased the size of each tandem repeat locus.
Collapse
|
38
|
Wei L, Xiao M, An Z, Ma B, Mason AS, Qian W, Li J, Fu D. New insights into nested long terminal repeat retrotransposons in Brassica species. MOLECULAR PLANT 2013; 6:470-482. [PMID: 22930733 DOI: 10.1093/mp/sss081] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Long terminal repeat (LTR) retrotransposons, one of the foremost types of transposons, continually change or modify gene function and reorganize the genome through bursts of dramatic proliferation. Many LTR-TEs preferentially insert within other LTR-TEs, but the cause and evolutionary significance of these nested LTR-TEs are not well understood. In this study, a total of 1.52Gb of Brassica sequence containing 2020 bacterial artificial chromosomes (BACs) was scanned, and six bacterial artificial chromosome (BAC) clones with extremely nested LTR-TEs (LTR-TEs density: 7.24/kb) were selected for further analysis. The majority of the LTR-TEs in four of the six BACs were found to be derived from the rapid proliferation of retrotransposons originating within the BAC regions, with only a few LTR-TEs originating from the proliferation and insertion of retrotransposons from outside the BAC regions approximately 5-23Mya. LTR-TEs also preferably inserted into TA-rich repeat regions. Gene prediction by Genescan identified 207 genes in the 0.84Mb of total BAC sequences. Only a few genes (3/207) could be matched to the Brassica expressed sequence tag (EST) database, indicating that most genes were inactive after retrotransposon insertion. Five of the six BACs were putatively centromeric. Hence, nested LTR-TEs in centromere regions are rapidly duplicated, repeatedly inserted, and act to suppress activity of genes and to reshuffle the structure of the centromeric sequences. Our results suggest that LTR-TEs burst and proliferate on a local scale to create nested LTR-TE regions, and that these nested LTR-TEs play a role in the formation of centromeres.
Collapse
Affiliation(s)
- Lijuan Wei
- Chongqing Engineering Research Center for Rapeseed, College of Agronomy and Biotechnology, Southwest University, Chongqing 400716, China
| | | | | | | | | | | | | | | |
Collapse
|
39
|
Melters DP, Bradnam KR, Young HA, Telis N, May MR, Ruby JG, Sebra R, Peluso P, Eid J, Rank D, Garcia JF, DeRisi JL, Smith T, Tobias C, Ross-Ibarra J, Korf I, Chan SWL. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol 2013; 14:R10. [PMID: 23363705 PMCID: PMC4053949 DOI: 10.1186/gb-2013-14-1-r10] [Citation(s) in RCA: 322] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 01/30/2013] [Indexed: 01/01/2023] Open
Abstract
Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
Collapse
|
40
|
Satellite DNA-like elements associated with genes within euchromatin of the beetle Tribolium castaneum. G3-GENES GENOMES GENETICS 2012; 2:931-41. [PMID: 22908042 PMCID: PMC3411249 DOI: 10.1534/g3.112.003467] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Accepted: 06/18/2012] [Indexed: 01/30/2023]
Abstract
In the red flour beetle Tribolium castaneum the major TCAST satellite DNA accounts for 35% of the genome and encompasses the pericentromeric regions of all chromosomes. Because of the presence of transcriptional regulatory elements and transcriptional activity in these sequences, TCAST satellite DNAs also have been proposed to be modulators of gene expression within euchromatin. Here, we analyze the distribution of TCAST homologous repeats in T. castaneum euchromatin and study their association with genes as well as their potential gene regulatory role. We identified 68 arrays composed of TCAST-like elements distributed on all chromosomes. Based on sequence characteristics the arrays were composed of two types of TCAST-like elements. The first type consists of TCAST satellite-like elements in the form of partial monomers or tandemly arranged monomers, up to tetramers, whereas the second type consists of TCAST-like elements embedded with a complex unit that resembles a DNA transposon. TCAST-like elements were also found in the 5′ untranslated region (UTR) of the CR1-3_TCa retrotransposon, and therefore retrotransposition may have contributed to their dispersion throughout the genome. No significant difference in the homogenization of dispersed TCAST-like elements was found either at the level of local arrays or chromosomes nor among different chromosomes. Of 68 TCAST-like elements, 29 were located within introns, with the remaining elements flanked by genes within a 262 to 404,270 nt range. TCAST-like elements are statistically overrepresented near genes with immunoglobulin-like domains attesting to their nonrandom distribution and a possible gene regulatory role.
Collapse
|
41
|
Tsukahara S, Kawabe A, Kobayashi A, Ito T, Aizu T, Shin-i T, Toyoda A, Fujiyama A, Tarutani Y, Kakutani T. Centromere-targeted de novo integrations of an LTR retrotransposon of Arabidopsis lyrata. Genes Dev 2012; 26:705-13. [PMID: 22431508 DOI: 10.1101/gad.183871.111] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The plant genome evolves with rapid proliferation of LTR-type retrotransposons, which is associated with their clustered accumulation in gene-poor regions, such as centromeres. Despite their major role for plant genome evolution, no mobile LTR element with targeted integration into gene-poor regions has been identified in plants. Here, we report such targeted integrations de novo. We and others have previously shown that an ATCOPIA93 family retrotransposon in Arabidopsis thaliana is mobilized when the DNA methylation machinery is compromised. Although ATCOPIA93 family elements are low copy number in the wild-type A. thaliana genome, high-copy-number related elements are found in the wild-type Arabidopsis lyrata genome, and they show centromere-specific localization. To understand the mechanisms for the clustered accumulation of the A. lyrata elements directly, we introduced one of them, named Tal1 (Transposon of Arabidopsis lyrata 1), into A. thaliana by transformation. The introduced Tal1 was retrotransposed in A. thaliana, and most of the retrotransposed copies were found in centromeric repeats of A. thaliana, suggesting targeted integration. The targeted integration is especially surprising because the centromeric repeat sequences differ considerably between A. lyrata and A. thaliana. Our results revealed unexpectedly dynamic controls for evolution of the transposon-rich heterochromatic regions.
Collapse
Affiliation(s)
- Sayuri Tsukahara
- Department of Integrated Genetics, National Institute of Genetics, Shizuoka 411-8540, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Zou J, Fu D, Gong H, Qian W, Xia W, Pires JC, Li R, Long Y, Mason AS, Yang TJ, Lim YP, Park BS, Meng J. De novo genetic variation associated with retrotransposon activation, genomic rearrangements and trait variation in a recombinant inbred line population of Brassica napus derived from interspecific hybridization with Brassica rapa. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2011; 68:212-24. [PMID: 21689170 DOI: 10.1111/j.1365-313x.2011.04679.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Interspecific hybridization is a significant evolutionary force as well as a powerful method for crop breeding. Partial substitution of the AA subgenome in Brassica napus (A(n) A(n) C(n) C(n) ) with the Brassica rapa (A(r) A(r) ) genome by two rounds of interspecific hybridization resulted in a new introgressed type of B. napus (A(r) A(r) C(n) C(n) ). In this study, we construct a population of recombinant inbred lines of the new introgressed type of B. napus. Microsatellite, intron-based and retrotransposon markers were used to characterize this experimental population with genetic mapping, genetic map comparison and specific marker cloning analysis. Yield-related traits were also recorded for identification of quantitative trait loci (QTLs). A remarkable range of novel genomic alterations was observed in the population, including simple sequence repeat (SSR) mutations, chromosomal rearrangements and retrotransposon activations. Most of these changes occurred immediately after interspecific hybridization, in the early stages of genome stabilization and derivation of experimental lines. These novel genomic alterations affected yield-related traits in the introgressed B. napus to an even greater extent than the alleles alone that were introgressed from the A(r) subgenome of B. rapa, suggesting that genomic changes induced by interspecific hybridization are highly significant in both genome evolution and crop improvement.
Collapse
Affiliation(s)
- Jun Zou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
A brief history of the status of transposable elements: from junk DNA to major players in evolution. Genetics 2011; 186:1085-93. [PMID: 21156958 DOI: 10.1534/genetics.110.124180] [Citation(s) in RCA: 143] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The idea that some genetic factors are able to move around chromosomes emerged more than 60 years ago when Barbara McClintock first suggested that such elements existed and had a major role in controlling gene expression and that they also have had a major influence in reshaping genomes in evolution. It was many years, however, before the accumulation of data and theories showed that this latter revolutionary idea was correct although, understandably, it fell far short of our present view of the significant influence of what are now known as "transposable elements" in evolution. In this article, I summarize the main events that influenced my thinking about transposable elements as a young scientist and the influence and role of these specific genomic elements in evolution over subsequent years. Today, we recognize that the findings about genomic changes affected by transposable elements have considerably altered our view of the ways in which genomes evolve and work.
Collapse
|
44
|
Tian Z, Yu Y, Lin F, Yu Y, Sanmiguel PJ, Wing RA, McCouch SR, Ma J, Jackson SA. Exceptional lability of a genomic complex in rice and its close relatives revealed by interspecific and intraspecific comparison and population analysis. BMC Genomics 2011; 12:142. [PMID: 21385395 PMCID: PMC3060143 DOI: 10.1186/1471-2164-12-142] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2010] [Accepted: 03/08/2011] [Indexed: 11/16/2022] Open
Abstract
Background Extensive DNA rearrangement of genic colinearity, as revealed by comparison of orthologous genomic regions, has been shown to be a general concept describing evolutionary dynamics of plant genomes. However, the nature, timing, lineages and adaptation of local genomic rearrangement in closely related species (e.g., within a genus) and haplotype variation of genomic rearrangement within populations have not been well documented. Results We previously identified a hotspot for genic rearrangement and transposon accumulation in the Orp region of Asian rice (Oryza sativa, AA) by comparison with its orthologous region in sorghum. Here, we report the comparative analysis of this region with its orthologous regions in the wild progenitor species (O. nivara, AA) of Asian rice and African rice (O. glaberrima) using the BB genome Oryza species (O. punctata) as an outgroup, and investigation of transposon insertion sites and a segmental inversion event in the AA genomes at the population level. We found that Orp region was primarily and recently expanded in the Asian rice species O. sativa and O. nivara. LTR-retrotransposons shared by the three AA-genomic regions have been fixed in all the 94 varieties that represent different populations of the AA-genome species/subspecies, indicating their adaptive role in genome differentiation. However, LTR-retrotransposons unique to either O. nivara or O. sativa regions exhibited dramatic haplotype variation regarding their presence or absence between or within populations/subpopulations. Conclusions The LTR-retrotransposon insertion hotspot in the Orp region was formed recently, independently and concurrently in different AA-genome species, and that the genic rearrangements detected in different species appear to be differentially triggered by transposable elements. This region is located near the end of the short arm of chromosome 8 and contains a high proportion of LTR-retrotransposons similar to observed in the centromeric region of this same chromosome, and thus may represent a genomic region that has recently switched from euchromatic to heterochromatic states. The haplotype variation of LTR-retrotransposon insertions within this region reveals substantial admixture among various subpopulations as established by molecular markers at the whole genome level, and can be used to develop retrotransposon junction markers for simple and rapid classification of O. sativa germplasm.
Collapse
Affiliation(s)
- Zhixi Tian
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Jo SH, Park HM, Kim SM, Kim HH, Hur CG, Choi D. Unraveling the sequence dynamics of the formation of genus-specific satellite DNAs in the family solanaceae. Heredity (Edinb) 2010; 106:876-85. [PMID: 21063436 DOI: 10.1038/hdy.2010.131] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Tandemly repeated DNAs, referred to as satellite DNAs, often occur in a genome in a genus-specific manner. However, the mechanisms for generation and evolution for these sequences are largely unknown because of the uncertain origins of the satellite DNAs. We found highly divergent genus-specific satellite DNAs that showed sequence similarity with genus-specific intergenic spacers (IGSs) in the family Solanaceae, which includes the genera Nicotiana, Solanum and Capsicum. The conserved position of the IGS between 25S and 18S rDNA facilitates comparison of IGS sequences across genera, even in the presence of very low sequence similarity. Sequence comparison of IGS may elucidate the procedure of the genesis of complex monomer units of the satellite DNAs. Within the IGS of Capsicum species, base substitutions and copy number variation of subrepeat monomers were causes of monomer divergence in IGS sequences. At the level of inter-generic IGS sequences of the family Solanaceae, however, genus-specific motif selection, motif shuffling between subrepeats and differential amplification among motifs were involved in formation of genus-specific IGS. Therefore, the genus-specific satellite DNAs in Solanaceae plants can be generated from differentially organized repeat monomers of the IGS rather than by accumulation of mutations from pre-existent satellite DNAs.
Collapse
Affiliation(s)
- S-H Jo
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | | | | | | | | | | |
Collapse
|
46
|
Du J, Tian Z, Hans CS, Laten HM, Cannon SB, Jackson SA, Shoemaker RC, Ma J. Evolutionary conservation, diversity and specificity of LTR-retrotransposons in flowering plants: insights from genome-wide analysis and multi-specific comparison. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2010; 63:584-98. [PMID: 20525006 DOI: 10.1111/j.1365-313x.2010.04263.x] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The availability of complete or nearly complete genome sequences from several plant species permits detailed discovery and cross-species comparison of transposable elements (TEs) at the whole genome level. We initially investigated 510 long terminal repeat-retrotransposon (LTR-RT) families comprising 32370 elements in soybean (Glycine max (L.) Merr.). Approximately 87% of these elements were located in recombination-suppressed pericentromeric regions, where the ratio (1.26) of solo LTRs to intact elements (S/I) is significantly lower than that of chromosome arms (1.62). Further analysis revealed a significant positive correlation between S/I and LTR sizes, indicating that larger LTRs facilitate solo LTR formation. Phylogenetic analysis revealed seven Copia and five Gypsy evolutionary lineages that were present before the divergence of eudicot and monocot species, but the scales and timeframes within which they proliferated vary dramatically across families, lineages and species, and notably, a Copia lineage has been lost in soybean. Analysis of the physical association of LTR-RTs with centromere satellite repeats identified two putative centromere retrotransposon (CR) families of soybean, which were grouped into the CR (e.g. CRR and CRM) lineage found in grasses, indicating that the 'functional specification' of CR pre-dates the bifurcation of eudicots and monocots. However, a number of families of the CR lineage are not concentrated in centromeres, suggesting that their CR roles may now be defunct. Our data also suggest that the envelope-like genes in the putative Copia retrovirus-like family are probably derived from the Gypsy retrovirus-like lineage, and thus we propose the hypothesis of a single ancient origin of envelope-like genes in flowering plants.
Collapse
Affiliation(s)
- Jianchang Du
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | |
Collapse
|
47
|
Macas J, Neumann P, Novák P, Jiang J. Global sequence characterization of rice centromeric satellite based on oligomer frequency analysis in large-scale sequencing data. ACTA ACUST UNITED AC 2010; 26:2101-8. [PMID: 20616383 DOI: 10.1093/bioinformatics/btq343] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
MOTIVATION Satellite DNA makes up significant portion of many eukaryotic genomes, yet it is relatively poorly characterized even in extensively sequenced species. This is, in part, due to methodological limitations of traditional methods of satellite repeat analysis, which are based on multiple alignments of monomer sequences. Therefore, we employed an alternative, alignment-free, approach utilizing k-mer frequency statistics, which is in principle more suitable for analyzing large sets of satellite repeat data, including sequence reads from next generation sequencing technologies. RESULTS k-mer frequency spectra were determined for two sets of rice centromeric satellite CentO sequences, including 454 reads from ChIP-sequencing of CENH3-bound DNA (7.6 Mb) and the whole genome Sanger sequencing reads (5.8 Mb). k-mer frequencies were used to identify the most conserved sequence regions and to reconstruct consensus sequences of complete monomers. Reconstructed consensus sequences as well as the assessment of overall divergence of k-mer spectra revealed high similarity of the two datasets, suggesting that CentO sequences associated with functional centromeres (CENH3-bound) do not significantly differ from the total population of CentO, which includes both centromeric and pericentromeric repeat arrays. On the other hand, considerable differences were revealed when these methods were used for comparison of CentO populations between individual chromosomes of the rice genome assembly, demonstrating preferential sequence homogenization of the clusters within the same chromosome. k-mer frequencies were also successfully used to identify and characterize smRNAs derived from CentO repeats.
Collapse
Affiliation(s)
- Jirí Macas
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branisovska 31, CZ-37005, Ceske Budejovice, Czech Republic.
| | | | | | | |
Collapse
|
48
|
[Detection of maize centromeric repeats in the relatives of maize using fluorescence in situ hybridization]. YI CHUAN = HEREDITAS 2010; 32:264-70. [PMID: 20233704 DOI: 10.3724/sp.j.1005.2010.00264] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In order to analyze the conservation of maize centromeric satellite DNA (CentC) and centromeric retrotransposon (CRM) in the subspecies and relatives of Zea mays, dual fluorescence in situ hybridization (FISH) was used to detect the existence and distribution of the above two repetitive sequences in Zea mays ssp. mexicana, Z. diploperennis, Z. perennis, Tripsacum dactyloides, Coix lacryma-jobi, and Sorghum bicolor. In Z. mays ssp. mexicana, Z. diploperennis, and Z. perennis, both CentC and CRM probes produced strong or relatively strong signals in the centromeric regions of all chromosomes. There was an obvious variation in the intensity of hybridization signals on different chromosomes, indicating that different centromeres have different amounts of CentC and CRM sequences. In some centromeres, the intensity of CentC signals differed from that of CRM signals and was free from overlapping. In T. dactyloides, only weak CentC and CRM signals were detected in the centromeric regions of most chromosomes, while in C. lacryma-jobi and S. bicolor only relatively strong or strong CRM signals primarily located in the centromeric regions were detected. This result indicates that CentC is highly conserved among the subspecies of Z. mays and the species of Zea, and has high conservation in Tripsacum, a genus that is most closely related to Zea, and CRM is conserved among the species of grass family either closely or distantly related to Zea.
Collapse
|
49
|
Abstract
Data from maize show that centromeres strongly suppress crossing over and instead undergo frequent genetic exchange in the form of gene conversion. Centromeres are the most dynamic regions of the genome, yet they are typified by little or no crossing over, making it difficult to explain the origin of this diversity. To address this question, we developed a novel CENH3 ChIP display method that maps kinetochore footprints over transposon-rich areas of centromere cores. A high level of polymorphism made it possible to map a total of 238 within-centromere markers using maize recombinant inbred lines. Over half of the markers were shown to interact directly with kinetochores (CENH3) by chromatin immunoprecipitation. Although classical crossing over is fully suppressed across CENH3 domains, two gene conversion events (i.e., non-crossover marker exchanges) were identified in a mapping population. A population genetic analysis of 53 diverse inbreds suggests that historical gene conversion is widespread in maize centromeres, occurring at a rate >1×10−5/marker/generation. We conclude that gene conversion accelerates centromere evolution by facilitating sequence exchange among chromosomes. Centromeres, which harbor the attachment points for microtubules during cell division, are characterized by repetitive DNA, paucity of genes, and almost complete suppression of crossing over. The repetitive DNA within centromeres appears to evolve much faster than would be expected for genetically inert regions, however. Current explanations for this rapid evolution tend to be theoretical. On the one hand there are arguments that subtle forms of selection on selfish repeat sequences can explain the rapid rate of change, while on the other hand it seems plausible that some form of accelerated neutral evolution is occurring. Here, we address this question in maize, which is known for its excellent genetic mapping resources. We first developed a method for identifying hundreds of single copy markers in centromeres and confirmed that they lie within functional domains by using a chromatin immunoprecipitation assay for kinetochore protein CENH3. All markers were mapped in relation to each other. The data show that, whereas classical crossing over is suppressed, there is extensive genetic exchange in the form of gene conversion (by which short segments of one chromosome are copied onto the other). These results were confirmed by demonstrating that similar short exchange tracts are common among the centromeres from multiple diverse inbred lines of maize. Our study suggests that centromere diversity can be at least partially attributed to a high rate of previously “hidden” genetic exchange within the core kinetochore domains.
Collapse
|
50
|
Du J, Grant D, Tian Z, Nelson RT, Zhu L, Shoemaker RC, Ma J. SoyTEdb: a comprehensive database of transposable elements in the soybean genome. BMC Genomics 2010; 11:113. [PMID: 20163715 PMCID: PMC2830986 DOI: 10.1186/1471-2164-11-113] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2009] [Accepted: 02/17/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop. DESCRIPTION Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I) and 6,029 DNA transposons (Class II) with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95%) of these elements (particularly a few hundred low-copy-number families) are first described in this study. CONCLUSION SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually curated transposable element database for any individual plant genome completely sequenced to date. Transposable elements previously identified in legumes, the third largest family of flowering plants, are relatively scarce. Thus this database will facilitate structural, evolutionary, functional, and epigenetic analyses of transposable elements in soybean and other legume species.
Collapse
Affiliation(s)
- Jianchang Du
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - David Grant
- US Department of Agriculture-Agricultural Research Service, Corn Insect and Crop Genetics Research Unit, Ames, Iowa 50011, USA
| | - Zhixi Tian
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - Rex T Nelson
- US Department of Agriculture-Agricultural Research Service, Corn Insect and Crop Genetics Research Unit, Ames, Iowa 50011, USA
| | - Liucun Zhu
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - Randy C Shoemaker
- US Department of Agriculture-Agricultural Research Service, Corn Insect and Crop Genetics Research Unit, Ames, Iowa 50011, USA
| | - Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|