1
|
Belyayev A, Josefiová J, Jandová M, Kalendar R, Mahelka V, Mandák B, Krak K. The structural diversity of CACTA transposons in genomes of Chenopodium (Amaranthaceae, Caryophyllales) species: specific traits and comparison with the similar elements of angiosperms. Mob DNA 2022; 13:8. [PMID: 35379321 PMCID: PMC8978399 DOI: 10.1186/s13100-022-00265-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 03/11/2022] [Indexed: 12/05/2022] Open
Abstract
Background CACTA transposable elements (TEs) comprise one of the most abundant superfamilies of Class 2 (cut-and-paste) transposons. Over recent decades, CACTA elements were widely identified in species from the plant, fungi, and animal kingdoms, but sufficiently studied in the genomes of only a few model species although non-model genomes can bring additional and valuable information. It primarily concerned the genomes of species belonging to clades in the base of large taxonomic groups whose genomes, to a certain extent, can preserve relict and/or possesses specific traits. Thus, we sought to investigate the genomes of Chenopodium (Amaranthaceae, Caryophyllales) species to unravel the structural variability of CACTA elements. Caryophyllales is a separate branch of Angiosperms and until recently the diversity of CACTA elements in this clade was unknown. Results Application of the short-read genome assembly algorithm followed by analysis of detected complete CACTA elements allowed for the determination of their structural diversity in the genomes of 22 Chenopodium album aggregate species. This approach yielded knowledge regarding: (i) the coexistence of two CACTA transposons subtypes in single genome; (ii) gaining of additional protein conserved domains within the coding sequence; (iii) the presence of captured gene fragments, including key genes for flower development; and (iv)) identification of captured satDNA arrays. Wide comparative database analysis revealed that identified events are scattered through Angiosperms in different proportions. Conclusions Our study demonstrated that while preserving the basic element structure a wide range of coding and non-coding additions to CACTA transposons occur in the genomes of C. album aggregate species. Ability to relocate additions inside genome in combination with the proposed novel functional features of structural-different CACTA elements can impact evolutionary trajectory of the host genome. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-022-00265-3.
Collapse
|
2
|
Goubert C, Craig RJ, Bilat AF, Peona V, Vogan AA, Protasio AV. A beginner's guide to manual curation of transposable elements. Mob DNA 2022; 13:7. [PMID: 35354491 PMCID: PMC8969392 DOI: 10.1186/s13100-021-00259-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 12/17/2021] [Indexed: 12/11/2022] Open
Abstract
Background In the study of transposable elements (TEs), the generation of a high confidence set of consensus sequences that represent the diversity of TEs found in a given genome is a key step in the path to investigate these fascinating genomic elements. Many algorithms and pipelines are available to automatically identify putative TE families present in a genome. Despite the availability of these valuable resources, producing a library of high-quality full-length TE consensus sequences largely remains a process of manual curation. This know-how is often passed on from mentor-to-mentee within research groups, making it difficult for those outside the field to access this highly specialised skill. Results Our manuscript attempts to fill this gap by providing a set of detailed computer protocols, software recommendations and video tutorials for those aiming to manually curate TEs. Detailed step-by-step protocols, aimed at the complete beginner, are presented in the Supplementary Methods. Conclusions The proposed set of programs and tools presented here will make the process of manual curation achievable and amenable to all researchers and in special to those new to the field of TEs. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-021-00259-7.
Collapse
Affiliation(s)
- Clement Goubert
- Canadian Center for Computational Genomics, McGill University, Montreal, Québec, Canada.,Department of Human Genetics, McGill University, Montreal, Québec, Canada
| | - Rory J Craig
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Agustin F Bilat
- Departamento de Genética, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
| | - Valentina Peona
- Department of Organismal Biology, Uppsala University, Norbyvägen 18D, 752 36, Uppsala, Sweden
| | - Aaron A Vogan
- Department of Organismal Biology, Uppsala University, Norbyvägen 18D, 752 36, Uppsala, Sweden
| | - Anna V Protasio
- Department of Pathology, Tennis Court Road, Cambridge, CB1 2PQ, UK. .,Christ's College, St Andrews Street, Cambridge, CB2 3BU, UK.
| |
Collapse
|
3
|
Shenton M, Kobayashi M, Terashima S, Ohyanagi H, Copetti D, Hernández-Hernández T, Zhang J, Ohmido N, Fujita M, Toyoda A, Ikawa H, Fujiyama A, Furuumi H, Miyabayashi T, Kubo T, Kudrna D, Wing R, Yano K, Nonomura KI, Sato Y, Kurata N. Evolution and Diversity of the Wild Rice Oryza officinalis Complex, across Continents, Genome Types, and Ploidy Levels. Genome Biol Evol 2021; 12:413-428. [PMID: 32125373 PMCID: PMC7531200 DOI: 10.1093/gbe/evaa037] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/25/2020] [Indexed: 12/15/2022] Open
Abstract
The Oryza officinalis complex is the largest species group in
Oryza, with more than nine species from four continents, and is a
tertiary gene pool that can be exploited in breeding programs for the improvement of
cultivated rice. Most diploid and tetraploid members of this group have a C genome. Using
a new reference C genome for the diploid species O. officinalis, and
draft genomes for two other C genome diploid species Oryza eichingeri and
Oryza rhizomatis, we examine the influence of transposable elements on
genome structure and provide a detailed phylogeny and evolutionary history of the
Oryza C genomes. The O. officinalis genome is 1.6
times larger than the A genome of cultivated Oryza sativa, mostly due to
proliferation of Gypsy type long-terminal repeat transposable elements,
but overall syntenic relationships are maintained with other Oryza
genomes (A, B, and F). Draft genome assemblies of the two other C genome diploid species,
Oryza eichingeri and Oryza rhizomatis, and short-read
resequencing of a series of other C genome species and accessions reveal that after the
divergence of the C genome progenitor, there was still a substantial degree of variation
within the C genome species through proliferation and loss of both DNA and long-terminal
repeat transposable elements. We provide a detailed phylogeny and evolutionary history of
the Oryza C genomes and a genomic resource for the exploitation of the
Oryza tertiary gene pool.
Collapse
Affiliation(s)
| | | | | | - Hajime Ohyanagi
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Dario Copetti
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona.,T.T. Chang Genetic Resources Center, International Rice Research Institute, Los Baños, Philippines
| | | | - Jianwei Zhang
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona
| | - Nobuko Ohmido
- Division of the Living Environment, Kobe University, Japan
| | | | | | | | | | | | | | - Takahiko Kubo
- National Institute of Genetics, Mishima, Japan.,Faculty of Agriculture, Kyushu University, Fukuoka, Japan
| | - David Kudrna
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona
| | - Rod Wing
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona.,T.T. Chang Genetic Resources Center, International Rice Research Institute, Los Baños, Philippines.,Biological and Environment Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Kentaro Yano
- School of Agriculture, Meiji University, Tokyo, Japan
| | | | - Yutaka Sato
- National Institute of Genetics, Mishima, Japan
| | - Nori Kurata
- National Institute of Genetics, Mishima, Japan
| |
Collapse
|
4
|
Liu B, Iwata-Otsubo A, Yang D, Baker RL, Liang C, Jackson SA, Liu S, Ma J, Zhao M. Analysis of CACTA transposase genes unveils the mechanism of intron loss and distinct small RNA silencing pathways underlying divergent evolution of Brassica genomes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:34-48. [PMID: 33098166 DOI: 10.1111/tpj.15037] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 10/19/2020] [Accepted: 10/13/2020] [Indexed: 06/11/2023]
Abstract
In comparison with retrotransposons, DNA transposons make up a smaller proportion of most plant genomes. However, these elements are often proximal to genes to affect gene expression depending on the activity of the transposons, which is largely reflected by the activity of the transposase genes. Here, we show that three AT-rich introns were retained in the TNP2-like transposase genes of the Bot1 (Brassica oleracea transposon 1) CACTA transposable elements in Brassica oleracea, but were lost in the majority of the Bot1 elements in Brassica rapa. A recent burst of transposition of Bot1 was observed in B. oleracea, but not in B. rapa. This burst of transposition is likely related to the activity of the TNP2-like transposase genes as the expression values of the transposase genes were higher in B. oleracea than in B. rapa. In addition, distinct populations of small RNAs (21, 22 and 24 nt) were detected from the Bot1 elements in B. oleracea, but the vast majority of the small RNAs from the Bot1 elements in B. rapa are 24 nt in length. We hypothesize that the different activity of the TNP2-like transposase genes is likely associated with the three introns, and intron loss is likely reverse transcriptase mediated. Furthermore, we propose that the Bot1 family is currently undergoing silencing in B. oleracea, but has already been silenced in B. rapa. Taken together, our data provide new insights into the differentiation of transposons and their role in the asymmetric evolution of these two closely related Brassica species.
Collapse
Affiliation(s)
- Beibei Liu
- Department of Biology, Miami University, Oxford, OH, 45056, USA
| | - Aiko Iwata-Otsubo
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602,, USA
| | - Diya Yang
- Department of Biology, Miami University, Oxford, OH, 45056, USA
| | - Robert L Baker
- Department of Biology, Miami University, Oxford, OH, 45056, USA
| | - Chun Liang
- Department of Biology, Miami University, Oxford, OH, 45056, USA
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, 111 Riverbend Road, Athens, GA, 30602,, USA
| | - Shengyi Liu
- Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, 430062, China
| | - Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA
| | - Meixia Zhao
- Department of Biology, Miami University, Oxford, OH, 45056, USA
| |
Collapse
|
5
|
Wicker T, Schulman AH, Tanskanen J, Spannagl M, Twardziok S, Mascher M, Springer NM, Li Q, Waugh R, Li C, Zhang G, Stein N, Mayer KFX, Gundlach H. The repetitive landscape of the 5100 Mbp barley genome. Mob DNA 2017; 8:22. [PMID: 29270235 PMCID: PMC5738225 DOI: 10.1186/s13100-017-0102-3] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 11/22/2017] [Indexed: 01/07/2023] Open
Abstract
Background While transposable elements (TEs) comprise the bulk of plant genomic DNA, how they contribute to genome structure and organization is still poorly understood. Especially in large genomes where TEs make the majority of genomic DNA, it is still unclear whether TEs target specific chromosomal regions or whether they simply accumulate where they are best tolerated. Results Here, we present an analysis of the repetitive fraction of the 5100 Mb barley genome, the largest angiosperm genome to have a near-complete sequence assembly. Genes make only about 2% of the genome, while over 80% is derived from TEs. The TE fraction is composed of at least 350 different families. However, 50% of the genome is comprised of only 15 high-copy TE families, while all other TE families are present in moderate or low copy numbers. We found that the barley genome is highly compartmentalized with different types of TEs occupying different chromosomal “niches”, such as distal, interstitial, or proximal regions of chromosome arms. Furthermore, gene space represents its own distinct genomic compartment that is enriched in small non-autonomous DNA transposons, suggesting that these TEs specifically target promoters and downstream regions. Furthermore, their presence in gene promoters is associated with decreased methylation levels. Conclusions Our data show that TEs are major determinants of overall chromosome structure. We hypothesize that many of the the various chromosomal distribution patterns are the result of TE families targeting specific niches, rather than them accumulating where they have the least deleterious effects. Electronic supplementary material The online version of this article (10.1186/s13100-017-0102-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, CH-8008 Zurich, Switzerland
| | - Alan H Schulman
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Green Technology, Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Jaakko Tanskanen
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Green Technology, Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Manuel Spannagl
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Sven Twardziok
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, 1479 Gortner Avenue, Saint Paul, MN 55108 USA
| | - Qing Li
- Department of Plant and Microbial Biology, University of Minnesota, 1479 Gortner Avenue, Saint Paul, MN 55108 USA.,Present address: National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070 China
| | - Robbie Waugh
- The James Hutton Institute, Dundee, UK.,School of Life Sciences, University of Dundee, Dundee, UK
| | - Chengdao Li
- Western Barley Genetics Alliance/the State Agricultural Biotechnology Centre, School of Veterinary and Life Sciences, Murdoch University, Murdoch, WA6150 Australia.,Department of Primary Industry and Regional Development, Government of Western Australia, South Perth, WA6155 Australia
| | - Guoping Zhang
- College of Agriculture and Biotechnology, Wuhan, ZU China
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany
| | - Klaus F X Mayer
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany.,TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Heidrun Gundlach
- PGSB - Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Neuherberg, Germany
| |
Collapse
|
6
|
Jiang SH, Li GY, Xiong XM. Novel miniature inverted-repeat transposable elements derived from novel CACTA transposons were discovered in the genome of the ant Camponotus floridanus. Genes Genomics 2016. [DOI: 10.1007/s13258-016-0464-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|