1
|
Garcia S, Kovarik A, Maiwald S, Mann L, Schmidt N, Pascual-Díaz JP, Vitales D, Weber B, Heitkam T. The Dynamic Interplay Between Ribosomal DNA and Transposable Elements: A Perspective From Genomics and Cytogenetics. Mol Biol Evol 2024; 41:msae025. [PMID: 38306580 PMCID: PMC10946416 DOI: 10.1093/molbev/msae025] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 12/06/2023] [Accepted: 01/29/2024] [Indexed: 02/04/2024] Open
Abstract
Although both are salient features of genomes, at first glance ribosomal DNAs and transposable elements are genetic elements with not much in common: whereas ribosomal DNAs are mainly viewed as housekeeping genes that uphold all prime genome functions, transposable elements are generally portrayed as selfish and disruptive. These opposing characteristics are also mirrored in other attributes: organization in tandem (ribosomal DNAs) versus organization in a dispersed manner (transposable elements); evolution in a concerted manner (ribosomal DNAs) versus evolution by diversification (transposable elements); and activity that prolongs genomic stability (ribosomal DNAs) versus activity that shortens it (transposable elements). Re-visiting relevant instances in which ribosomal DNA-transposable element interactions have been reported, we note that both repeat types share at least four structural and functional hallmarks: (1) they are repetitive DNAs that shape genomes in evolutionary timescales, (2) they exchange structural motifs and can enter co-evolution processes, (3) they are tightly controlled genomic stress sensors playing key roles in senescence/aging, and (4) they share common epigenetic marks such as DNA methylation and histone modification. Here, we give an overview of the structural, functional, and evolutionary characteristics of both ribosomal DNAs and transposable elements, discuss their roles and interactions, and highlight trends and future directions as we move forward in understanding ribosomal DNA-transposable element associations.
Collapse
Affiliation(s)
- Sònia Garcia
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
| | - Ales Kovarik
- Institute of Biophysics, Academy of Sciences of the Czech Republic, 61265 Brno, Czech Republic
| | - Sophie Maiwald
- Faculty of Biology, Technische Universität Dresden, D-01069 Dresden, Germany
| | - Ludwig Mann
- Faculty of Biology, Technische Universität Dresden, D-01069 Dresden, Germany
| | - Nicola Schmidt
- Faculty of Biology, Technische Universität Dresden, D-01069 Dresden, Germany
| | | | - Daniel Vitales
- Institut Botànic de Barcelona (IBB), CSIC-CMCNB, 08038 Barcelona, Catalonia, Spain
- Laboratori de Botànica–Unitat Associada CSIC, Facultat de Farmàcia i Ciències de l’Alimentació, Universitat de Barcelona, 08028 Barcelona, Catalonia, Spain
| | - Beatrice Weber
- Faculty of Biology, Technische Universität Dresden, D-01069 Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, D-01069 Dresden, Germany
- Institute of Biology, NAWI Graz, Karl-Franzens-Universität, A-8010 Graz, Austria
| |
Collapse
|
2
|
Kojima KK. Helenus and Ajax, Two Groups of Non-Autonomous LTR Retrotransposons, Represent a New Type of Small RNA Gene-Derived Mobile Elements. BIOLOGY 2024; 13:119. [PMID: 38392337 PMCID: PMC10886601 DOI: 10.3390/biology13020119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/06/2024] [Accepted: 02/10/2024] [Indexed: 02/24/2024]
Abstract
Terminal repeat retrotransposons in miniature (TRIMs) are short non-autonomous long terminal repeat (LTR) retrotransposons found from various eukaryotes. Cassandra is a unique TRIM lineage which contains a 5S rRNA-derived sequence in its LTRs. Here, two new groups of TRIMs, designated Helenus and Ajax, are reported based on bioinformatics analysis and the usage of Repbase. Helenus is found from fungi, animals, and plants, and its LTRs contain a tRNA-like sequence. It includes two LTRs and between them, a primer-binding site (PBS) and polypurine tract (PPT) exist. Fungal and plant Helenus generate 5 bp target site duplications (TSDs) upon integration, while animal Helenus generates 4 bp TSDs. Ajax includes a 5S rRNA-derived sequence in its LTR and is found from two nemertean genomes. Ajax generates 5 bp TSDs upon integration. These results suggest that despite their unique promoters, Helenus and Ajax are TRIMs whose transposition is dependent on autonomous LTR retrotransposon. These TRIMs can originate through an insertion of SINE in an LTR of TRIM. The discovery of Helenus and Ajax suggests the presence of TRIMs with a promoter for RNA polymerase III derived from a small RNA gene, which is here collectively termed TRIMp3.
Collapse
Affiliation(s)
- Kenji K Kojima
- Genetic Information Research Institute, Cupertino, CA 95014, USA
| |
Collapse
|
3
|
Maiwald S, Mann L, Garcia S, Heitkam T. Evolving Together: Cassandra Retrotransposons Gradually Mirror Promoter Mutations of the 5S rRNA Genes. Mol Biol Evol 2024; 41:msae010. [PMID: 38262464 PMCID: PMC10853983 DOI: 10.1093/molbev/msae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 10/26/2023] [Accepted: 12/11/2023] [Indexed: 01/25/2024] Open
Abstract
The 5S rRNA genes are among the most conserved nucleotide sequences across all species. Similar to the 5S preservation we observe the occurrence of 5S-related nonautonomous retrotransposons, so-called Cassandras. Cassandras harbor highly conserved 5S rDNA-related sequences within their long terminal repeats, advantageously providing them with the 5S internal promoter. However, the dynamics of Cassandra retrotransposon evolution in the context of 5S rRNA gene sequence information and structural arrangement are still unclear, especially: (1) do we observe repeated or gradual domestication of the highly conserved 5S promoter by Cassandras and (2) do changes in 5S organization such as in the linked 35S-5S rDNA arrangements impact Cassandra evolution? Here, we show evidence for gradual co-evolution of Cassandra sequences with their corresponding 5S rDNAs. To follow the impact of 5S rDNA variability on Cassandra TEs, we investigate the Asteraceae family where highly variable 5S rDNAs, including 5S promoter shifts and both linked and separated 35S-5S rDNA arrangements have been reported. Cassandras within the Asteraceae mirror 5S rDNA promoter mutations of their host genome, likely as an adaptation to the host's specific 5S transcription factors and hence compensating for evolutionary changes in the 5S rDNA sequence. Changes in the 5S rDNA sequence and in Cassandras seem uncorrelated with linked/separated rDNA arrangements. We place all these observations into the context of angiosperm 5S rDNA-Cassandra evolution, discuss Cassandra's origin hypotheses (single or multiple) and Cassandra's possible impact on rDNA and plant genome organization, giving new insights into the interplay of ribosomal genes and transposable elements.
Collapse
Affiliation(s)
- Sophie Maiwald
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
| | - Ludwig Mann
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
| | - Sònia Garcia
- Institut Botànic de Barcelona, IBB (CSIC-MCNB), 08038 Barcelona, Catalonia, Spain
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
- Institute of Biology, NAWI Graz, Karl-Franzens-Universität, 8010 Graz, Austria
| |
Collapse
|
4
|
Gao D. Introduction of Plant Transposon Annotation for Beginners. BIOLOGY 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that contribute large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires necessary expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also bring difficulties for genome-wide transposon discovery. This review summarizes the three major strategies for transposon detection including repeat-based, structure-based, and homology-based annotation, and introduces the transposon superfamilies identified in plants thus far, and some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review also discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
| |
Collapse
|
5
|
Gao D, Fox-Fogle E. Identification of transcriptionally active transposons in Barley. BMC Genom Data 2023; 24:64. [PMID: 37925398 PMCID: PMC10625261 DOI: 10.1186/s12863-023-01170-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 10/30/2023] [Indexed: 11/06/2023] Open
Abstract
BACKGROUND The genomes of many major crops including barley (Hordeum vulgare) consist of numerous transposons. Despite their important roles in crop genome evolution and morphological variations, most of these elements are silent or truncated and unable to be mobile in host genomes. Thus far, only a very limited number of active transposons were identified in plants. RESULTS We analyzed the barley full-length cDNA (FLcDNA) sequences and detected 71 unique FLcDNAs exhibiting significant sequence similarity to the extant transposase proteins. These FLcDNAs were then used to search against the genome of a malting barley cultivar 'Morex', seven new intact transposons were identified. Sequence alignments indicated that six intact transposons contained the entire FLcDNAs whereas another one served as 3' untranslated region (3' UTR) of a barley gene. Our reverse transcription-PCR (RT-PCR) experiment further confirmed the expression of these six transposons and revealed their differential expression. We conducted genome-wide transposon comparisons and detected polymorphisms of three transposon families between the genomes of 'Morex' and other three genotypes including the wild barley (Hordeum spontaneum, B1K-04-12) and two cultivated barley varieties, 'Golden Promise' and 'Lasa Goumang'. Lastly, we screened the transcripts of all annotated barley genes and found that some transposons may serve as the coding regions (CDSs) or UTRs of barley genes. CONCLUSION We identified six newly expressed transposons in the barley genome and revealed the recent mobility of three transposon families. Our efforts provide a valuable resource for understanding the effects of transposons on barley genome evolution and for developing novel molecular tools for barley genetic improvement and other research.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, 83210, USA.
| | - Emma Fox-Fogle
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, 83210, USA
- National Agricultural Statistical Service, USDA, Olympia, WA, 98501, USA
| |
Collapse
|
6
|
Yang Y, Wen X, Wu Z, Wang K, Zhu Y. Large-scale long terminal repeat insertions produced a significant set of novel transcripts in cotton. SCIENCE CHINA. LIFE SCIENCES 2023; 66:1711-1724. [PMID: 37079218 DOI: 10.1007/s11427-022-2341-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 04/03/2023] [Indexed: 04/21/2023]
Abstract
Genomic analysis has revealed that the 1,637-Mb Gossypium arboreum genome contains approximately 81% transposable elements (TEs), while only 57% of the 735-Mb G. raimondii genome is occupied by TEs. In this study, we investigated whether there were unknown transcripts associated with TE or TE fragments and, if so, how these new transcripts were evolved and regulated. As sequence depths increased from 4 to 100 G, a total of 10,284 novel intergenic transcripts (intergenic genes) were discovered. On average, approximately 84% of these intergenic transcripts possibly overlapped with the long terminal repeat (LTR) insertions in the otherwise untranscribed intergenic regions and were expressed at relatively low levels. Most of these intergenic transcripts possessed no transcription activation markers, while the majority of the regular genic genes possessed at least one such marker. Genes without transcription activation markers formed their+1 and -1 nucleosomes more closely (only (117±1.4)bp apart), while twice as big spaces (approximately (403.5±46.0) bp apart) were detected for genes with the activation markers. The analysis of 183 previously assembled genomes across three different kingdoms demonstrated systematically that intergenic transcript numbers in a given genome correlated positively with its LTR content. Evolutionary analysis revealed that genic genes originated during one of the whole-genome duplication events around 137.7 million years ago (MYA) for all eudicot genomes or 13.7 MYA for the Gossypium family, respectively, while the intergenic transcripts evolved around 1.6 MYA, resultant of the last LTR insertion. The characterization of these low-transcribed intergenic transcripts can facilitate our understanding of the potential biological roles played by LTRs during speciation and diversifications.
Collapse
Affiliation(s)
- Yan Yang
- Institute for Advanced Studies, Wuhan University, Wuhan, 430072, China
| | - Xingpeng Wen
- Institute for Advanced Studies, Wuhan University, Wuhan, 430072, China
- College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Zhiguo Wu
- College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Kun Wang
- College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Yuxian Zhu
- Institute for Advanced Studies, Wuhan University, Wuhan, 430072, China.
- College of Life Sciences, Wuhan University, Wuhan, 430072, China.
- Hubei Hongshan Laboratory, Wuhan, 430072, China.
- TaiKang Center for Life and Medical Sciences, RNA Institute, Remin Hospital, Wuhan University, Wuhan, 430072, China.
| |
Collapse
|
7
|
Oggenfuss U, Croll D. Recent transposable element bursts are associated with the proximity to genes in a fungal plant pathogen. PLoS Pathog 2023; 19:e1011130. [PMID: 36787337 PMCID: PMC9970103 DOI: 10.1371/journal.ppat.1011130] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 02/27/2023] [Accepted: 01/18/2023] [Indexed: 02/15/2023] Open
Abstract
The activity of transposable elements (TEs) contributes significantly to pathogen genome evolution. TEs often destabilize genome integrity but may also confer adaptive variation in pathogenicity or resistance traits. De-repression of epigenetically silenced TEs often initiates bursts of transposition activity that may be counteracted by purifying selection and genome defenses. However, how these forces interact to determine the expansion routes of TEs within a pathogen species remains largely unknown. Here, we analyzed a set of 19 telomere-to-telomere genomes of the fungal wheat pathogen Zymoseptoria tritici. Phylogenetic reconstruction and ancestral state estimates of individual TE families revealed that TEs have undergone distinct activation and repression periods resulting in highly uneven copy numbers between genomes of the same species. Most TEs are clustered in gene poor niches, indicating strong purifying selection against insertions near coding sequences, or as a consequence of insertion site preferences. TE families with high copy numbers have low sequence divergence and strong signatures of defense mechanisms (i.e., RIP). In contrast, small non-autonomous TEs (i.e., MITEs) are less impacted by defense mechanisms and are often located in close proximity to genes. Individual TE families have experienced multiple distinct burst events that generated many nearly identical copies. We found that a Copia element burst was initiated from recent copies inserted substantially closer to genes compared to older copies. Overall, TE bursts tended to initiate from copies in GC-rich niches that escaped inactivation by genomic defenses. Our work shows how specific genomic environments features provide triggers for TE proliferation in pathogen genomes.
Collapse
Affiliation(s)
- Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
- * E-mail:
| |
Collapse
|
8
|
Burgess D, Chow HT, Grover JW, Freeling M, Mosher RA. Ovule siRNAs methylate protein-coding genes in trans. THE PLANT CELL 2022; 34:3647-3664. [PMID: 35781738 PMCID: PMC9516104 DOI: 10.1093/plcell/koac197] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 06/24/2022] [Indexed: 05/31/2023]
Abstract
Twenty-four-nucleotide (nt) small interfering RNAs (siRNAs) maintain asymmetric DNA methylation at thousands of euchromatic transposable elements in plant genomes in a process called RNA-directed DNA methylation (RdDM). RdDM is dispensable for growth and development in Arabidopsis thaliana, but is required for reproduction in other plants, such as Brassica rapa. The 24-nt siRNAs are abundant in maternal reproductive tissue, due largely to overwhelming expression from a few loci in the ovule and developing seed coat, termed siren loci. A recent study showed that 24-nt siRNAs produced in the anther tapetal tissue can methylate male meiocyte genes in trans. Here we show that in B. rapa, a similar process takes place in female tissue. siRNAs are produced from gene fragments embedded in some siren loci, and these siRNAs can trigger methylation in trans at related protein-coding genes. This trans-methylation is associated with silencing of some target genes and may be responsible for seed abortion in RdDM mutants. Furthermore, we demonstrate that a consensus sequence in at least two families of DNA transposons is associated with abundant siren expression, most likely through recruitment of CLASSY3, a putative chromatin remodeler. This research describes a mechanism whereby RdDM influences gene expression and sheds light on the role of RdDM during plant reproduction.
Collapse
|
9
|
Gao D, Nascimento EFMB, Leal-Bertioli SCM, Abernathy B, Jackson SA, Araujo ACG, Bertioli DJ. TAR30, a homolog of the canonical plant TTTAGGG telomeric repeat, is enriched in the proximal chromosome regions of peanut (Arachis hypogaea L.). Chromosome Res 2022; 30:77-90. [DOI: 10.1007/s10577-022-09684-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 01/07/2022] [Accepted: 01/11/2022] [Indexed: 11/03/2022]
|
10
|
Fu J, Wan L, Song L, He L, Jiang N, Long H, Huo J, Ji X, Hu F, Wei S, Pan L. OUP accepted manuscript. Genome Biol Evol 2022; 14:6575330. [PMID: 35482027 PMCID: PMC9113316 DOI: 10.1093/gbe/evac060] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/21/2022] [Indexed: 11/15/2022] Open
Affiliation(s)
| | | | - Lisha Song
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Lili He
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Ni Jiang
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Hairong Long
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Juan Huo
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Xiaowen Ji
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | - Fengyun Hu
- Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China
| | | | | |
Collapse
|
11
|
Vargas-Chavez C, Longo Pendy NM, Nsango SE, Aguilera L, Ayala D, González J. Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Collapse
Affiliation(s)
- Carlos Vargas-Chavez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Neil Michel Longo Pendy
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,École Doctorale Régional (EDR) en Infectiologie Tropicale d'Afrique Centrale, BP 876, Franceville, Gabon
| | - Sandrine E Nsango
- Faculté de Médecine et des Sciences Pharmaceutiques, Université de Douala, BP 2701, Douala, Cameroun
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Diego Ayala
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon.,Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle (MIVEGEC), Université Montpellier, CNRS, IRD, 64501 Montpellier, France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| |
Collapse
|
12
|
Chaux-Jukic F, O'Donnell S, Craig RJ, Eberhard S, Vallon O, Xu Z. Architecture and evolution of subtelomeres in the unicellular green alga Chlamydomonas reinhardtii. Nucleic Acids Res 2021; 49:7571-7587. [PMID: 34165564 PMCID: PMC8287924 DOI: 10.1093/nar/gkab534] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 06/01/2021] [Accepted: 06/08/2021] [Indexed: 02/06/2023] Open
Abstract
In most eukaryotes, subtelomeres are dynamic genomic regions populated by multi-copy sequences of different origins, which can promote segmental duplications and chromosomal rearrangements. However, their repetitive nature has complicated the efforts to sequence them, analyse their structure and infer how they evolved. Here, we use recent genome assemblies of Chlamydomonas reinhardtii based on long-read sequencing to comprehensively describe the subtelomere architecture of the 17 chromosomes of this model unicellular green alga. We identify three main repeated elements present at subtelomeres, which we call Sultan, Subtile and Suber, alongside three chromosome extremities with ribosomal DNA as the only identified component of their subtelomeres. The most common architecture, present in 27 out of 34 subtelomeres, is a heterochromatic array of Sultan elements adjacent to the telomere, followed by a transcribed Spacer sequence, a G-rich microsatellite and transposable elements. Sequence similarity analyses suggest that Sultan elements underwent segmental duplications within each subtelomere and rearranged between subtelomeres at a much lower frequency. Analysis of other green algae reveals species-specific repeated elements that are shared across subtelomeres, with an overall organization similar to C. reinhardtii. This work uncovers the complexity and evolution of subtelomere architecture in green algae.
Collapse
Affiliation(s)
- Frédéric Chaux-Jukic
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Samuel O'Donnell
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Rory J Craig
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, Edinburgh, UK
| | - Stephan Eberhard
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Olivier Vallon
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Zhou Xu
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| |
Collapse
|
13
|
Chaux-Jukic F, O'Donnell S, Craig RJ, Eberhard S, Vallon O, Xu Z. Architecture and evolution of subtelomeres in the unicellular green alga Chlamydomonas reinhardtii. Nucleic Acids Res 2021. [PMID: 34165564 DOI: 10.1101/2021.01.29.428817)] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2023] Open
Abstract
In most eukaryotes, subtelomeres are dynamic genomic regions populated by multi-copy sequences of different origins, which can promote segmental duplications and chromosomal rearrangements. However, their repetitive nature has complicated the efforts to sequence them, analyse their structure and infer how they evolved. Here, we use recent genome assemblies of Chlamydomonas reinhardtii based on long-read sequencing to comprehensively describe the subtelomere architecture of the 17 chromosomes of this model unicellular green alga. We identify three main repeated elements present at subtelomeres, which we call Sultan, Subtile and Suber, alongside three chromosome extremities with ribosomal DNA as the only identified component of their subtelomeres. The most common architecture, present in 27 out of 34 subtelomeres, is a heterochromatic array of Sultan elements adjacent to the telomere, followed by a transcribed Spacer sequence, a G-rich microsatellite and transposable elements. Sequence similarity analyses suggest that Sultan elements underwent segmental duplications within each subtelomere and rearranged between subtelomeres at a much lower frequency. Analysis of other green algae reveals species-specific repeated elements that are shared across subtelomeres, with an overall organization similar to C. reinhardtii. This work uncovers the complexity and evolution of subtelomere architecture in green algae.
Collapse
Affiliation(s)
- Frédéric Chaux-Jukic
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Samuel O'Donnell
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| | - Rory J Craig
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, Edinburgh, UK
| | - Stephan Eberhard
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Olivier Vallon
- Sorbonne Université, CNRS, UMR7141, Institut de Biologie Physico-Chimique, Laboratory of Chloroplast Biology and Light-Sensing in Microalgae, 75005 Paris, France
| | - Zhou Xu
- Sorbonne Université, CNRS, UMR7238, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, 75005 Paris, France
| |
Collapse
|
14
|
Virus-like insertions with sequence signatures similar to those of endogenous nonretroviral RNA viruses in the human genome. Proc Natl Acad Sci U S A 2021; 118:2010758118. [PMID: 33495343 DOI: 10.1073/pnas.2010758118] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Understanding the genetics and taxonomy of ancient viruses will give us great insights into not only the origin and evolution of viruses but also how viral infections played roles in our evolution. Endogenous viruses are remnants of ancient viral infections and are thought to retain the genetic characteristics of viruses from ancient times. In this study, we used machine learning of endogenous RNA virus sequence signatures to identify viruses in the human genome that have not been detected or are already extinct. Here, we show that the k-mer occurrence of ancient RNA viral sequences remains similar to that of extant RNA viral sequences and can be differentiated from that of other human genome sequences. Furthermore, using this characteristic, we screened RNA viral insertions in the human reference genome and found virus-like insertions with phylogenetic and evolutionary features indicative of an exogenous origin but lacking homology to previously identified sequences. Our analysis indicates that animal genomes still contain unknown virus-derived sequences and provides a glimpse into the diversity of the ancient virosphere.
Collapse
|
15
|
Orozco-Arias S, Jaimes PA, Candamil MS, Jiménez-Varón CF, Tabares-Soto R, Isaza G, Guyot R. InpactorDB: A Classified Lineage-Level Plant LTR Retrotransposon Reference Library for Free-Alignment Methods Based on Machine Learning. Genes (Basel) 2021; 12:genes12020190. [PMID: 33525408 PMCID: PMC7910972 DOI: 10.3390/genes12020190] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 12/04/2022] Open
Abstract
Long terminal repeat (LTR) retrotransposons are mobile elements that constitute the major fraction of most plant genomes. The identification and annotation of these elements via bioinformatics approaches represent a major challenge in the era of massive plant genome sequencing. In addition to their involvement in genome size variation, LTR retrotransposons are also associated with the function and structure of different chromosomal regions and can alter the function of coding regions, among others. Several sequence databases of plant LTR retrotransposons are available for public access, such as PGSB and RepetDB, or restricted access such as Repbase. Although these databases are useful to identify LTR-RTs in new genomes by similarity, the elements of these databases are not fully classified to the lineage (also called family) level. Here, we present InpactorDB, a semi-curated dataset composed of 130,439 elements from 195 plant genomes (belonging to 108 plant species) classified to the lineage level. This dataset has been used to train two deep neural networks (i.e., one fully connected and one convolutional) for the rapid classification of these elements. In lineage-level classification approaches, we obtain up to 98% performance, indicated by the F1-score, precision and recall scores.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
- Department of Systems and Informatics, Universidad de Caldas, 170002 Manizales, Colombia;
- Correspondence: (S.O.-A.); (R.G.)
| | - Paula A. Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
| | - Mariana S. Candamil
- Department of Computer Science, Universidad Autónoma de Manizales, 170002 Manizales, Colombia; (P.A.J.); (M.S.C.)
| | | | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, 170002 Manizales, Colombia;
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, 170002 Manizales, Colombia;
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, 170002 Manizales, Colombia;
- Institut de Recherche pour le Développement, CIRAD, University of Montpellier, 34394 Montpellier, France
- Correspondence: (S.O.-A.); (R.G.)
| |
Collapse
|
16
|
Maiwald S, Weber B, Seibt KM, Schmidt T, Heitkam T. The Cassandra retrotransposon landscape in sugar beet (Beta vulgaris) and related Amaranthaceae: recombination and re-shuffling lead to a high structural variability. ANNALS OF BOTANY 2021; 127:91-109. [PMID: 33009553 PMCID: PMC7750724 DOI: 10.1093/aob/mcaa176] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 09/28/2020] [Indexed: 05/26/2023]
Abstract
BACKGROUND AND AIMS Plant genomes contain many retrotransposons and their derivatives, which are subject to rapid sequence turnover. As non-autonomous retrotransposons do not encode any proteins, they experience reduced selective constraints leading to their diversification into multiple families, usually limited to a few closely related species. In contrast, the non-coding Cassandra terminal repeat retrotransposons in miniature (TRIMs) are widespread in many plants. Their hallmark is a conserved 5S rDNA-derived promoter in their long terminal repeats (LTRs). As sugar beet (Beta vulgaris) has a well-described LTR retrotransposon landscape, we aim to characterize TRIMs in beet and related genomes. METHODS We identified Cassandra retrotransposons in the sugar beet reference genome and characterized their structural relationships. Genomic organization, chromosomal localization, and distribution of Cassandra-TRIMs across the Amaranthaceae were verified by Southern and fluorescent in situ hybridization. KEY RESULTS All 638 Cassandra sequences in the sugar beet genome contain conserved LTRs and thus constitute a single family. Nevertheless, variable internal regions required a subdivision into two Cassandra subfamilies within B. vulgaris. The related Chenopodium quinoa harbours a third subfamily. These subfamilies vary in their distribution within Amaranthaceae genomes, their insertion times and the degree of silencing by small RNAs. Cassandra retrotransposons gave rise to many structural variants, such as solo LTRs or tandemly arranged Cassandra retrotransposons. These Cassandra derivatives point to an interplay of template switch and recombination processes - mechanisms that likely caused Cassandra's subfamily formation and diversification. CONCLUSIONS We traced the evolution of Cassandra in the Amaranthaceae and detected a considerable variability within the short internal regions, whereas the LTRs are strongly conserved in sequence and length. Presumably these hallmarks make Cassandra a prime target for unequal recombination, resulting in the observed structural diversity, an example of the impact of LTR-mediated evolutionary mechanisms on the host genome.
Collapse
Affiliation(s)
- Sophie Maiwald
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Beatrice Weber
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Tony Heitkam
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
17
|
Zavallo D, Crescente JM, Gantuz M, Leone M, Vanzetti LS, Masuelli RW, Asurmendi S. Genomic re-assessment of the transposable element landscape of the potato genome. PLANT CELL REPORTS 2020; 39:1161-1174. [PMID: 32435866 DOI: 10.1007/s00299-020-02554-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 05/07/2020] [Indexed: 05/14/2023]
Abstract
We provide a comprehensive and reliable potato TE landscape, based on a wide variety of identification tools and integrative approaches, producing clear and ready-to-use outputs for the scientific community. Transposable elements (TEs) are DNA sequences with the ability to autoreplicate and move throughout the host genome. TEs are major drivers in stress response and genome evolution. Given their significance, the development of clear and efficient TE annotation pipelines has become essential for many species. The latest de novo TE discovery tools, along with available TEs from Repbase and sRNA-seq data, allowed us to perform a reliable potato TEs detection, classification and annotation through an open-source and freely available pipeline ( https://github.com/DiegoZavallo/TE_Discovery ). Using a variety of tools, approaches and rules, we were able to provide a clearly annotated of characterized TEs landscape. Additionally, we described the distribution of the different types of TEs across the genome, where LTRs and MITEs present a clear clustering pattern in pericentromeric and subtelomeric/telomeric regions respectively. Finally, we analyzed the insertion age and distribution of LTR retrotransposon families which display a distinct pattern between the two major superfamilies. While older Gypsy elements concentrated around heterochromatic regions, younger Copia elements located predominantly on euchromatic regions. Overall, we delivered not only a reliable, ready-to-use potato TE annotation files, but also all the necessary steps to perform de novo detection for other species.
Collapse
Affiliation(s)
- Diego Zavallo
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| | - Juan Manuel Crescente
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Magdalena Gantuz
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Melisa Leone
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina
- Agencia Nacional de Promocion Científica y Tecnológica (ANPCyT), Buenos Aires, Argentina
| | - Leonardo Sebastian Vanzetti
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ricardo Williams Masuelli
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Sebastian Asurmendi
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| |
Collapse
|
18
|
The unusual dRemp retrotransposon is abundant, highly mutagenic, and mobilized only in the second pollen mitosis of some maize lines. Proc Natl Acad Sci U S A 2020; 117:18091-18098. [PMID: 32661148 DOI: 10.1073/pnas.2010234117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The frequent mutations recovered recently from the pollen of select maize lines resulted from the meiotic mobilization of specific low-copy number long-terminal repeat (LTR) retrotransposons, which differ among lines. Mutations that arise at male meiosis produce kernels with concordant mutant phenotypes in both endosperm and embryo because the two sperms that participate in double fertilization are genetically identical. Those are in a majority. However, a small minority of kernels with a mutant endosperm carry a nonconcordant normal embryo, pointing to a postmeiotic or microgametophytic origin. In this study, we have identified the basis for those nonconcordant mutations. We find that all are produced by transposition of a defective LTR retrotransposon that we have termed dRemp (defective retroelement mobile in pollen). This element has several unique properties. Unlike the mutagenic LTR retrotransposons identified previously, dRemp is present in hundreds of copies in all sequenced lines. It seems to transpose only at the second pollen mitosis because all dRemp insertion mutants are nonconcordant yet recoverable in either the endosperm or the embryo. Although it does not move in most lines, dRemp is highly mobile in the Corn Belt inbred M14, identified earlier by breeders as being highly unstable. Lastly, it can be recovered in an array of structures, ranging from solo LTRs to tandem dRemp repeats containing several internal LTRs, suggestive of extensive recombination during retrotransposition. These results shed further light on the spontaneous mutation process and on the possible basis for inbred instability in maize.
Collapse
|
19
|
Kalendar R, Raskina O, Belyayev A, Schulman AH. Long Tandem Arrays of Cassandra Retroelements and Their Role in Genome Dynamics in Plants. Int J Mol Sci 2020; 21:ijms21082931. [PMID: 32331257 PMCID: PMC7215508 DOI: 10.3390/ijms21082931] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 02/07/2023] Open
Abstract
Retrotransposable elements are widely distributed and diverse in eukaryotes. Their copy number increases through reverse-transcription-mediated propagation, while they can be lost through recombinational processes, generating genomic rearrangements. We previously identified extensive structurally uniform retrotransposon groups in which no member contains the gag, pol, or env internal domains. Because of the lack of protein-coding capacity, these groups are non-autonomous in replication, even if transcriptionally active. The Cassandra element belongs to the non-autonomous group called terminal-repeat retrotransposons in miniature (TRIM). It carries 5S RNA sequences with conserved RNA polymerase (pol) III promoters and terminators in its long terminal repeats (LTRs). Here, we identified multiple extended tandem arrays of Cassandra retrotransposons within different plant species, including ferns. At least 12 copies of repeated LTRs (as the tandem unit) and internal domain (as a spacer), giving a pattern that resembles the cellular 5S rRNA genes, were identified. A cytogenetic analysis revealed the specific chromosomal pattern of the Cassandra retrotransposon with prominent clustering at and around 5S rDNA loci. The secondary structure of the Cassandra retroelement RNA is predicted to form super-loops, in which the two LTRs are complementary to each other and can initiate local recombination, leading to the tandem arrays of Cassandra elements. The array structures are conserved for Cassandra retroelements of different species. We speculate that recombination events similar to those of 5S rRNA genes may explain the wide variation in Cassandra copy number. Likewise, the organization of 5S rRNA gene sequences is very variable in flowering plants; part of what is taken for 5S gene copy variation may be variation in Cassandra number. The role of the Cassandra 5S sequences remains to be established.
Collapse
Affiliation(s)
- Ruslan Kalendar
- Department of Agricultural Sciences, University of Helsinki, P.O. Box 27 (Latokartanonkaari 5), FI-00014 Helsinki, Finland
- RSE “National Center for Biotechnology”, Korgalzhyn Highway 13/5, Nur-Sultan 010000, Kazakhstan
- Correspondence: (R.K.); (A.H.S.)
| | - Olga Raskina
- Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel;
| | - Alexander Belyayev
- Laboratory of Molecular Cytogenetics and Karyology, Institute of Botany of the ASCR, Zámek 1, CZ-252 43 Průhonice, Czech Republic;
| | - Alan H. Schulman
- Natural Resources Institute Finland (Luke), Latokartanonkaari 9, FI-00790 Helsinki, Finland
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65, FI-00014 Helsinki, Finland
- Correspondence: (R.K.); (A.H.S.)
| |
Collapse
|
20
|
Akakpo R, Carpentier MC, Ie Hsing Y, Panaud O. The impact of transposable elements on the structure, evolution and function of the rice genome. THE NEW PHYTOLOGIST 2020; 226:44-49. [PMID: 31797393 DOI: 10.1111/nph.16356] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Accepted: 11/05/2019] [Indexed: 06/10/2023]
Abstract
Transposable elements (TEs) are ubiquitous in plants and are the primary genomic component of the majority of taxa. Knowledge of their impact on the structure, function and evolution of plant genomes is therefore a priority in the field of genomics. Rice, as one of the most prevalent crops for food security worldwide, has been subjected to intense research efforts over recent decades. Consequently, a considerable amount of genomic resources has been generated and made freely available to the scientific community. These can be exploited both to improve our understanding of some basic aspects of genome biology of this species and to develop new concepts for crop improvement. In this review, we describe the current knowledge on how TEs have shaped rice chromosomes and propose a new strategy based on a genome-wide association study (GWAS) to address the important question of their functional impact on this crop.
Collapse
Affiliation(s)
- Roland Akakpo
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS/UPVD, Université de Perpignan, Via Domitia, 52 Avenue Paul Alduy, 66860, Perpignan Cedex, France
| | - Marie-Christine Carpentier
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS/UPVD, Université de Perpignan, Via Domitia, 52 Avenue Paul Alduy, 66860, Perpignan Cedex, France
| | - Yue Ie Hsing
- Institute of Plant and Microbial Biology, Acadeia Sinica, 128, Section 2, Yien-chu-yuan Road, Nankang, 115, Taipei, Taiwan
| | - Olivier Panaud
- Laboratoire Génome et Développement des Plantes, UMR 5096 CNRS/UPVD, Université de Perpignan, Via Domitia, 52 Avenue Paul Alduy, 66860, Perpignan Cedex, France
- Institut Universitaire de France, 1 Rue Descartes, 75231, Paris Cedex 05, France
| |
Collapse
|
21
|
Badet T, Oggenfuss U, Abraham L, McDonald BA, Croll D. A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici. BMC Biol 2020; 18:12. [PMID: 32046716 PMCID: PMC7014611 DOI: 10.1186/s12915-020-0744-3] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/27/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The gene content of a species largely governs its ecological interactions and adaptive potential. A species is therefore defined by both core genes shared between all individuals and accessory genes segregating presence-absence variation. There is growing evidence that eukaryotes, similar to bacteria, show intra-specific variability in gene content. However, it remains largely unknown how functionally relevant such a pangenome structure is for eukaryotes and what mechanisms underlie the emergence of highly polymorphic genome structures. RESULTS Here, we establish a reference-quality pangenome of a fungal pathogen of wheat based on 19 complete genomes from isolates sampled across six continents. Zymoseptoria tritici causes substantial worldwide losses to wheat production due to rapidly evolved tolerance to fungicides and evasion of host resistance. We performed transcriptome-assisted annotations of each genome to construct a global pangenome. Major chromosomal rearrangements are segregating within the species and underlie extensive gene presence-absence variation. Conserved orthogroups account for only ~ 60% of the species pangenome. Investigating gene functions, we find that the accessory genome is enriched for pathogenesis-related functions and encodes genes involved in metabolite production, host tissue degradation and manipulation of the immune system. De novo transposon annotation of the 19 complete genomes shows that the highly diverse chromosomal structure is tightly associated with transposable element content. Furthermore, transposable element expansions likely underlie recent genome expansions within the species. CONCLUSIONS Taken together, our work establishes a highly complex eukaryotic pangenome providing an unprecedented toolbox to study how pangenome structure impacts crop-pathogen interactions.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| |
Collapse
|
22
|
Badet T, Oggenfuss U, Abraham L, McDonald BA, Croll D. A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici. BMC Biol 2020; 18:12. [PMID: 32046716 DOI: 10.1101/803098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/27/2020] [Indexed: 05/26/2023] Open
Abstract
BACKGROUND The gene content of a species largely governs its ecological interactions and adaptive potential. A species is therefore defined by both core genes shared between all individuals and accessory genes segregating presence-absence variation. There is growing evidence that eukaryotes, similar to bacteria, show intra-specific variability in gene content. However, it remains largely unknown how functionally relevant such a pangenome structure is for eukaryotes and what mechanisms underlie the emergence of highly polymorphic genome structures. RESULTS Here, we establish a reference-quality pangenome of a fungal pathogen of wheat based on 19 complete genomes from isolates sampled across six continents. Zymoseptoria tritici causes substantial worldwide losses to wheat production due to rapidly evolved tolerance to fungicides and evasion of host resistance. We performed transcriptome-assisted annotations of each genome to construct a global pangenome. Major chromosomal rearrangements are segregating within the species and underlie extensive gene presence-absence variation. Conserved orthogroups account for only ~ 60% of the species pangenome. Investigating gene functions, we find that the accessory genome is enriched for pathogenesis-related functions and encodes genes involved in metabolite production, host tissue degradation and manipulation of the immune system. De novo transposon annotation of the 19 complete genomes shows that the highly diverse chromosomal structure is tightly associated with transposable element content. Furthermore, transposable element expansions likely underlie recent genome expansions within the species. CONCLUSIONS Taken together, our work establishes a highly complex eukaryotic pangenome providing an unprecedented toolbox to study how pangenome structure impacts crop-pathogen interactions.
Collapse
Affiliation(s)
- Thomas Badet
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Ursula Oggenfuss
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Leen Abraham
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Bruce A McDonald
- Plant Pathology, Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland.
| |
Collapse
|
23
|
Wang PL, Luchetti A, Alberto Ruggieri A, Xiong XM, Xu MRX, Zhang XG, Zhang HH. Successful Invasions of Short Internally Deleted Elements (SIDEs) and Its Partner CR1 in Lepidoptera Insects. Genome Biol Evol 2019; 11:2505-2516. [PMID: 31384954 PMCID: PMC6740152 DOI: 10.1093/gbe/evz174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/01/2019] [Indexed: 11/28/2022] Open
Abstract
Although DNA transposons often generated internal deleted derivatives such as miniature inverted-repeat transposable elements, short internally deleted elements (SIDEs) derived from nonlong terminal-repeat retrotransposons are rare. Here, we found a novel SIDE, named Persaeus, that originated from the chicken repeat 1 (CR1) retrotransposon Zenon and it has been found widespread in Lepidoptera insects. Our findings suggested that Persaeus and the partner Zenon have experienced a transposition burst in their host genomes and the copy number of Persaeus and Zenon in assayed genomes are significantly correlated. Accordingly, the activity though age analysis indicated that the replication wave of Persaeus coincided with that of Zenon. Phylogenetic analyses suggested that Persaeus may have evolved at least four times independently, and that it has been vertically transferred into its host genomes. Together, our results provide new insights into the evolution dynamics of SIDEs and its partner non-LTRs.
Collapse
Affiliation(s)
- Ping-Lan Wang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Italy
| | | | | | - Min-Rui-Xuan Xu
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Xiao-Gu Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| | - Hua-Hao Zhang
- College of Pharmacy and Life Science, Jiujiang University, China
| |
Collapse
|
24
|
Orozco-Arias S, Isaza G, Guyot R. Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning. Int J Mol Sci 2019; 20:E3837. [PMID: 31390781 PMCID: PMC6696364 DOI: 10.3390/ijms20153837] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 01/26/2023] Open
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Due to their natural repetitive numbers and their high structural diversity, the identification and classification of TEs remain a challenge in sequenced genomes. Although TEs were initially regarded as "junk DNA", it has been demonstrated that they play key roles in chromosome structures, gene expression, and regulation, as well as adaptation and evolution. A highly reliable annotation of these elements is, therefore, crucial to better understand genome functions and their evolution. To date, much bioinformatics software has been developed to address TE detection and classification processes, but many problematic aspects remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically for TEs, classification with encouraging results. In this review, we will discuss important aspects of TEs, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We will also address current methods and their limitations in identifying and classifying TEs.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales 170001, Colombia
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Romain Guyot
- Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales 170001, Colombia.
- Institut de Recherche pour le Développement, CIRAD, University Montpellier, 34000 Montpellier, France.
| |
Collapse
|
25
|
Shi J, Liang C. Generic Repeat Finder: A High-Sensitivity Tool for Genome-Wide De Novo Repeat Detection. PLANT PHYSIOLOGY 2019; 180:1803-1815. [PMID: 31152127 PMCID: PMC6670090 DOI: 10.1104/pp.19.00386] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Accepted: 05/17/2019] [Indexed: 05/25/2023]
Abstract
Comprehensive and accurate annotation of the repeatome, including transposons, is critical for deepening our understanding of repeat origins, biogenesis, regulatory mechanisms, and roles. Here, we developed Generic Repeat Finder (GRF), a tool for genome-wide repeat detection based on fast, exhaustive numerical calculation algorithms integrated with optimized dynamic programming strategies. GRF sensitively identifies terminal inverted repeats (TIRs), terminal direct repeats (TDRs), and interspersed repeats that bear both inverted and direct repeats. GRF also detects DNA or RNA transposable elements characterized by these repeats in plant and animal genomes. For TIRs and TDRs, GRF identifies spacers in the middle and mismatches/insertions or deletions in terminal repeats, showing their alignment or base-pairing information. GRF helps improve the annotation for various DNA transposons and retrotransposons, such as miniature inverted-repeat transposable elements (MITEs), long terminal repeat (LTR) retrotransposons, and non-LTR retrotransposons, including long interspersed nuclear elements and short interspersed nuclear elements in plants. We used GRF to perform TIR/TDR, interspersed-repeat, and MITE detection in several species, including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and mouse (Mus musculus). As a generic bioinformatics tool in repeat finding implemented as a parallelized C++ program, GRF was faster and more sensitive than the existing inverted repeat/MITE detection tools based on numerical approaches (i.e. detectIR and detectMITE) in Arabidopsis and mouse. GRF is more sensitive than Inverted Repeat Finder in TIR detection, LTR_FINDER in short TDR detection (≤1,000 nt), and phRAIDER in interspersed repeat detection in Arabidopsis and rice. GRF is an open source available from Github.
Collapse
Affiliation(s)
- Jieming Shi
- Department of Biology, Miami University, Oxford, Ohio 45056
| | - Chun Liang
- Department of Biology, Miami University, Oxford, Ohio 45056
- Department of Computer Science and Software Engineering, Miami University, Oxford, Ohio 45056
| |
Collapse
|
26
|
Gao D, Chu Y, Xia H, Xu C, Heyduk K, Abernathy B, Ozias-Akins P, Leebens-Mack JH, Jackson SA. Horizontal Transfer of Non-LTR Retrotransposons from Arthropods to Flowering Plants. Mol Biol Evol 2019; 35:354-364. [PMID: 29069493 PMCID: PMC5850137 DOI: 10.1093/molbev/msx275] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Even though lateral movements of transposons across families and even phyla within multicellular eukaryotic kingdoms have been found, little is known about transposon transfer between the kingdoms Animalia and Plantae. We discovered a novel non-LTR retrotransposon, AdLINE3, in a wild peanut species. Sequence comparisons and phylogenetic analyses indicated that AdLINE3 is a member of the RTE clade, originally identified in a nematode and rarely reported in plants. We identified RTE elements in 82 plants, spanning angiosperms to algae, including recently active elements in some flowering plants. RTE elements in flowering plants were likely derived from a single family we refer to as An-RTE. Interestingly, An-RTEs show significant DNA sequence identity with non-LTR retroelements from 42 animals belonging to four phyla. Moreover, the sequence identity of RTEs between two arthropods and two plants was higher than that of homologous genes. Phylogenetic and evolutionary analyses of RTEs from both animals and plants suggest that the An-RTE family was likely transferred horizontally into angiosperms from an ancient aphid(s) or ancestral arthropod(s). Notably, some An-RTEs were recruited as coding sequences of functional genes participating in metabolic or other biochemical processes in plants. This is the first potential example of horizontal transfer of transposons between animals and flowering plants. Our findings help to understand exchanges of genetic material between the kingdom Animalia and Plantae and suggest arthropods likely impacted on plant genome evolution.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | - Ye Chu
- Department of Horticulture, University of Georgia, Tifton, GA
| | - Han Xia
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA.,Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Jinan, Shandong, China
| | - Chunming Xu
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | - Karolina Heyduk
- Department of Plant Biology, University of Georgia, Athens, GA
| | - Brian Abernathy
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| | | | | | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA
| |
Collapse
|
27
|
Morata J, Marín F, Payet J, Casacuberta JM. Plant Lineage-Specific Amplification of Transcription Factor Binding Motifs by Miniature Inverted-Repeat Transposable Elements (MITEs). Genome Biol Evol 2018; 10:1210-1220. [PMID: 29659815 PMCID: PMC5950925 DOI: 10.1093/gbe/evy073] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/04/2018] [Indexed: 12/20/2022] Open
Abstract
Transposable elements are one of the main drivers of plant genome evolution. Transposon insertions can modify the gene coding capacity or the regulation of their expression, the latter being a more subtle effect, and therefore particularly useful for evolution. Transposons have been show to contain transcription factor binding sites that can be mobilized upon transposition with the potential to integrate new genes into transcriptional networks. Miniature inverted-repeat transposable elements (MITEs) are a type of noncoding DNA transposons that could be particularly suited as a vector to mobilize transcription factor binding sites and modify transcriptional networks during evolution. MITEs are small in comparison to other transposons and can be excised, which should make them less mutagenic when inserting into promoters. On the other hand, in spite of their cut-and-paste mechanisms of transposition, they can reach very high copy numbers in genomes. We have previously shown that MITEs have amplified and redistributed the binding motif of the E2F transcription factor in different Brassicas. Here, we show that MITEs have amplified and mobilized the binding motifs of the bZIP60 and PIF3 transcription factors in peach and Prunus mume, and the TCP15/23 binding motif in tomato. Our results suggest that MITEs could have rewired new genes into transcriptional regulatory networks that are responsible for important adaptive responses and breeding traits in plants, such as stress responses, flowering time, or fruit ripening. The results presented here therefore suggest a general impact of MITEs in the evolution of transcriptional regulatory networks in plants.
Collapse
Affiliation(s)
- Jordi Morata
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| | - Fatima Marín
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| | | | - Josep M Casacuberta
- CRAG (CSIC-IRTA-UAB-UB) Campus UAB, Bellaterra, Cerdanyola del Vallès, Barcelona, Spain
| |
Collapse
|
28
|
Kazachkova Y, Eshel G, Pantha P, Cheeseman JM, Dassanayake M, Barak S. Halophytism: What Have We Learnt From Arabidopsis thaliana Relative Model Systems? PLANT PHYSIOLOGY 2018; 178:972-988. [PMID: 30237204 PMCID: PMC6236594 DOI: 10.1104/pp.18.00863] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 08/31/2018] [Indexed: 05/06/2023]
Abstract
Halophytes are able to thrive in salt concentrations that would kill 99% of other plant species, and identifying their salt-adaptive mechanisms has great potential for improving the tolerance of crop plants to salinized soils. Much research has focused on the physiological basis of halophyte salt tolerance, whereas the elucidation of molecular mechanisms has traditionally lagged behind due to the absence of a model halophyte system. However, over the last decade and a half, two Arabidopsis (Arabidopsis thaliana) relatives, Eutrema salsugineum and Schrenkiella parvula, have been established as transformation-competent models with various genetic resources including high-quality genome assemblies. These models have facilitated powerful comparative analyses with salt-sensitive Arabidopsis to unravel the genetic adaptations that enable a halophytic lifestyle. The aim of this review is to explore what has been learned about halophytism using E. salsugineum and S. parvula We consider evidence from physiological and molecular studies suggesting that differences in salt tolerance between related halophytes and salt-sensitive plants are associated with alterations in the regulation of basic physiological, biochemical, and molecular processes. Furthermore, we discuss how salt tolerance mechanisms of the halophytic models are reflected at the level of their genomes, where evolutionary processes such as subfunctionalization and/or neofunctionalization have altered the expression and/or functions of genes to facilitate adaptation to saline conditions. Lastly, we summarize the many areas of research still to be addressed with E. salsugineum and S. parvula as well as obstacles hindering further progress in understanding halophytism.
Collapse
Affiliation(s)
- Yana Kazachkova
- French Associates' Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben-Gurion 8499000, Israel
| | - Gil Eshel
- French Associates' Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben-Gurion 8499000, Israel
| | - Pramod Pantha
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803
| | - John M Cheeseman
- Department of Plant Biology, University of Illinois, Urbana, Illinois 61801
| | - Maheshi Dassanayake
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803
| | - Simon Barak
- French Associates' Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben-Gurion 8499000, Israel
| |
Collapse
|
29
|
Sahebi M, Hanafi MM, van Wijnen AJ, Rice D, Rafii MY, Azizi P, Osman M, Taheri S, Bakar MFA, Isa MNM, Noor YM. Contribution of transposable elements in the plant's genome. Gene 2018; 665:155-166. [PMID: 29684486 DOI: 10.1016/j.gene.2018.04.050] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 04/04/2018] [Accepted: 04/18/2018] [Indexed: 12/26/2022]
Abstract
Plants maintain extensive growth flexibility under different environmental conditions, allowing them to continuously and rapidly adapt to alterations in their environment. A large portion of many plant genomes consists of transposable elements (TEs) that create new genetic variations within plant species. Different types of mutations may be created by TEs in plants. Many TEs can avoid the host's defense mechanisms and survive alterations in transposition activity, internal sequence and target site. Thus, plant genomes are expected to utilize a variety of mechanisms to tolerate TEs that are near or within genes. TEs affect the expression of not only nearby genes but also unlinked inserted genes. TEs can create new promoters, leading to novel expression patterns or alternative coding regions to generate alternate transcripts in plant species. TEs can also provide novel cis-acting regulatory elements that act as enhancers or inserts within original enhancers that are required for transcription. Thus, the regulation of plant gene expression is strongly managed by the insertion of TEs into nearby genes. TEs can also lead to chromatin modifications and thereby affect gene expression in plants. TEs are able to generate new genes and modify existing gene structures by duplicating, mobilizing and recombining gene fragments. They can also facilitate cellular functions by sharing their transposase-coding regions. Hence, TE insertions can not only act as simple mutagens but can also alter the elementary functions of the plant genome. Here, we review recent discoveries concerning the contribution of TEs to gene expression in plant genomes and discuss the different mechanisms by which TEs can affect plant gene expression and reduce host defense mechanisms.
Collapse
Affiliation(s)
- Mahbod Sahebi
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia.
| | - Mohamed M Hanafi
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia; Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia; Department of Land Management, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia.
| | | | - David Rice
- Department of Molecular Biology & Biotecnology, University of Sheffield, United Kingdom
| | - M Y Rafii
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Parisa Azizi
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Mohamad Osman
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Sima Taheri
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | | | | | | |
Collapse
|
30
|
McCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu S, Sims D, Kennedy M, Amirebrahimi M, Weers BD, McKinley B, Mattison A, Morishige DT, Grimwood J, Schmutz J, Mullet JE. The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 93:338-354. [PMID: 29161754 DOI: 10.1111/tpj.13781] [Citation(s) in RCA: 282] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Revised: 11/05/2017] [Accepted: 11/14/2017] [Indexed: 05/20/2023]
Abstract
Sorghum bicolor is a drought tolerant C4 grass used for the production of grain, forage, sugar, and lignocellulosic biomass and a genetic model for C4 grasses due to its relatively small genome (approximately 800 Mbp), diploid genetics, diverse germplasm, and colinearity with other C4 grass genomes. In this study, deep sequencing, genetic linkage analysis, and transcriptome data were used to produce and annotate a high-quality reference genome sequence. Reference genome sequence order was improved, 29.6 Mbp of additional sequence was incorporated, the number of genes annotated increased 24% to 34 211, average gene length and N50 increased, and error frequency was reduced 10-fold to 1 per 100 kbp. Subtelomeric repeats with characteristics of Tandem Repeats in Miniature (TRIM) elements were identified at the termini of most chromosomes. Nucleosome occupancy predictions identified nucleosomes positioned immediately downstream of transcription start sites and at different densities across chromosomes. Alignment of more than 50 resequenced genomes from diverse sorghum genotypes to the reference genome identified approximately 7.4 M single nucleotide polymorphisms (SNPs) and 1.9 M indels. Large-scale variant features in euchromatin were identified with periodicities of approximately 25 kbp. A transcriptome atlas of gene expression was constructed from 47 RNA-seq profiles of growing and developed tissues of the major plant organs (roots, leaves, stems, panicles, and seed) collected during the juvenile, vegetative and reproductive phases. Analysis of the transcriptome data indicated that tissue type and protein kinase expression had large influences on transcriptional profile clustering. The updated assembly, annotation, and transcriptome data represent a resource for C4 grass research and crop improvement.
Collapse
Affiliation(s)
- Ryan F McCormick
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Sandra K Truong
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | | | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Shengqiang Shu
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - David Sims
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Megan Kennedy
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Brock D Weers
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Brian McKinley
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Ashley Mattison
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Daryl T Morishige
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - John E Mullet
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| |
Collapse
|
31
|
Song Q, Zhang T, Stelly DM, Chen ZJ. Epigenomic and functional analyses reveal roles of epialleles in the loss of photoperiod sensitivity during domestication of allotetraploid cottons. Genome Biol 2017; 18:99. [PMID: 28558752 PMCID: PMC5450403 DOI: 10.1186/s13059-017-1229-8] [Citation(s) in RCA: 132] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2017] [Accepted: 05/03/2017] [Indexed: 02/08/2023] Open
Abstract
Background Polyploidy is a pervasive evolutionary feature of all flowering plants and some animals, leading to genetic and epigenetic changes that affect gene expression and morphology. DNA methylation changes can produce meiotically stable epialleles, which are transmissible through selection and breeding. However, the relationship between DNA methylation and polyploid plant domestication remains elusive. Results We report comprehensive epigenomic and functional analyses, including ~12 million differentially methylated cytosines in domesticated allotetraploid cottons and their tetraploid and diploid relatives. Methylated genes evolve faster than unmethylated genes; DNA methylation changes between homoeologous loci are associated with homoeolog-expression bias in the allotetraploids. Significantly, methylation changes induced in the interspecific hybrids are largely maintained in the allotetraploids. Among 519 differentially methylated genes identified between wild and cultivated cottons, some contribute to domestication traits, including flowering time and seed dormancy. CONSTANS (CO) and CO-LIKE (COL) genes regulate photoperiodicity in Arabidopsis. COL2 is an epiallele in allotetraploid cottons. COL2A is hypermethylated and silenced, while COL2D is repressed in wild cottons but highly expressed due to methylation loss in all domesticated cottons tested. Inhibiting DNA methylation activates COL2 expression, and repressing COL2 in cultivated cotton delays flowering. Conclusions We uncover epigenomic signatures of domestication traits during cotton evolution. Demethylation of COL2 increases its expression, inducing photoperiodic flowering, which could have contributed to the suitability of cotton for cultivation worldwide. These resources should facilitate epigenetic engineering, breeding, and improvement of polyploid crops. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1229-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Qingxin Song
- Department of Molecular Biosciences, Institute for Cellular and Molecular Biology, and Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Tianzhen Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 78743, USA
| | - Z Jeffrey Chen
- Department of Molecular Biosciences, Institute for Cellular and Molecular Biology, and Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, TX, 78712, USA. .,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, China.
| |
Collapse
|
32
|
Xia C, Zhang L, Zou C, Gu Y, Duan J, Zhao G, Wu J, Liu Y, Fang X, Gao L, Jiao Y, Sun J, Pan Y, Liu X, Jia J, Kong X. A TRIM insertion in the promoter of Ms2 causes male sterility in wheat. Nat Commun 2017; 8:15407. [PMID: 28497807 PMCID: PMC5437302 DOI: 10.1038/ncomms15407] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Accepted: 03/25/2017] [Indexed: 11/09/2022] Open
Abstract
The male-sterile ms2 mutant has been known for 40 years and has become extremely important in the commercial production of wheat. However, the gene responsible for this phenotype has remained unknown. Here we report the map-based cloning of the Ms2 gene. The Ms2 locus is remarkable in several ways that have implications in basic biology. Beyond having no functional annotation, barely detectable transcription in fertile wild-type wheat plants, and accumulated destructive mutations in Ms2 orthologs, the Ms2 allele in the ms2 mutant has acquired a terminal-repeat retrotransposon in miniature (TRIM) element in its promoter. This TRIM element is responsible for the anther-specific Ms2 activation that confers male sterility. The identification of Ms2 not only unravels the genetic basis of a historically important breeding trait, but also shows an example of how a TRIM element insertion near a gene can contribute to genetic novelty and phenotypic plasticity.
Collapse
Affiliation(s)
- Chuan Xia
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Lichao Zhang
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Cheng Zou
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yongqiang Gu
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Albany, California 94710, USA
| | - Jialei Duan
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Guangyao Zhao
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jiajie Wu
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yue Liu
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiaohua Fang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Lifeng Gao
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yuannian Jiao
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Jiaqiang Sun
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Yinghong Pan
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xu Liu
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jizeng Jia
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiuying Kong
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, Ministry of Agriculture, The National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
33
|
Genome sequencing and population genomic analyses provide insights into the adaptive landscape of silver birch. Nat Genet 2017; 49:904-912. [PMID: 28481341 DOI: 10.1038/ng.3862] [Citation(s) in RCA: 125] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Accepted: 04/12/2017] [Indexed: 12/26/2022]
Abstract
Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.
Collapse
|
34
|
Rey-Baños R, Sáenz de Miera LE, García P, Pérez de la Vega M. Obtaining retrotransposon sequences, analysis of their genomic distribution and use of retrotransposon-derived genetic markers in lentil (Lens culinaris Medik.). PLoS One 2017; 12:e0176728. [PMID: 28448614 PMCID: PMC5407846 DOI: 10.1371/journal.pone.0176728] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 04/14/2017] [Indexed: 12/02/2022] Open
Abstract
Retrotransposons with long terminal repeats (LTR-RTs) are widespread mobile elements in eukaryotic genomes. We obtained a total of 81 partial LTR-RT sequences from lentil corresponding to internal retrotransposon components and LTRs. Sequences were obtained by PCR from genomic DNA. Approximately 37% of the LTR-RT internal sequences presented premature stop codons, pointing out that these elements must be non-autonomous. LTR sequences were obtained using the iPBS technique which amplifies sequences between LTR-RTs. A total of 193 retrotransposon-derived genetic markers, mainly iPBS, were used to obtain a genetic linkage map from 94 F7 inbred recombinant lines derived from the cross between the cultivar Lupa and the wild ancestor L. culinaris subsp. orientalis. The genetic map included 136 markers located in eight linkage groups. Clusters of tightly linked retrotransposon-derived markers were detected in linkage groups LG1, LG2, and LG6, hence denoting a non-random genomic distribution. Phylogenetic analyses identified the LTR-RT families in which internal and LTR sequences are included. Ty3-gypsy elements were more frequent than Ty1-copia, mainly due to the high Ogre element frequency in lentil, as also occurs in other species of the tribe Vicieae. LTR and internal sequences were used to analyze in silico their distribution among the contigs of the lentil draft genome. Up to 8.8% of the lentil contigs evidenced the presence of at least one LTR-RT similar sequence. A statistical analysis suggested a non-random distribution of these elements within of the lentil genome. In most cases (between 97% and 72%, depending on the LTR-RT type) none of the internal sequences flanked by the LTR sequence pair was detected, suggesting that defective and non-autonomous LTR-RTs are very frequent in lentil. Results support that LTR-RTs are abundant and widespread throughout of the lentil genome and that they are a suitable source of genetic markers useful to carry out further genetic analyses.
Collapse
Affiliation(s)
- Rita Rey-Baños
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
| | - Luis E. Sáenz de Miera
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
- * E-mail:
| | - Pedro García
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
| | | |
Collapse
|
35
|
Abstract
We have witnessed an explosion in our understanding of the evolution and structure of plant genomes in recent years. Here, we highlight three important emergent realizations: (1) that the evolutionary history of all plant genomes contains multiple, cyclical episodes of whole-genome doubling that were followed by myriad fractionation processes; (2) that the vast majority of the variation in genome size reflects the dynamics of proliferation and loss of lineage-specific transposable elements; and (3) that various classes of small RNAs help shape genomic architecture and function. We illustrate ways in which understanding these organism-level and molecular genetic processes can be used for crop plant improvement.
Collapse
Affiliation(s)
- Jonathan F Wendel
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011, USA.
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA, 30602, USA
| | - Blake C Meyers
- Donald Danforth Plant Science Center, 975 North Warson Road, St. Louis, MO, 63132, USA.,Division of Plant Sciences, University of Missouri-Columbia, 52 Agriculture Laboratory, Columbia, MO, 65211, USA
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences and Department of Ecology and Evolutionary Biology, Tucson, AZ, 85750, USA.,T.T. Chang Genetic Resource Center, International Rice Research Institute, Los Baños, Laguna, Philippines
| |
Collapse
|