1
|
Ancient hybridization and repetitive element proliferation in the evolutionary history of the monocot genus Amomum (Zingiberaceae). FRONTIERS IN PLANT SCIENCE 2024; 15:1324358. [PMID: 38708400 PMCID: PMC11066291 DOI: 10.3389/fpls.2024.1324358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 03/12/2024] [Indexed: 05/07/2024]
Abstract
Genome size variation is a crucial aspect of plant evolution, influenced by a complex interplay of factors. Repetitive elements, which are fundamental components of genomic architecture, often play a role in genome expansion by selectively amplifying specific repeat motifs. This study focuses on Amomum, a genus in the ginger family (Zingiberaceae), known for its 4.4-fold variation in genome size. Using a robust methodology involving PhyloNet reconstruction, RepeatExplorer clustering, and repeat similarity-based phylogenetic network construction, we investigated the repeatome composition, analyzed repeat dynamics, and identified potential hybridization events within the genus. Our analysis confirmed the presence of four major infrageneric clades (A-D) within Amomum, with clades A-C exclusively comprising diploid species (2n = 48) and clade D encompassing both diploid and tetraploid species (2n = 48 and 96). We observed an increase in the repeat content within the genus, ranging from 84% to 89%, compared to outgroup species with 75% of the repeatome. The SIRE lineage of the Ty1-Copia repeat superfamily was prevalent in most analyzed ingroup genomes. We identified significant difference in repeatome structure between the basal Amomum clades (A, B, C) and the most diverged clade D. Our investigation revealed evidence of ancient hybridization events within Amomum, coinciding with a substantial proliferation of multiple repeat groups. This finding supports the hypothesis that ancient hybridization is a driving force in the genomic evolution of Amomum. Furthermore, we contextualize our findings within the broader context of genome size variations and repeatome dynamics observed across major monocot lineages. This study enhances our understanding of evolutionary processes within monocots by highlighting the crucial roles of repetitive elements in shaping genome size and suggesting the mechanisms that drive these changes.
Collapse
|
2
|
Genome assembly and annotation of the mermithid nematode Mermis nigrescens. G3 (BETHESDA, MD.) 2024; 14:jkae023. [PMID: 38301266 PMCID: PMC10989877 DOI: 10.1093/g3journal/jkae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 01/21/2024] [Accepted: 01/22/2024] [Indexed: 02/03/2024]
Abstract
Genetic studies of nematodes have been dominated by Caenorhabditis elegans as a model species. A lack of genomic resources has limited the expansion of genetic research to other groups of nematodes. Here, we report a draft genome assembly of a mermithid nematode, Mermis nigrescens. Mermithidae are insect parasitic nematodes with hosts including a wide range of terrestrial arthropods. We sequenced, assembled, and annotated the whole genome of M. nigrescens using nanopore long reads and 10X Chromium link reads. The assembly is 524 Mb in size consisting of 867 scaffolds. The N50 value is 2.42 Mb, and half of the assembly is in the 30 longest scaffolds. The assembly BUSCO score from the eukaryotic database (eukaryota_odb10) indicates that the genome is 86.7% complete and 5.1% partial. The genome has a high level of heterozygosity (6.6%) with a repeat content of 83.98%. mRNA-seq reads from different sized nematodes (≤2 cm, 3.5-7 cm, and >7 cm body length) representing different developmental stages were also generated and used for the genome annotation. Using ab initio and evidence-based gene model predictions, 12,313 protein-coding genes and 24,186 mRNAs were annotated. These genomic resources will help researchers investigate the various aspects of the biology and host-parasite interactions of mermithid nematodes.
Collapse
|
3
|
Expanding horizons of tandem repeats in biology and medicine: Why 'genomic dark matter' matters. Emerg Top Life Sci 2023; 7:ETLS20230075. [PMID: 38088823 PMCID: PMC10754335 DOI: 10.1042/etls20230075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 11/27/2023] [Accepted: 11/27/2023] [Indexed: 12/30/2023]
Abstract
Approximately half of the human genome includes repetitive sequences, and these DNA sequences (as well as their transcribed repetitive RNA and translated amino-acid repeat sequences) are known as the repeatome. Within this repeatome there are a couple of million tandem repeats, dispersed throughout the genome. These tandem repeats have been estimated to constitute ∼8% of the entire human genome. These tandem repeats can be located throughout exons, introns and intergenic regions, thus potentially affecting the structure and function of tandemly repetitive DNA, RNA and protein sequences. Over more than three decades, more than 60 monogenic human disorders have been found to be caused by tandem-repeat mutations. These monogenic tandem-repeat disorders include Huntington's disease, a variety of ataxias, amyotrophic lateral sclerosis and frontotemporal dementia, as well as many other neurodegenerative diseases. Furthermore, tandem-repeat disorders can include fragile X syndrome, related fragile X disorders, as well as other neurological and psychiatric disorders. However, these monogenic tandem-repeat disorders, which were discovered via their dominant or recessive modes of inheritance, may represent the 'tip of the iceberg' with respect to tandem-repeat contributions to human disorders. A previous proposal that tandem repeats may contribute to the 'missing heritability' of various common polygenic human disorders has recently been supported by a variety of new evidence. This includes genome-wide studies that associate tandem-repeat mutations with autism, schizophrenia, Parkinson's disease and various types of cancers. In this article, I will discuss how tandem-repeat mutations and polymorphisms could contribute to a wide range of common disorders, along with some of the many major challenges of tandem-repeat biology and medicine. Finally, I will discuss the potential of tandem repeats to be therapeutically targeted, so as to prevent and treat an expanding range of human disorders.
Collapse
|
4
|
Conserved satellite DNA motif and lack of interstitial telomeric sites in highly rearranged African Nothobranchius killifish karyotypes. JOURNAL OF FISH BIOLOGY 2023; 103:1501-1514. [PMID: 37661806 DOI: 10.1111/jfb.15550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 08/27/2023] [Accepted: 08/29/2023] [Indexed: 09/05/2023]
Abstract
Using African annual killifishes of the genus Nothobranchius from temporary savannah pools with rapid karyotype and sex chromosome evolution, we analysed the chromosomal distribution of telomeric (TTAGGG)n repeat and Nfu-SatC satellite DNA (satDNA; isolated from Nothobranchius furzeri) in 15 species across the Nothobranchius killifish phylogeny, and with Fundulosoma thierryi as an out-group. Our fluorescence in situ hybridization experiments revealed that all analysed taxa share the presence of Nfu-SatC repeat but with diverse organization and distribution on chromosomes. Nfu-SatC landscape was similar in conspecific populations of Nothobranchius guentheri and Nothobranchius melanospilus but slightly-to-moderately differed between populations of Nothobranchius pienaari, and between closely related Nothobranchius kuhntae and Nothobranchius orthonotus. Inter-individual variability in Nfu-SatC patterns was found in N. orthonotus and Nothobranchius krysanovi. We revealed mostly no sex-linked patterns of studied repetitive DNA distribution. Only in Nothobranchius brieni, possessing multiple sex chromosomes, Nfu-SatC repeat occupied a substantial portion of the neo-Y chromosome, similarly as formerly found in the XY sex chromosome system of turquoise killifish N. furzeri and its sister species Nothobranchius kadleci-representatives not closely related to N. brieni. All studied species further shared patterns of expected telomeric repeats at the ends of all chromosomes and no additional interstitial telomeric sites. In summary, we revealed (i) the presence of conserved satDNA class in Nothobranchius clades (a rare pattern among ray-finned fishes); (ii) independent trajectories of Nothobranchius sex chromosome differentiation, with recurrent and convergent accumulation of Nfu-SatC on the Y chromosome in some species; and (iii) genus-wide shared tendency to loss of telomeric repeats during interchromosomal rearrangements. Collectively, our findings advance our understanding of genome structure, mechanisms of karyotype reshuffling, and sex chromosome differentiation in Nothobranchius killifishes from the genus-wide perspective.
Collapse
|
5
|
Erratum: Aegilops crassa Boiss. repeatome characterized using low-coverage NGS as a source of new FISH markers: application in phylogenetic studies of the Triticeae. FRONTIERS IN PLANT SCIENCE 2023; 14:1207880. [PMID: 37521923 PMCID: PMC10374421 DOI: 10.3389/fpls.2023.1207880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 05/04/2023] [Indexed: 08/01/2023]
Abstract
[This corrects the article DOI: 10.3389/fpls.2022.980764.].
Collapse
|
6
|
Comparison of the evolutionary patterns of DNA repeats in ancient and young invertebrate species flocks of Lake Baikal. Vavilovskii Zhurnal Genet Selektsii 2023; 27:349-356. [PMID: 37465187 PMCID: PMC10350863 DOI: 10.18699/vjgb-23-42] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 02/20/2023] [Accepted: 02/20/2023] [Indexed: 07/20/2023] Open
Abstract
DNA repeat composition of low coverage (0.1-0.5) genomic libraries of four amphipods species endemic to Lake Baikal (East Siberia) and four endemic gastropod species of the fam. Baicaliidae have been compared to each other. In order to do so, a neighbor joining tree was inferred for each quartet of species (amphipods and mollusks) based on the ratio of repeat classes shared in each pair of species. The topology of this tree was compared to the phylogenies inferred for the same species from the concatenated protein-coding mitochondrial nucleotide sequences. In all species analyzed, the fraction of DNA repeats involved circa half of the genome. In relatively more ancient amphipods (most recent common ancestor, MRCA, existed approximately sixty millions years ago), the most abundant were species-specific repeats, while in much younger Baicaliidae (MRCA equal to ca. three millions years) most of the DNA repeats were shared among all four species. If the presence/absence of a repeat is regarded as a separate independent trait, and the ratio of shared to total numbers of repeats in a species pair is used as the measure of distance, the topology of the NJ tree is the same as the quartet phylogeny inferred for the mitogenomes protein coding nucleotide sequences. Meanwhile, in each group of species, a substantial number of repeats were detected pointing to the possibility of non-neutral evolution or a horizontal transfer between species occupying the same biotope. These repeats were shared by non-sister groups while being absent in the sister genomes. On the other hand, in such cases some traits of ecological significance were also shared.
Collapse
|
7
|
Sympatric Speciation in Mole Rats and Wild Barley and Their Genome Repeatome Evolution: A Commentary. ADVANCED GENETICS (HOBOKEN, N.J.) 2022; 3:2200009. [PMID: 36911292 PMCID: PMC9993473 DOI: 10.1002/ggn2.202200009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 07/16/2022] [Indexed: 11/05/2022]
Abstract
The theories of sympatric speciation (SS) and coding and noncoding (cd and ncd =repeatome) genome function are still contentious. Studies on SS in our two new models, "Evolution Canyon" and "Evolution Plateau", in Israel, divergent microclimatically and geologically-edaphically, respectively, indicated that in ecologically divergent microsites SS is a common speciation model across life from bacteria to mammals. Genomically, the intergenic ncd repeatome was and is still regarded by many biologists as "selfish," "junk," and non-functional. In contrast, it is considered by the encyclopedia of DNA elements discovery as biochemically functional and regulatory, and the transposable elements were considered earlier by Barbara McClintock as "controlling elements" of genes. Remarkably, it is found that repeated elements can statistically identify significantly, the five species of subterranean mole rats of Spalax ehrenbergi superspecies adapted to increasingly arid climatic trend southward in Israel. Moreover, it is first discovered in the SS studies in two distant taxa, subterranean mole rats and wild barley, and later also in spiny mice in Israel and subterranean zokors in China, that the noncoding repeatome is genomically mirroring the image of the protein-coding genome in divergent ecologies. It is shown that this mirroring image is statistically significant both within and between the ecologically divergent taxa supporting the hypothesis that much of the repeatome might be regulatory and selected as the protein-coding genome by the same ecological stresses.
Collapse
|
8
|
Corrigendum: Evolution of tandem repeats is mirroring post-polyploid cladogenesis in Heliophila (Brassicaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:1054800. [PMID: 36388541 PMCID: PMC9641311 DOI: 10.3389/fpls.2022.1054800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 10/04/2022] [Indexed: 06/16/2023]
Abstract
[This corrects the article DOI: 10.3389/fpls.2020.607893.].
Collapse
|
9
|
Aegilops crassa Boiss. repeatome characterized using low-coverage NGS as a source of new FISH markers: Application in phylogenetic studies of the Triticeae. FRONTIERS IN PLANT SCIENCE 2022; 13:980764. [PMID: 36325551 PMCID: PMC9621091 DOI: 10.3389/fpls.2022.980764] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 08/29/2022] [Indexed: 06/13/2023]
Abstract
Aegilops crassa Boiss. is polyploid grass species that grows in the eastern part of the Fertile Crescent, Afghanistan, and Middle Asia. It consists of tetraploid (4x) and hexaploid (6x) cytotypes (2n = 4x = 28, D1D (Abdolmalaki et al., 2019) XcrXcr and 2n = 6x = 42, D1D (Abdolmalaki et al., 2019) XcrXcrD2D (Adams and Wendel, 2005), respectively) that are similar morphologically. Although many Aegilops species were used in wheat breeding, the genetic potential of Ae. crassa has not yet been exploited due to its uncertain origin and significant genome modifications. Tetraploid Ae. crassa is thought to be the oldest polyploid Aegilops species, the subgenomes of which still retain some features of its ancient diploid progenitors. The D1 and D2 subgenomes of Ae. crassa were contributed by Aegilops tauschii (2n = 2x = 14, DD), while the Xcr subgenome donor is still unknown. Owing to its ancient origin, Ae. crassa can serve as model for studying genome evolution. Despite this, Ae. crassa is poorly studied genetically and no genome sequences were available for this species. We performed low-coverage genome sequencing of 4x and 6x cytotypes of Ae. crassa, and four Ae. tauschii accessions belonging to different subspecies; diploid wheatgrass Thinopyrum bessarabicum (Jb genome), which is phylogenetically close to D (sub)genome species, was taken as an outgroup. Subsequent data analysis using the pipeline RepeatExplorer2 allowed us to characterize the repeatomes of these species and identify several satellite sequences. Some of these sequences are novel, while others are found to be homologous to already known satellite sequences of Triticeae species. The copy number of satellite repeats in genomes of different species and their subgenome (D1 or Xcr) affinity in Ae. crassa were assessed by means of comparative bioinformatic analysis combined with quantitative PCR (qPCR). Fluorescence in situ hybridization (FISH) was performed to map newly identified satellite repeats on chromosomes of common wheat, Triticum aestivum, 4x and 6x Ae. crassa, Ae. tauschii, and Th. bessarabicum. The new FISH markers can be used in phylogenetic analyses of the Triticeae for chromosome identification and the assessment of their subgenome affinities and for evaluation of genome/chromosome constitution of wide hybrids or polyploid species.
Collapse
|
10
|
The relationship between transposable elements and ecological niches in the Greater Cape Floristic Region: A study on the genus Pteronia (Asteraceae). FRONTIERS IN PLANT SCIENCE 2022; 13:982852. [PMID: 36247607 PMCID: PMC9559566 DOI: 10.3389/fpls.2022.982852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 09/02/2022] [Indexed: 06/16/2023]
Abstract
Non-coding repetitive DNA (repeatome) is an active part of the nuclear genome, involved in its structure, evolution and function. It is dominated by transposable elements (TEs) and satellite DNA and is prone to the most rapid changes over time. The TEs activity presumably causes the global genome reorganization and may play an adaptive or regulatory role in response to environmental challenges. This assumption is applied here for the first time to plants from the Cape Floristic hotspot to determine whether changes in repetitive DNA are related to responses to a harsh, but extremely species-rich environment. The genus Pteronia (Asteraceae) serves as a suitable model group because it shows considerable variation in genome size at the diploid level and has high and nearly equal levels of endemism in the two main Cape biomes, Fynbos and Succulent Karoo. First, we constructed a phylogeny based on multiple low-copy genes that served as a phylogenetic framework for detecting quantitative and qualitative changes in the repeatome. Second, we performed a comparative analysis of the environments of two groups of Pteronia differing in their TEs bursts. Our results suggest that the environmental transition from the Succulent Karoo to the Fynbos is accompanied by TEs burst, which is likely also driving phylogenetic divergence. We thus hypothesize that analysis of rapidly evolving repeatome could serve as an important proxy for determining the molecular basis of lineage divergence in rapidly radiating groups.
Collapse
|
11
|
Idahoa and Subularia: Hidden polyploid origins of two enigmatic genera of crucifers. AMERICAN JOURNAL OF BOTANY 2022; 109:1273-1289. [PMID: 35912547 DOI: 10.1002/ajb2.16042] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 07/10/2022] [Accepted: 07/11/2022] [Indexed: 06/15/2023]
Abstract
PREMISE The monotypic Idahoa (I. scapigera) and the bispecific Subularia (S. aquatica and S. monticola) belong to Brassicaceae with unclear phylogenetic relationships and no tribal assignment. To fill this knowledge gap, we investigated these species and their closest relatives by combining cytogenomic and phylogenomic methods. METHODS We used whole plastome sequences in maximum likelihood and Bayesian inference analyses. We tested the phylogenetic informativeness of shared genomic repeats. We combined nuclear gene tree reconciliation and comparative chromosome painting (CCP) to examine the occurrence of past whole-genome duplications (WGDs). RESULTS The plastid data set corroborated the sister relationship between Idahoa and Subularia within the crucifer Lineage V but failed to resolve consistent topologies using both inference methods. The shared repetitive sequences provided conflicting pwhylogenetic signals. CCP analysis unexpectedly revealed that Idahoa (2n = 16) has a diploidized mesotetraploid genome, whereas two Subularia species (2n = 28 and 30) have diploidized mesoctoploid genomes. Several ancient allopolyploidy events have also been detected in closely related taxa (Chamira circaeoides, Cremolobeae, Eudemeae, and Notothlaspideae). CONCLUSIONS Our results suggest that the contentious phylogenetic placement of Idahoa and Subularia is best explained by two WGDs involving one or more shared parental genomes. The newly identified mesopolyploid genomes highlight the challenges of studying plant clades with complex polyploidy histories and provide a better framework for understanding genome evolution in the crucifer family.
Collapse
|
12
|
Human endogenous retrovirus-K (HERV-K) reverse transcriptase (RT) structure and biochemistry reveals remarkable similarities to HIV-1 RT and opportunities for HERV-K-specific inhibition. Proc Natl Acad Sci U S A 2022; 119:e2200260119. [PMID: 35771941 PMCID: PMC9271190 DOI: 10.1073/pnas.2200260119] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A large percentage of the human genome is composed of repetitive elements that are relics of past viral infections. Expression of these human endogenous retroviruses (HERVs) is associated with a variety of diseases, including cancer; however, causality remains to be established. A subset of these HERVs express proteins with reverse transcriptase (RT) activity. This has inspired several clinical studies of antiviral RT inhibitors for indications in which HERV expression is associated with disease. We have determined the X-ray structure of an HERV reverse transcriptase. This structure clarifies the reasons for poor inhibition by 3TC (lamivudine) and lack of inhibition by nonnucleoside inhibitors nevirapine and efavirenz. This structure will enable the design of selective HERV-K RT tools for drug target validation. Human endogenous retroviruses (HERVs) comprise nearly 8% of the human genome and are derived from ancient integrations of retroviruses into the germline. The biology of HERVs is poorly defined, but there is accumulating evidence supporting pathological roles in diverse diseases, such as cancer, autoimmune, and neurodegenerative diseases. Functional proteins are produced by HERV-encoded genes, including reverse transcriptases (RTs), which could be a contributor to the pathology attributed to aberrant HERV-K expression. To facilitate the discovery and development of HERV-K RT potent and selective inhibitors, we expressed active HERV-K RT and determined the crystal structure of a ternary complex of this enzyme with a double-stranded DNA substrate. We demonstrate a range of RT inhibition with antiretroviral nucleotide analogs, while classic nonnucleoside analogs do not inhibit HERV-K RT. Detailed comparisons of HERV-K RT with other known RTs demonstrate similarities to diverse RT families and a striking similarity to the HIV-1 RT asymmetric heterodimer. Our analysis further reveals opportunities for selective HERV-K RT inhibition.
Collapse
|
13
|
Evolutionary Dynamics of the Repeatome Explains Contrasting Differences in Genome Sizes and Hybrid and Polyploid Origins of Grass Loliinae Lineages. FRONTIERS IN PLANT SCIENCE 2022; 13:901733. [PMID: 35845705 PMCID: PMC9284676 DOI: 10.3389/fpls.2022.901733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/25/2022] [Indexed: 06/15/2023]
Abstract
The repeatome is composed of diverse families of repetitive DNA that keep signatures on the historical events that shaped the evolution of their hosting species. The cold seasonal Loliinae subtribe includes worldwide distributed taxa, some of which are the most important forage and lawn species (fescues and ray-grasses). The Loliinae are prone to hybridization and polyploidization. It has been observed a striking two-fold difference in genome size between the broad-leaved (BL) and fine-leaved (FL) Loliinae diploids and a general trend of genome reduction of some high polyploids. We have used genome skimming data to uncover the composition, abundance, and potential phylogenetic signal of repetitive elements across 47 representatives of the main Loliinae lineages. Independent and comparative analyses of repetitive sequences and of 5S rDNA loci were performed for all taxa under study and for four evolutionary Loliinae groups [Loliinae, Broad-leaved (BL), Fine-leaved (FL), and Schedonorus lineages]. Our data showed that the proportion of the genome covered by the repeatome in the Loliinae species was relatively high (average ∼ 51.8%), ranging from high percentages in some diploids (68.7%) to low percentages in some high-polyploids (30.7%), and that changes in their genome sizes were likely caused by gains or losses in their repeat elements. Ty3-gypsy Retand and Ty1-copia Angela retrotransposons were the most frequent repeat families in the Loliinae although the relatively more conservative Angela repeats presented the highest correlation of repeat content with genome size variation and the highest phylogenetic signal of the whole repeatome. By contrast, Athila retrotransposons presented evidence of recent proliferations almost exclusively in the Lolium clade. The repeatome evolutionary networks showed an overall topological congruence with the nuclear 35S rDNA phylogeny and a geographic-based structure for some lineages. The evolution of the Loliinae repeatome suggests a plausible scenario of recurrent allopolyploidizations followed by diploidizations that generated the large genome sizes of BL diploids as well as large genomic rearrangements in highly hybridogenous lineages that caused massive repeatome and genome contractions in the Schedonorus and Aulaxyper polyploids. Our study has contributed to disentangling the impact of the repeatome dynamics on the genome diversification and evolution of the Loliinae grasses.
Collapse
|
14
|
Genomes, repeatomes and interphase chromosome organization in the meadowfoam family (Limnanthaceae, Brassicales). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1462-1475. [PMID: 35352402 DOI: 10.1111/tpj.15750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 03/17/2022] [Accepted: 03/28/2022] [Indexed: 06/14/2023]
Abstract
The meadowfoam family (Limnanthaceae) is one of the smallest and genomically underexplored families of the Brassicales. The Limnanthaceae harbor about seven species in the genus Limnanthes (meadowfoam) and Floerkea proserpinacoides (false mermaidweed), all native to North America. Because all Limnanthes and Floerkea species have only five chromosome pairs, i.e., a chromosome number rare in Brassicales and shared with Arabidopsis thaliana (Arabidopsis), we examined the Limnanthaceae genomes as a potential model system. Using low-coverage whole-genome sequencing data, we reexamined phylogenetic relationships and characterized the repeatomes of Limnanthaceae genomes. Phylogenies based on complete chloroplast and 35S rDNA sequences corroborated the sister relationship between Floerkea and Limnanthes and two major clades in the latter genus. The genome size of Limnanthaceae species ranges from 1.5 to 2.1 Gb, apparently due to the large increase in DNA repeats, which constitute 60-70% of their genomes. Repeatomes are dominated by long terminal repeat retrotransposons, while tandem repeats represent only less than 0.5% of the genomes. The average chromosome size in Limnanthaceae species (340-420 Mb) is more than 10 times larger than in Arabidopsis (32 Mb). A three-dimensional fluorescence in situ hybridization analysis demonstrated that the five chromosome pairs in interphase nuclei of Limnanthes species adopt the Rabl-like configuration.
Collapse
|
15
|
Integration of Genomic and Cytogenetic Data on Tandem DNAs for Analyzing the Genome Diversity Within the Genus Hedysarum L. (Fabaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:865958. [PMID: 35574118 PMCID: PMC9101955 DOI: 10.3389/fpls.2022.865958] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 03/28/2022] [Indexed: 06/15/2023]
Abstract
The section Multicaulia is the largest clade in the genus Hedysarum L. (Fabaceae). Representatives of the sect. Multicaulia are valuable plants used for medicinal and fodder purposes. The taxonomy and phylogeny of the sect. Multicaulia are still ambiguous. To clarify the species relationships within sect. Multicaulia, we, for the first time, explored repeatomes of H. grandiflorum Pall., H. zundukii Peschkova, and H. dahuricum Turcz. using next-generation sequencing technologies and a subsequent bioinformatic analysis by RepeatExplorer/TAREAN pipelines. The comparative repeatome analysis showed that mobile elements made up 20-24% (Class I) and about 2-2.5% (Class II) of their repetitive DNAs. The amount of ribosomal DNA varied from 1 to 2.6%, and the content of satellite DNA ranged from 2.7 to 5.1%. For each species, five high confident putative tandem DNA repeats and 5-10 low confident putative DNA repeats were identified. According to BLAST, these repeats demonstrated high sequence similarity within the studied species. FISH-based mapping of 35S rDNA, 5S rDNA, and satDNAs made it possible to detect new effective molecular chromosome markers for Hedysarum species and construct the species karyograms. Comparison of the patterns of satDNA localization on chromosomes of the studied species allowed us to assess genome diversity within the sect. Multicaulia. In all studied species, we revealed intra- and interspecific variabilities in patterns of the chromosomal distribution of molecular chromosome markers. In H. gmelinii Ledeb. and H. setigerum Turcz. ex Fisch. et Meyer, similar subgenomes were detected, which confirmed the polyploid status of their genomes. Our findings demonstrated a close genomic relationship among six studied species indicating their common origin and confirmed the taxonomic status of H. setigerum as a subspecies of H. gmelinii as well as the validity of combining the sect. Multicaulia and Subacaulia into one sect. Multicaulia.
Collapse
|
16
|
Power and Weakness of Repetition - Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization ( Rosa and Fragaria). FRONTIERS IN PLANT SCIENCE 2021; 12:738119. [PMID: 34950159 PMCID: PMC8688825 DOI: 10.3389/fpls.2021.738119] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 11/08/2021] [Indexed: 06/14/2023]
Abstract
Plant genomes consist, to a considerable extent, of non-coding repetitive DNA. Several studies showed that phylogenetic signals can be extracted from such repeatome data by using among-species dissimilarities from the RepeatExplorer2 pipeline as distance measures. Here, we advanced this approach by adjusting the read input for comparative clustering indirectly proportional to genome size and by summarizing all clusters into a main distance matrix subjected to Neighbor Joining algorithms and Principal Coordinate Analyses. Thus, our multivariate statistical method works as a "repeatomic fingerprint," and we proved its power and limitations by exemplarily applying it to the family Rosaceae at intrafamilial and, in the genera Fragaria and Rosa, at the intrageneric level. Since both taxa are prone to hybridization events, we wanted to show whether repeatome data are suitable to unravel the origin of natural and synthetic hybrids. In addition, we compared the results based on complete repeatomes with those from ribosomal DNA clusters only, because they represent one of the most widely used barcoding markers. Our results demonstrated that repeatome data contained a clear phylogenetic signal supporting the current subfamilial classification within Rosaceae. Accordingly, the well-accepted major evolutionary lineages within Fragaria were distinguished, and hybrids showed intermediate positions between parental species in data sets retrieved from both complete repeatomes and rDNA clusters. Within the taxonomically more complicated and particularly frequently hybridizing genus Rosa, we detected rather weak phylogenetic signals but surprisingly found a geographic pattern at a population scale. In sum, our method revealed promising results at larger taxonomic scales as well as within taxa with manageable levels of reticulation, but success remained rather taxon specific. Since repeatomes can be technically easy and comparably inexpensively retrieved even from samples of rather poor DNA quality, our phylogenomic method serves as a valuable alternative when high-quality genomes are unavailable, for example, in the case of old museum specimens.
Collapse
|
17
|
Repeatome-Based Phylogenetics in Pelargonium Section Ciconium (Sweet) Harvey. Genome Biol Evol 2021; 13:6454096. [PMID: 34893846 PMCID: PMC8684485 DOI: 10.1093/gbe/evab269] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 12/23/2022] Open
Abstract
The repetitive part of the genome (the repeatome) contains a wealth of often overlooked information that can be used to resolve phylogenetic relationships and test evolutionary hypotheses for clades of related plant species such as Pelargonium. We have generated genome skimming data for 18 accessions of Pelargonium section Ciconium and one outgroup. We analyzed repeat abundancy and repeat similarity in order to construct repeat profiles and then used these for phylogenetic analyses. We found that phylogenetic trees based on read similarity were largely congruent with previous work based on morphological and chloroplast sequence data. For example, results agreed in identifying a “Core Ciconium” group which evolved after the split with P. elongatum. We found that this group was characterized by a unique set of repeats, which confirmed currently accepted phylogenetic hypotheses. We also found four species groups within P. sect. Ciconium that reinforce previous plastome-based reconstructions. A second repeat expansion was identified in a subclade which contained species that are considered to have dispersed from Southern Africa into Eastern Africa and the Arabian Peninsula. We speculate that the Core Ciconium repeat set correlates with a possible WGD event leading to this branch.
Collapse
|
18
|
The Tetragnatha kauaiensis Genome Sheds Light on the Origins of Genomic Novelty in Spiders. Genome Biol Evol 2021; 13:evab262. [PMID: 34849853 PMCID: PMC8693713 DOI: 10.1093/gbe/evab262] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 01/07/2023] Open
Abstract
Spiders (Araneae) have a diverse spectrum of morphologies, behaviors, and physiologies. Attempts to understand the genomic-basis of this diversity are often hindered by their large, heterozygous, and AT-rich genomes with high repeat content resulting in highly fragmented, poor-quality assemblies. As a result, the key attributes of spider genomes, including gene family evolution, repeat content, and gene function, remain poorly understood. Here, we used Illumina and Dovetail Chicago technologies to sequence the genome of the long-jawed spider Tetragnatha kauaiensis, producing an assembly distributed along 3,925 scaffolds with an N50 of ∼2 Mb. Using comparative genomics tools, we explore genome evolution across available spider assemblies. Our findings suggest that the previously reported and vast genome size variation in spiders is linked to the different representation and number of transposable elements. Using statistical tools to uncover gene-family level evolution, we find expansions associated with the sensory perception of taste, immunity, and metabolism. In addition, we report strikingly different histories of chemosensory, venom, and silk gene families, with the first two evolving much earlier, affected by the ancestral whole genome duplication in Arachnopulmonata (∼450 Ma) and exhibiting higher numbers. Together, our findings reveal that spider genomes are highly variable and that genomic novelty may have been driven by the burst of an ancient whole genome duplication, followed by gene family and transposable element expansion.
Collapse
|
19
|
Editorial: Chromosomal Evolution in Plants. FRONTIERS IN PLANT SCIENCE 2021; 12:726330. [PMID: 34394175 PMCID: PMC8360229 DOI: 10.3389/fpls.2021.726330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 07/07/2021] [Indexed: 06/13/2023]
|
20
|
Ancient Origin of Two 5S rDNA Families Dominating in the Genus Rosa and Their Behavior in the Canina-Type Meiosis. FRONTIERS IN PLANT SCIENCE 2021; 12:643548. [PMID: 33763100 PMCID: PMC7984461 DOI: 10.3389/fpls.2021.643548] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 02/15/2021] [Indexed: 05/02/2023]
Abstract
The genus Rosa comprises more than 100 woody species characterized by intensive hybridization, introgression, and an overall complex evolutionary history. Besides many diploid species (2n = 2x = 14) polyploids ranging from 3x to 10x are frequently found. Here we analyzed 5S ribosomal DNA in 19 species covering two subgenera and the major sections within subg. Rosa. In addition to diploids and polyploids with regular meiosis, we focused on 5x dogroses (Rosa sect. Caninae), which exhibit an asymmetric meiosis differentiating between bivalent- and univalent-forming chromosomes. Using genomic resources, we reconstructed 5S rDNA units to reveal their phylogenetic relationships. Additionally, we designed locus-specific probes derived from intergenic spacers (IGSs) and determined the position and number of 5S rDNA families on chromosomes. Two major 5S rDNA families (termed 5S_A and 5S_B, respectively) were found at variable ratios in both diploid and polyploid species including members of the early diverging subgenera, Rosa persica and Rosa minutifolia. Within subg. Rosa species of sect. Rosa amplified the 5S_A variant only, while taxa of other sections contained both variants at variable ratios. The 5S_B family was often co-localized with 35S rDNA at the nucleolar organizer regions (NOR) chromosomes, whereas the co-localization of the 5S_A family with NOR was only exceptionally observed. The allo-pentaploid dogroses showed a distinct distribution of 5S rDNA families between bivalent- and univalent-forming chromosomes. In conclusion, two divergent 5S rDNA families dominate rose genomes. Both gene families apparently arose in the early history of the genus, already 30 myrs ago, and apparently survived numerous speciation events thereafter. These observations are consistent with a relatively slow genome turnover in the Rosa genus.
Collapse
|
21
|
Signature changes in the expressions of protein-coding genes, lncRNAs, and repeat elements in early and late cellular senescence. ACTA ACUST UNITED AC 2021; 44:356-370. [PMID: 33402863 PMCID: PMC7759191 DOI: 10.3906/biy-2005-21] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 08/24/2020] [Indexed: 12/13/2022]
Abstract
Replicative cellular senescence is the main cause of aging. It is important to note that early senescence is linked to tissue regeneration, whereas late senescence is known to trigger a chronically inflammatory phenotype. Despite the presence of various genome-wide studies, there is a lack of information on distinguishing early and late senescent phenotypes at the transcriptome level. Particularly, the changes in the noncoding RNA portion of the aging cell have not been fully elucidated. By utilising RNA sequencing data of fibroblasts, hereby, we are not only reporting changes in gene expression profiles and relevant biological processes in the early and late senescent phenotypes but also presenting significant differences in the expressions of many unravelled long noncoding RNAs (lncRNAs) and transcripts arisen from repetitive DNA. Our results indicate that, in addition to previously reported L1 elements, various LTR and DNA transposons, as well as members of the classical satellites including HSAT5 and α-satellites (ALR/Alpha), are expressed at higher levels in late senescence. Moreover, we revealed finer links between the expression levels of repeats with the genes located near them and known to be involved in cell cycle and senescence. Noncoding elements reported here provide a new perspective to be explored in further experimental studies.
Collapse
|
22
|
Genome evolution of blind subterranean mole rats: Adaptive peripatric versus sympatric speciation. Proc Natl Acad Sci U S A 2020; 117:32499-32508. [PMID: 33277437 DOI: 10.1073/pnas.2018123117] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Speciation mechanisms remain controversial. Two speciation models occur in Israeli subterranean mole rats, genus Spalax: a regional speciation cline southward of four peripatric climatic chromosomal species and a local, geologic-edaphic, genic, and sympatric speciation. Here we highlight their genome evolution. The five species were separated into five genetic clusters by single nucleotide polymorphisms, copy number variations (CNVs), repeatome, and methylome in sympatry. The regional interspecific divergence correspond to Pleistocene climatic cycles. Climate warmings caused chromosomal speciation. Triple effective population size, N e , declines match glacial cold cycles. Adaptive genes evolved under positive selection to underground stresses and to divergent climates, involving interspecies reproductive isolation. Genomic islands evolved mainly due to adaptive evolution involving ancient polymorphisms. Repeatome, including both CNV and LINE1 repetitive elements, separated the five species. Methylation in sympatry identified geologically chalk-basalt species that differentially affect thermoregulation, hypoxia, DNA repair, P53, and other pathways. Genome adaptive evolution highlights climatic and geologic-edaphic stress evolution and the two speciation models, peripatric and sympatric.
Collapse
|
23
|
Characterization and Dynamics of Repeatomes in Closely Related Species of Hieracium (Asteraceae) and Their Synthetic and Apomictic Hybrids. FRONTIERS IN PLANT SCIENCE 2020; 11:591053. [PMID: 33224172 PMCID: PMC7667050 DOI: 10.3389/fpls.2020.591053] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 10/09/2020] [Indexed: 05/05/2023]
Abstract
The repetitive content of the plant genome (repeatome) often represents its largest fraction and is frequently correlated with its size. Transposable elements (TEs), the main component of the repeatome, are an important driver in the genome diversification due to their fast-evolving nature. Hybridization and polyploidization events are hypothesized to induce massive bursts of TEs resulting, among other effects, in an increase of copy number and genome size. Little is known about the repeatome dynamics following hybridization and polyploidization in plants that reproduce by apomixis (asexual reproduction via seeds). To address this, we analyzed the repeatomes of two diploid parental species, Hieracium intybaceum and H. prenanthoides (sexual), their diploid F1 synthetic and their natural triploid hybrids (H. pallidiflorum and H. picroides, apomictic). Using low-coverage next-generation sequencing (NGS) and a graph-based clustering approach, we detected high overall similarity across all major repeatome categories between the parental species, despite their large phylogenetic distance. Medium and highly abundant repetitive elements comprise ∼70% of Hieracium genomes; most prevalent were Ty3/Gypsy chromovirus Tekay and Ty1/Copia Maximus-SIRE elements. No TE bursts were detected, neither in synthetic nor in natural hybrids, as TE abundance generally followed theoretical expectations based on parental genome dosage. Slight over- and under-representation of TE cluster abundances reflected individual differences in genome size. However, in comparative analyses, apomicts displayed an overabundance of pararetrovirus clusters not observed in synthetic hybrids. Substantial deviations were detected in rDNAs and satellite repeats, but these patterns were sample specific. rDNA and satellite repeats (three of them were newly developed as cytogenetic markers) were localized on chromosomes by fluorescence in situ hybridization (FISH). In a few cases, low-abundant repeats (5S rDNA and certain satellites) showed some discrepancy between NGS data and FISH results, which is due partly to the bias of low-coverage sequencing and partly to low amounts of the satellite repeats or their sequence divergence. Overall, satellite DNA (including rDNA) was markedly affected by hybridization, but independent of the ploidy or reproductive mode of the progeny, whereas bursts of TEs did not play an important role in the evolutionary history of Hieracium.
Collapse
|
24
|
Comparative Analysis of Genomic Repeat Content in Gomphocerine Grasshoppers Reveals Expansion of Satellite DNA and Helitrons in Species with Unusually Large Genomes. Genome Biol Evol 2020; 12:1180-1193. [PMID: 32539114 PMCID: PMC7486953 DOI: 10.1093/gbe/evaa119] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/09/2020] [Indexed: 12/12/2022] Open
Abstract
Eukaryotic organisms vary widely in genome size and much of this variation can be explained by differences in the abundance of repetitive elements. However, the phylogenetic distributions and turnover rates of repetitive elements are largely unknown, particularly for species with large genomes. We therefore used de novo repeat identification based on low coverage whole-genome sequencing to characterize the repeatomes of six species of gomphocerine grasshoppers, an insect clade characterized by unusually large and variable genome sizes. Genome sizes of the six species ranged from 8.4 to 14.0 pg DNA per haploid genome and thus include the second largest insect genome documented so far (with the largest being another acridid grasshopper). Estimated repeat content ranged from 79% to 96% and was strongly correlated with genome size. Averaged over species, these grasshopper repeatomes comprised significant amounts of DNA transposons (24%), LINE elements (21%), helitrons (13%), LTR retrotransposons (12%), and satellite DNA (8.5%). The contribution of satellite DNA was particularly variable (ranging from <1% to 33%) as was the contribution of helitrons (ranging from 7% to 20%). The age distribution of divergence within clusters was unimodal with peaks ∼4-6%. The phylogenetic distribution of repetitive elements was suggestive of an expansion of satellite DNA in the lineages leading to the two species with the largest genomes. Although speculative at this stage, we suggest that the expansion of satellite DNA could be secondary and might possibly have been favored by selection as a means of stabilizing greatly expanded genomes.
Collapse
|
25
|
Asymmetrical canina meiosis is accompanied by the expansion of a pericentromeric satellite in non-recombining univalent chromosomes in the genus Rosa. ANNALS OF BOTANY 2020; 125:1025-1038. [PMID: 32095807 PMCID: PMC7262465 DOI: 10.1093/aob/mcaa028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 02/24/2020] [Indexed: 05/02/2023]
Abstract
BACKGROUND AND AIMS Despite their abundant odd-ploidy (2n = 5x = 35), dogroses (Rosa sect. Caninae) are capable of sexual reproduction due to their unique meiosis. During canina meiosis, two sets of chromosomes form bivalents and are transmitted by male and female gametes, whereas the remaining chromosomes form univalents and are exclusively transmitted by the egg cells. Thus, the evolution of chromosomes is expected to be driven by their behaviour during meiosis. METHODS To gain insight into differential chromosome evolution, fluorescence in situ hybridization was conducted for mitotic and meiotic chromosomes in four dogroses (two subsections) using satellite and ribosomal DNA probes. By exploiting high-throughput sequencing data, we determined the abundance and diversity of the satellite repeats in the genus Rosa by analysing 20 pentaploid, tetraploid and diploid species in total. KEY RESULTS A pericentromeric satellite repeat, CANR4, was found in all members of the genus Rosa, including the basal subgenera Hulthemia and Hesperhodos. The satellite was distributed across multiple chromosomes (5-20 sites per mitotic cell), and its genomic abundance was higher in pentaploid dogroses (2.3 %) than in non-dogrose species (1.3 %). In dogrose meiosis, univalent chromosomes were markedly enriched in CANR4 repeats based on both the number and the intensity of the signals compared to bivalent-forming chromosomes. Single-nucleotide polymorphisms and cluster analysis revealed high intragenomic homogeneity of the satellite in dogrose genomes. CONCLUSIONS The CANR4 satellite arose early in the evolution of the genus Rosa. Its high content and extraordinary homogeneity in dogrose genomes is explained by its recent amplification in non-recombining chromosomes. We hypothesize that satellite DNA expansion may contribute to the divergence of univalent chromosomes in Rosa species with non-symmetrical meiosis.
Collapse
|
26
|
The Utility of Graph Clustering of 5S Ribosomal DNA Homoeologs in Plant Allopolyploids, Homoploid Hybrids, and Cryptic Introgressants. FRONTIERS IN PLANT SCIENCE 2020; 11:41. [PMID: 32117380 PMCID: PMC7025596 DOI: 10.3389/fpls.2020.00041] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 01/13/2020] [Indexed: 05/18/2023]
Abstract
INTRODUCTION Ribosomal DNA (rDNA) loci have been widely used for identification of allopolyploids and hybrids, although few of these studies employed high-throughput sequencing data. Here we use graph clustering implemented in the RepeatExplorer (RE) pipeline to analyze homoeologous 5S rDNA arrays at the genomic level searching for hybridogenic origin of species. Data were obtained from more than 80 plant species, including several well-defined allopolyploids and homoploid hybrids of different evolutionary ages and from widely dispersed taxonomic groups. RESULTS (i) Diploids show simple circular-shaped graphs of their 5S rDNA clusters. In contrast, most allopolyploids and other interspecific hybrids exhibit more complex graphs composed of two or more interconnected loops representing intergenic spacers (IGS). (ii) There was a relationship between graph complexity and locus numbers. (iii) The sequences and lengths of the 5S rDNA units reconstituted in silico from k-mers were congruent with those experimentally determined. (iv) Three-genomic comparative cluster analysis of reads from allopolyploids and progenitor diploids allowed identification of homoeologous 5S rRNA gene families even in relatively ancient (c. 1 Myr) Gossypium and Brachypodium allopolyploids which already exhibit uniparental partial loss of rDNA repeats. (v) Finally, species harboring introgressed genomes exhibit exceptionally complex graph structures. CONCLUSION We found that the cluster graph shapes and graph parameters (k-mer coverage scores and connected component index) well-reflect the organization and intragenomic homogeneity of 5S rDNA repeats. We propose that the analysis of 5S rDNA cluster graphs computed by the RE pipeline together with the cytogenetic analysis might be a reliable approach for the determination of the hybrid or allopolyploid plant species parentage and may also be useful for detecting historical introgression events.
Collapse
|
27
|
Homology-Free Detection of Transposable Elements Unveils Their Dynamics in Three Ecologically Distinct Rhodnius Species. Genes (Basel) 2020; 11:genes11020170. [PMID: 32041215 PMCID: PMC7073582 DOI: 10.3390/genes11020170] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 01/30/2020] [Indexed: 01/09/2023] Open
Abstract
Transposable elements (TEs) are widely distributed repetitive sequences in the genomes across the tree of life, and represent an important source of genetic variability. Their distribution among genomes is specific to each lineage. A phenomenon associated with this feature is the sudden expansion of one or several TE families, called bursts of transposition. We previously proposed that bursts of the Mariner family (DNA transposons) contributed to the speciation of Rhodnius prolixus Stål, 1859. This hypothesis motivated us to study two additional species of the R. prolixus complex: Rhodnius montenegrensis da Rosa et al., 2012 and Rhodnius marabaensis Souza et al., 2016, together with a new, de novo annotation of the R. prolixus repeatome using unassembled short reads. Our analysis reveals that the total amount of TEs present in Rhodnius genomes (19% to 23.5%) is three to four times higher than that expected based on the original quantifications performed for the original genome description of R. prolixus. We confirm here that the repeatome of the three species is dominated by Class II elements of the superfamily Tc1-Mariner, as well as members of the LINE order (Class I). In addition to R. prolixus, we also identified a recent burst of transposition of the Mariner family in R. montenegrensis and R. marabaensis, suggesting that this phenomenon may not be exclusive to R. prolixus. Rather, we hypothesize that whilst the expansion of Mariner elements may have contributed to the diversification of the R. prolixus-R. robustus species complex, the distinct ecological characteristics of these new species did not drive the general evolutionary trajectories of these TEs.
Collapse
|
28
|
Evolutionary history and genetic diversity of apomictic allopolyploids in Hieracium s.str.: morphological versus genomic features. AMERICAN JOURNAL OF BOTANY 2020; 107:66-90. [PMID: 31903548 DOI: 10.1002/ajb2.1413] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2019] [Accepted: 11/13/2019] [Indexed: 05/02/2023]
Abstract
PREMISE The origin of allopolyploids is believed to shape their evolutionary potential, ecology, and geographical ranges. Morphologically distinct apomictic types sharing the same parental species belong to the most challenging groups of polyploids. We evaluated the origins and variation of two triploid taxa (Hieracium pallidiflorum, H. picroides) presumably derived from the same diploid parental pair (H. intybaceum, H. prenanthoides). METHODS We used a suite of approaches ranging from morphological, phylogenetic (three unlinked molecular markers), and cytogenetic analyses (in situ hybridization) to genome size screening and genome skimming. RESULTS Genotyping proved the expected parentage of all analyzed accessions of H. pallidiflorum and H. picroides and revealed that nearly all of them originated independently. Genome sizes and genome dosage largely corresponded to morphology, whereas the maternal origin of the allopolyploids had no discernable effect. Polyploid accessions of both parental species usually contained genetic material from other species. Given the phylogenetic distance of the parents, their chromosomes appeared only weakly differentiated in genomic in situ hybridization (GISH), as well as in overall comparisons of the repetitive fraction of their genomes. Furthermore, the repeatome of a phylogenetically more closely related species (H. umbellatum) differed significantly more. CONCLUSIONS We proved (1) multiple origins of hybridogeneous apomicts from the same diploid parental taxa, and (2) allopolyploid origins of polyploid accessions of the parental species. We also showed that the evolutionary dynamics of very fast evolving markers such as satellite DNA or transposable elements does not necessarily follow patterns of speciation.
Collapse
|
29
|
Evolution of Tandem Repeats Is Mirroring Post-polyploid Cladogenesis in Heliophila (Brassicaceae). FRONTIERS IN PLANT SCIENCE 2020; 11:607893. [PMID: 33510751 PMCID: PMC7835680 DOI: 10.3389/fpls.2020.607893] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 11/16/2020] [Indexed: 05/02/2023]
Abstract
The unigeneric tribe Heliophileae encompassing more than 100 Heliophila species is morphologically the most diverse Brassicaceae lineage. The tribe is endemic to southern Africa, confined chiefly to the southwestern South Africa, home of two biodiversity hotspots (Cape Floristic Region and Succulent Karoo). The monospecific Chamira (C. circaeoides), the only crucifer species with persistent cotyledons, is traditionally retrieved as the closest relative of Heliophileae. Our transcriptome analysis revealed a whole-genome duplication (WGD) ∼26.15-29.20 million years ago, presumably preceding the Chamira/Heliophila split. The WGD was then followed by genome-wide diploidization, species radiations, and cladogenesis in Heliophila. The expanded phylogeny based on nuclear ribosomal DNA internal transcribed spacer (ITS) uncovered four major infrageneric clades (A-D) in Heliophila and corroborated the sister relationship between Chamira and Heliophila. Herein, we analyzed how the diploidization process impacted the evolution of repetitive sequences through low-coverage whole-genome sequencing of 15 Heliophila species, representing the four clades, and Chamira. Despite the firmly established infrageneric cladogenesis and different ecological life histories (four perennials vs. 11 annual species), repeatome analysis showed overall comparable evolution of genome sizes (288-484 Mb) and repeat content (25.04-38.90%) across Heliophila species and clades. Among Heliophila species, long terminal repeat (LTR) retrotransposons were the predominant components of the analyzed genomes (11.51-22.42%), whereas tandem repeats had lower abundances (1.03-12.10%). In Chamira, the tandem repeat content (17.92%, 16 diverse tandem repeats) equals the abundance of LTR retrotransposons (16.69%). Among the 108 tandem repeats identified in Heliophila, only 16 repeats were found to be shared among two or more species; no tandem repeats were shared by Chamira and Heliophila genomes. Six "relic" tandem repeats were shared between any two different Heliophila clades by a common descent. Four and six clade-specific repeats shared among clade A and C species, respectively, support the monophyly of these two clades. Three repeats shared by all clade A species corroborate the recent diversification of this clade revealed by plastome-based molecular dating. Phylogenetic analysis based on repeat sequence similarities separated the Heliophila species to three clades [A, C, and (B+D)], mirroring the post-polyploid cladogenesis in Heliophila inferred from rDNA ITS and plastome sequences.
Collapse
|
30
|
Insights Into an Unexplored Component of the Mosquito Repeatome: Distribution and Variability of Viral Sequences Integrated Into the Genome of the Arboviral Vector Aedes albopictus. Front Genet 2019; 10:93. [PMID: 30809249 PMCID: PMC6379468 DOI: 10.3389/fgene.2019.00093] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 01/29/2019] [Indexed: 01/01/2023] Open
Abstract
The Asian tiger mosquito Aedes albopictus is an invasive mosquito and a competent vector for public-health relevant arboviruses such as Chikungunya (Alphavirus), Dengue and Zika (Flavivirus) viruses. Unexpectedly, the sequencing of the genome of this mosquito revealed an unusually high number of integrated sequences with similarities to non-retroviral RNA viruses of the Flavivirus and Rhabdovirus genera. These Non-retroviral Integrated RNA Virus Sequences (NIRVS) are enriched in piRNA clusters and coding sequences and have been proposed to constitute novel mosquito immune factors. However, given the abundance of NIRVS and their variable viral origin, their relative biological roles remain unexplored. Here we used an analytical approach that intersects computational, evolutionary and molecular methods to study the genomic landscape of mosquito NIRVS. We demonstrate that NIRVS are differentially distributed across mosquito genomes, with a core set of seemingly the oldest integrations with similarity to Rhabdoviruses. Additionally, we compare the polymorphisms of NIRVS with respect to that of fast and slow-evolving genes within the Ae. albopictus genome. Overall, NIRVS appear to be less polymorphic than slow-evolving genes, with differences depending on whether they occur in intergenic regions or in piRNA clusters. Finally, two NIRVS that map within the coding sequences of genes annotated as Rhabdovirus RNA-dependent RNA polymerase and the nucleocapsid-encoding gene, respectively, are highly polymorphic and are expressed, suggesting exaptation possibly to enhance the mosquito's antiviral responses. These results greatly advance our understanding of the complexity of the mosquito repeatome and the biology of viral integrations in mosquito genomes.
Collapse
|