1
|
Despot-Slade E, Širca S, Mravinac B, Castagnone-Sereno P, Plohl M, Meštrović N. Satellitome analyses in nematodes illuminate complex species history and show conserved features in satellite DNAs. BMC Biol 2022; 20:259. [PMCID: PMC9673304 DOI: 10.1186/s12915-022-01460-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 11/07/2022] [Indexed: 11/19/2022] Open
Abstract
Abstract
Background
Satellite DNAs (satDNAs) are tandemly repeated non-coding DNA sequences that belong to the most abundant and the fastest evolving parts of the eukaryotic genome. A satellitome represents the collection of different satDNAs in a genome. Due to extreme diversity and methodological difficulties to characterize and compare satDNA collection in complex genomes, knowledge on their putative functional constraints and capacity to participate in genome evolution remains rather elusive. SatDNA transcripts have been detected in many species, however comparative studies of satDNA transcriptome between species are extremely rare.
Results
We conducted a genome-wide survey and comparative analyses of satellitomes among different closely related Meloidogyne spp. nematodes. The evolutionary trends of satDNAs suggest that each round of proposed polyploidization in the evolutionary history is concomitant with the addition of a new set of satDNAs in the satellitome of any particular Meloidogyne species. Successive incorporation of new sets of satDNAs in the genome along the process of polyploidization supports multiple hybridization events as the main factor responsible for the formation of these species. Through comparative analyses of 83 distinct satDNAs, we found a CENP-B box-like sequence motif conserved among 11 divergent satDNAs (similarity ranges from 36 to 74%). We also found satDNAs that harbor a splice leader (SL) sequence which, in spite of overall divergence, shows conservation across species in two putative functional regions, the 25-nt SL exon and the Sm binding site. Intra- and interspecific comparative expression analyses of the complete satDNA set in the analyzed Meloidogyne species revealed transcription profiles including a subset of 14 actively transcribed satDNAs. Among those, 9 show active transcription in every species where they are found in the genome and throughout developmental stages.
Conclusions
Our results demonstrate the feasibility and power of comparative analysis of the non-coding repetitive genome for elucidation of the origin of species with a complex history. Although satDNAs generally evolve extremely quickly, the comparative analyses of 83 satDNAs detected in the analyzed Meloidogyne species revealed conserved sequence features in some satDNAs suggesting sequence evolution under selective pressure. SatDNAs that are actively transcribed in related genomes and throughout nematode development support the view that their expression is not stochastic.
Collapse
|
2
|
Herklotz V, Kovařík A, Wissemann V, Lunerová J, Vozárová R, Buschmann S, Olbricht K, Groth M, Ritz CM. Power and Weakness of Repetition - Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization ( Rosa and Fragaria). FRONTIERS IN PLANT SCIENCE 2021; 12:738119. [PMID: 34950159 PMCID: PMC8688825 DOI: 10.3389/fpls.2021.738119] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 11/08/2021] [Indexed: 06/14/2023]
Abstract
Plant genomes consist, to a considerable extent, of non-coding repetitive DNA. Several studies showed that phylogenetic signals can be extracted from such repeatome data by using among-species dissimilarities from the RepeatExplorer2 pipeline as distance measures. Here, we advanced this approach by adjusting the read input for comparative clustering indirectly proportional to genome size and by summarizing all clusters into a main distance matrix subjected to Neighbor Joining algorithms and Principal Coordinate Analyses. Thus, our multivariate statistical method works as a "repeatomic fingerprint," and we proved its power and limitations by exemplarily applying it to the family Rosaceae at intrafamilial and, in the genera Fragaria and Rosa, at the intrageneric level. Since both taxa are prone to hybridization events, we wanted to show whether repeatome data are suitable to unravel the origin of natural and synthetic hybrids. In addition, we compared the results based on complete repeatomes with those from ribosomal DNA clusters only, because they represent one of the most widely used barcoding markers. Our results demonstrated that repeatome data contained a clear phylogenetic signal supporting the current subfamilial classification within Rosaceae. Accordingly, the well-accepted major evolutionary lineages within Fragaria were distinguished, and hybrids showed intermediate positions between parental species in data sets retrieved from both complete repeatomes and rDNA clusters. Within the taxonomically more complicated and particularly frequently hybridizing genus Rosa, we detected rather weak phylogenetic signals but surprisingly found a geographic pattern at a population scale. In sum, our method revealed promising results at larger taxonomic scales as well as within taxa with manageable levels of reticulation, but success remained rather taxon specific. Since repeatomes can be technically easy and comparably inexpensively retrieved even from samples of rather poor DNA quality, our phylogenomic method serves as a valuable alternative when high-quality genomes are unavailable, for example, in the case of old museum specimens.
Collapse
Affiliation(s)
- Veit Herklotz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
| | - Aleš Kovařík
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
| | - Volker Wissemann
- Institute of Botany, Systematic Botany Group, Justus-Liebig-University, Gießen, Germany
| | - Jana Lunerová
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
| | - Radka Vozárová
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Sebastian Buschmann
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | | | - Marco Groth
- CF DNA Sequencing, Leibniz Institute on Aging – Fritz Lipmann Institute, Jena, Germany
| | - Christiane M. Ritz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
- Chair of Biodiversity of Higher Plants, Technische Universität Dresden, International Institute (IHI) Zittau, Zittau, Germany
| |
Collapse
|
3
|
Costa L, Marques A, Buddenhagen C, Thomas WW, Huettel B, Schubert V, Dodsworth S, Houben A, Souza G, Pedrosa-Harand A. Aiming off the target: recycling target capture sequencing reads for investigating repetitive DNA. ANNALS OF BOTANY 2021; 128:835-848. [PMID: 34050647 PMCID: PMC8577205 DOI: 10.1093/aob/mcab063] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/26/2021] [Indexed: 05/28/2023]
Abstract
BACKGROUND AND AIMS With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data. METHODS Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree. KEY RESULTS All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. CONCLUSIONS Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
Collapse
Affiliation(s)
- Lucas Costa
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| | - André Marques
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | | | - Bruno Huettel
- Max Planck Genome Centre Cologne, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | | | - Andreas Houben
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| |
Collapse
|
4
|
Schwartz JH. Evolution, systematics, and the unnatural history of mitochondrial DNA. Mitochondrial DNA A DNA Mapp Seq Anal 2021; 32:126-151. [PMID: 33818247 DOI: 10.1080/24701394.2021.1899165] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The tenets underlying the use of mtDNA in phylogenetic and systematic analyses are strict maternal inheritance, clonality, homoplasmy, and difference due to mutation: that is, there are species-specific mtDNA sequences and phylogenetic reconstruction is a matter of comparing these sequences and inferring closeness of relatedness from the degree of sequence similarity. Yet, how mtDNA behavior became so defined is mysterious. Even though early studies of fertilization demonstrated for most animals that not only the head, but the sperm's tail and mitochondria-bearing midpiece penetrate the egg, the opposite - only the head enters the egg - became fact, and mtDNA conceived as maternally transmitted. When midpiece/tail penetration was realized as true, the conceptions 'strict maternal inheritance', etc., and their application to evolutionary endeavors, did not change. Yet there is mounting evidence of paternal mtDNA transmission, paternal and maternal combination, intracellular recombination, and intra- and intercellular heteroplasmy. Clearly, these phenomena impact the systematic and phylogenetic analysis of mtDNA sequences.
Collapse
Affiliation(s)
- Jeffrey H Schwartz
- Department of Anthropology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
5
|
Negm S, Greenberg A, Larracuente A, Sproul J. RepeatProfiler: A pipeline for visualization and comparative analysis of repetitive DNA profiles. Mol Ecol Resour 2021; 21:969-981. [PMID: 33277787 PMCID: PMC7954937 DOI: 10.1111/1755-0998.13305] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 11/11/2020] [Accepted: 11/30/2020] [Indexed: 12/20/2022]
Abstract
Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single-copy DNA, repetitive sequences can reveal signals of evolutionary history over short time scales that may not be evident in sequences from slower-evolving genomic regions. Many tools for studying REs are directed toward organisms with existing genomic resources, including genome assemblies and repeat libraries. However, signals in repeat variation may prove especially valuable in disentangling evolutionary histories in diverse non-model groups, for which genomic resources are limited. Here, we introduce RepeatProfiler, a tool for generating, visualizing, and comparing repetitive element DNA profiles from low-coverage, short-read sequence data. RepeatProfiler automates the generation and visualization of RE coverage depth profiles (RE profiles) and allows for statistical comparison of profile shape across samples. In addition, RepeatProfiler facilitates comparison of profiles by extracting signal from sequence variants across profiles which can then be analysed as molecular morphological characters using phylogenetic analysis. We validate RepeatProfiler with data sets from ground beetles (Bembidion), flies (Drosophila), and tomatoes (Solanum). We highlight the potential of RE profiles as a high-resolution data source for studies in species delimitation, comparative genomics, and repeat biology.
Collapse
Affiliation(s)
- S. Negm
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - A. Greenberg
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - A.M. Larracuente
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - J.S. Sproul
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| |
Collapse
|
6
|
Reconstructing phylogenetic relationships based on repeat sequence similarities. Mol Phylogenet Evol 2020; 147:106766. [DOI: 10.1016/j.ympev.2020.106766] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 12/09/2019] [Accepted: 02/12/2020] [Indexed: 12/25/2022]
|
7
|
Palacios-Gimenez OM, Milani D, Song H, Marti DA, López-León MD, Ruiz-Ruano FJ, Camacho JPM, Cabral-de-Mello DC. Eight Million Years of Satellite DNA Evolution in Grasshoppers of the Genus Schistocerca Illuminate the Ins and Outs of the Library Hypothesis. Genome Biol Evol 2020; 12:88-102. [PMID: 32211863 PMCID: PMC7093836 DOI: 10.1093/gbe/evaa018] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/30/2020] [Indexed: 12/21/2022] Open
Abstract
Satellite DNA (satDNA) is an abundant class of tandemly repeated noncoding sequences, showing high rate of change in sequence, abundance, and physical location. However, the mechanisms promoting these changes are still controversial. The library model was put forward to explain the conservation of some satDNAs for long periods, predicting that related species share a common collection of satDNAs, which mostly experience quantitative changes. Here, we tested the library model by analyzing three satDNAs in ten species of Schistocerca grasshoppers. This group represents a valuable material because it diversified during the last 7.9 Myr across the American continent from the African desert locust (Schistocerca gregaria), and this thus illuminates the direction of evolutionary changes. By combining bioinformatic and cytogenetic, we tested whether these three satDNA families found in S. gregaria are also present in nine American species, and whether differential gains and/or losses have occurred in the lineages. We found that the three satDNAs are present in all species but display remarkable interspecies differences in their abundance and sequences while being highly consistent with genus phylogeny. The number of chromosomal loci where satDNA is present was also consistent with phylogeny for two satDNA families but not for the other. Our results suggest eminently chance events for satDNA evolution. Several evolutionary trends clearly imply either massive amplifications or contractions, thus closely fitting the library model prediction that changes are mostly quantitative. Finally, we found that satDNA amplifications or contractions may influence the evolution of monomer consensus sequences and by chance playing a major role in driftlike dynamics.
Collapse
Affiliation(s)
- Octavio M Palacios-Gimenez
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Sweden
- Department of Organismal Biology, Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Diogo Milani
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências/IB, UNESP - Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
| | - Hojun Song
- Department of Entomology, Texas A&M University
| | - Dardo A Marti
- Laboratorio de Genética Evolutiva, IBS, Facultad de Ciencias Exactas, Químicas y Naturales, Universidad Nacional de Misiones, CONICET, Posadas, Argentina
| | - Maria D López-León
- Departamento de Genética, Facultad de Ciencias, UGR - Univ de Granada, Spain
| | - Francisco J Ruiz-Ruano
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Sweden
- Department of Organismal Biology, Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | | | - Diogo C Cabral-de-Mello
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências/IB, UNESP - Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
| |
Collapse
|