1
|
Castro N, Vilela B, Mata-Sucre Y, Marques A, Gagnon E, Lewis GP, Costa L, Souza G. Repeatome evolution across space and time: Unravelling repeats dynamics in the plant genus Erythrostemon Klotzsch (Leguminosae Juss). Mol Ecol 2024:e17510. [PMID: 39248108 DOI: 10.1111/mec.17510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 07/17/2024] [Accepted: 07/22/2024] [Indexed: 09/10/2024]
Abstract
Fluctuations in genomic repetitive fractions (repeatome) are known to impact several facets of evolution, such as ecological adaptation and speciation processes. Therefore, investigating the divergence of repetitive elements can provide insights into an important evolutionary force. However, it is not clear how the different repetitive element clades are impacted by the different factors such as ecological changes and/or phylogeny. To discuss this, we used the Neotropical legume genus Erythrostemon (Caesalpinioideae) as a model, given its ancient origin (~33 Mya), lineage-specific niche conservatism, macroecological heterogeneity, and disjunct distribution in Meso- and South American (MA and SA respectively) lineages. We performed a comparative repeatomic analysis of 18 Erythrostemon species to test the impact of environmental variables over repeats diversification. Overall, repeatome composition was diverse, with high abundances of satDNAs and Ty3/gypsy-Tekay transposable elements, predominantly in the MA and SA lineages respectively. However, unexpected repeatome profiles unrelated to the phylogeny/biogeography were found in a few MA (E. coccineus, E. pannosus and E. placidus) and SA (E. calycinus) species, related to reticulate evolution and incongruence between nuclear and plastid topology, suggesting ancient hybridizations. The plesiomorphic Tekay and satDNA pattern was altered in the MA-sensu stricto subclade with a striking genomic differentiation (expansion of satDNA and retraction of Tekay) associated with the colonization of a new environment in Central America around 20 Mya. Our data reveal that the current species-specific Tekay pool was the result of two bursts of amplification probably in the Miocene, with distinct patterns for the MA and SA repeatomes. This suggests a strong role of the Tekay elements as modulators of the genome-environment interaction in Erythrostemon, providing macroevolutionary insights about mechanisms of repeatome differentiation and plant diversification across space and time.
Collapse
Affiliation(s)
- Natália Castro
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Center, Federal University of Pernambuco, Recife, Brazil
| | - Bruno Vilela
- Institute of Biology, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Yennifer Mata-Sucre
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Center, Federal University of Pernambuco, Recife, Brazil
| | - André Marques
- Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Edeline Gagnon
- Department of Integrative Biology, University of Guelph, Guelph, Ontario, Canada
| | - Gwilym P Lewis
- Accelerated Taxonomy Department, Royal Botanic Gardens, Kew, Richmond, UK
| | - Lucas Costa
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Center, Federal University of Pernambuco, Recife, Brazil
| | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Center, Federal University of Pernambuco, Recife, Brazil
| |
Collapse
|
2
|
Decena MÁ, Sancho R, Inda LA, Pérez-Collazos E, Catalán P. Expansions and contractions of repetitive DNA elements reveal contrasting evolutionary responses to the polyploid genome shock hypothesis in Brachypodium model grasses. FRONTIERS IN PLANT SCIENCE 2024; 15:1419255. [PMID: 39049853 PMCID: PMC11266827 DOI: 10.3389/fpls.2024.1419255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 06/19/2024] [Indexed: 07/27/2024]
Abstract
Brachypodium grass species have been selected as model plants for functional genomics of grass crops, and to elucidate the origins of allopolyploidy and perenniality in monocots, due to their small genome sizes and feasibility of cultivation. However, genome sizes differ greatly between diploid or polyploid Brachypodium lineages. We have used genome skimming sequencing data to uncover the composition, abundance, and phylogenetic value of repetitive elements in 44 representatives of the major Brachypodium lineages and cytotypes. We also aimed to test the possible mechanisms and consequences of the "polyploid genome shock hypothesis" (PGSH) under three different evolutionary scenarios of variation in repeats and genome sizes of Brachypodium allopolyploids. Our data indicated that the proportion of the genome covered by the repeatome in the Brachypodium species showed a 3.3-fold difference between the highest content of B. mexicanum-4x (67.97%) and the lowest of B. stacei-2x (20.77%), and that changes in the sizes of their genomes were a consequence of gains or losses in their repeat elements. LTR-Retand and Tekay retrotransposons were the most frequent repeat elements in the Brachypodium genomes, while Ogre retrotransposons were found exclusively in B. mexicanum. The repeatome phylogenetic network showed a high topological congruence with plastome and nuclear rDNA and transcriptome trees, differentiating the ancestral outcore lineages from the recently evolved core-perennial lineages. The 5S rDNA graph topologies had a strong match with the ploidy levels and nature of the subgenomes of the Brachypodium polyploids. The core-perennial B. sylvaticum presents a large repeatome and characteristics of a potential post-polyploid diploidized origin. Our study evidenced that expansions and contractions in the repeatome were responsible for the three contrasting responses to the PGSH. The exacerbated genome expansion of the ancestral allotetraploid B. mexicanum was a consequence of chromosome-wide proliferation of TEs and not of WGD, the additive repeatome pattern of young allotetraploid B. hybridum of stabilized post-WGD genome evolution, and the genomecontraction of recent core-perennials polyploids (B. pinnatum, B. phoenicoides) of repeat losses through recombination of these highly hybridizing lineages. Our analyses have contributed to unraveling the evolution of the repeatome and the genome size variation in model Brachypodium grasses.
Collapse
Affiliation(s)
- María Ángeles Decena
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Grupo de Bioquímica, Biofísica y Biología Computacional (Instituto de Biocomputación y Física de Sistemas Complejos (BIFI) Universidad de Zaragoza), Unidad Asociada al Consejo Superior de Investigaciones Científicas (CSIC), Zaragoza, Spain
| | - Rubén Sancho
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Grupo de Bioquímica, Biofísica y Biología Computacional (Instituto de Biocomputación y Física de Sistemas Complejos (BIFI) Universidad de Zaragoza), Unidad Asociada al Consejo Superior de Investigaciones Científicas (CSIC), Zaragoza, Spain
| | - Luis A. Inda
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Centro de Investigaciones Tecnológicas y Agroalimentarias de Aragón (CITA), Zaragoza, Spain
| | - Ernesto Pérez-Collazos
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Grupo de Bioquímica, Biofísica y Biología Computacional (Instituto de Biocomputación y Física de Sistemas Complejos (BIFI) Universidad de Zaragoza), Unidad Asociada al Consejo Superior de Investigaciones Científicas (CSIC), Zaragoza, Spain
| | - Pilar Catalán
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Grupo de Bioquímica, Biofísica y Biología Computacional (Instituto de Biocomputación y Física de Sistemas Complejos (BIFI) Universidad de Zaragoza), Unidad Asociada al Consejo Superior de Investigaciones Científicas (CSIC), Zaragoza, Spain
| |
Collapse
|
3
|
Ochiai KK, Hanawa D, Ogawa HA, Tanaka H, Uesaka K, Edzuka T, Shirae-Kurabayashi M, Toyoda A, Itoh T, Goshima G. Genome sequence and cell biological toolbox of the highly regenerative, coenocytic green feather alga Bryopsis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 119:1091-1111. [PMID: 38642374 DOI: 10.1111/tpj.16764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 02/10/2024] [Accepted: 03/27/2024] [Indexed: 04/22/2024]
Abstract
Green feather algae (Bryopsidales) undergo a unique life cycle in which a single cell repeatedly executes nuclear division without cytokinesis, resulting in the development of a thallus (>100 mm) with characteristic morphology called coenocyte. Bryopsis is a representative coenocytic alga that has exceptionally high regeneration ability: extruded cytoplasm aggregates rapidly in seawater, leading to the formation of protoplasts. However, the genetic basis of the unique cell biology of Bryopsis remains poorly understood. Here, we present a high-quality assembly and annotation of the nuclear genome of Bryopsis sp. (90.7 Mbp, 27 contigs, N50 = 6.7 Mbp, 14 034 protein-coding genes). Comparative genomic analyses indicate that the genes encoding BPL-1/Bryohealin, the aggregation-promoting lectin, are heavily duplicated in Bryopsis, whereas homologous genes are absent in other ulvophyceans, suggesting the basis of regeneration capability of Bryopsis. Bryopsis sp. possesses >30 kinesins but only a single myosin, which differs from other green algae that have multiple types of myosin genes. Consistent with this biased motor toolkit, we observed that the bidirectional motility of chloroplasts in the cytoplasm was dependent on microtubules but not actin in Bryopsis sp. Most genes required for cytokinesis in plants are present in Bryopsis, including those in the SNARE or kinesin superfamily. Nevertheless, a kinesin crucial for cytokinesis initiation in plants (NACK/Kinesin-7II) is hardly expressed in the coenocytic part of the thallus, possibly underlying the lack of cytokinesis in this portion. The present genome sequence lays the foundation for experimental biology in coenocytic macroalgae.
Collapse
Affiliation(s)
- Kanta K Ochiai
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, Toba, 517-0004, Japan
| | - Daiki Hanawa
- School of Life Science and Technology, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan
| | - Harumi A Ogawa
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, Toba, 517-0004, Japan
| | - Hiroyuki Tanaka
- School of Life Science and Technology, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan
| | - Kazuma Uesaka
- Centre for Gene Research, Nagoya University, Nagoya, 464-8602, Japan
| | - Tomoya Edzuka
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, Toba, 517-0004, Japan
| | - Maki Shirae-Kurabayashi
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, Toba, 517-0004, Japan
| | - Atsushi Toyoda
- Comparative Genomics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan
- Advanced Genomics Center, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan
| | - Takehiko Itoh
- School of Life Science and Technology, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan
| | - Gohta Goshima
- Sugashima Marine Biological Laboratory, Graduate School of Science, Nagoya University, Toba, 517-0004, Japan
- Department of Biological Science, Graduate School of Science, Nagoya University, Nagoya, 464-8602, Japan
| |
Collapse
|
4
|
Schmidt N, Sielemann K, Breitenbach S, Fuchs J, Pucker B, Weisshaar B, Holtgräwe D, Heitkam T. Repeat turnover meets stable chromosomes: repetitive DNA sequences mark speciation and gene pool boundaries in sugar beet and wild beets. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:171-190. [PMID: 38128038 DOI: 10.1111/tpj.16599] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/05/2023] [Accepted: 12/08/2023] [Indexed: 12/23/2023]
Abstract
Sugar beet and its wild relatives share a base chromosome number of nine and similar chromosome morphologies. Yet, interspecific breeding is impeded by chromosome and sequence divergence that is still not fully understood. Since repetitive DNAs are among the fastest evolving parts of the genome, we investigated, if repeatome innovations and losses are linked to chromosomal differentiation and speciation. We traced genome and chromosome-wide evolution across 13 beet species comprising all sections of the genera Beta and Patellifolia. For this, we combined short and long read sequencing, flow cytometry, and cytogenetics to build a comprehensive framework that spans the complete scale from DNA to chromosome to genome. Genome sizes and repeat profiles reflect the separation into three gene pools with contrasting evolutionary patterns. Among all repeats, satellite DNAs harbor most genomic variability, leading to fundamentally different centromere architectures, ranging from chromosomal uniformity in Beta and Patellifolia to the formation of patchwork chromosomes in Corollinae/Nanae. We show that repetitive DNAs are causal for the genome expansions and contractions across the beet genera, providing insights into the genomic underpinnings of beet speciation. Satellite DNAs in particular vary considerably between beet genomes, leading to the evolution of distinct chromosomal setups in the three gene pools, likely contributing to the barriers in beet breeding. Thus, with their isokaryotypic chromosome sets, beet genomes present an ideal system for studying the link between repeats, genomic variability, and chromosomal differentiation and provide a theoretical fundament for understanding barriers in any crop breeding effort.
Collapse
Affiliation(s)
- Nicola Schmidt
- Faculty of Biology, Technische Universität Dresden, 01069, Dresden, Germany
| | - Katharina Sielemann
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615, Bielefeld, Germany
- Graduate School DILS, Bielefeld Institute for Bioinformatics Infrastructure (BIBI), Bielefeld University, 33615, Bielefeld, Germany
| | - Sarah Breitenbach
- Faculty of Biology, Technische Universität Dresden, 01069, Dresden, Germany
| | - Jörg Fuchs
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany
| | - Boas Pucker
- Plant Biotechnology and Bioinformatics, Institute of Plant Biology & Braunschweig Integrated Centre of Systems Biology (BRICS), TU Braunschweig, 38106, Braunschweig, Germany
| | - Bernd Weisshaar
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615, Bielefeld, Germany
| | - Daniela Holtgräwe
- Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) & Faculty of Biology, Bielefeld University, 33615, Bielefeld, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, 01069, Dresden, Germany
- Institute of Biology, NAWI Graz, Karl-Franzens-Universität, A-8010 Graz, Graz, Austria
| |
Collapse
|
5
|
Silva GS, Souza MM, Pamponét VDCC, Micheli F, de Melo CAF, de Oliveira SG, Costa EA. Cytogenomic Characterization of Transposable Elements and Satellite DNA in Passiflora L. Species. Genes (Basel) 2024; 15:418. [PMID: 38674353 PMCID: PMC11049143 DOI: 10.3390/genes15040418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/17/2024] [Accepted: 03/22/2024] [Indexed: 04/28/2024] Open
Abstract
The species Passiflora alata, P. cincinnata, and P. edulis have great economic value due to the use of their fruits for human consumption. In this study, we compared the repetitive genome fractions of these three species. The compositions of the repetitive DNA of these three species' genomes were analyzed using clustering and identification of the repetitive sequences with RepeatExplorer. It was found that repetitive DNA content represents 74.70%, 66.86%, and 62.24% of the genome of P. alata, P. edulis, and P. cincinnata, respectively. LTR Ty3/Gypsy retrotransposons represent the highest genome proportions in P. alata and P. edulis, while Ty1/Copia comprises the largest proportion of P. cincinnata genome. Chromosomal mapping by Fluorescent In Situ Hybridization (FISH) showed that LTR retrotransposons have a dispersed distribution along chromosomes. The subtelomeric region of chromosomes is where 145 bp satellite DNA is located, suggesting that these elements may play important roles in genome structure and organization in these species. In this work, we obtained the first global characterization of the composition of repetitive DNA in Passiflora, showing that an increase in genome size is related to an increase in repetitive DNA, which represents an important evolutionary route for these species.
Collapse
Affiliation(s)
- Gonçalo Santos Silva
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Margarete Magalhães Souza
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Vanessa de Carvalho Cayres Pamponét
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Fabienne Micheli
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Cláusio Antônio Ferreira de Melo
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Sárah Gomes de Oliveira
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo (USP), São Paulo 01049-010, SP, Brazil;
| | - Eduardo Almeida Costa
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| |
Collapse
|
6
|
Pei D, Yu X, Fu W, Ma X, Fang J. The evolution and formation of centromeric repeats analysis in Vitis vinifera. PLANTA 2024; 259:99. [PMID: 38522063 DOI: 10.1007/s00425-024-04374-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 03/03/2024] [Indexed: 03/25/2024]
Abstract
MAIN CONCLUSION Six grape centromere-specific markers for cytogenetics were mined by combining genetic and immunological assays, and the possible evolution mechanism of centromeric repeats was analyzed. Centromeric histone proteins are functionally conserved; however, centromeric repetitive DNA sequences may represent considerable diversity in related species. Therefore, studying the characteristics and structure of grape centromere repeat sequences contributes to a deeper understanding of the evolutionary process of grape plants, including their origin and mechanisms of polyploidization. Plant centromeric regions are mainly composed of repetitive sequences, including SatDNA and transposable elements (TE). In this research, the characterization of centromere sequences in the whole genome of grapevine (Vitis vinifera L.) has been conducted. Five centromeric tandem repeat sequences (Vv1, Vv2, Vv5, Vv6, and Vv8) and one long terminal repeat (LTR) sequence Vv24 were isolated. These sequences had different centromeric distributions, which indicates that grape centromeric sequences may undergo rapid evolution. The existence of extrachromosomal circular DNA (eccDNA) and gene expression in CenH3 subdomain region may provide various potential mechanisms for the generation of new centromeric regions.
Collapse
Affiliation(s)
- Dan Pei
- Key Laboratory of Genetics and Fruit Development, College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Xue Yu
- Key Laboratory of Genetics and Fruit Development, College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Weihong Fu
- Key Laboratory of Genetics and Fruit Development, College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Xuhui Ma
- College of Life Sciences, Zaozhuang University, Zaozhuang, 277000, China
| | - Jinggui Fang
- Key Laboratory of Genetics and Fruit Development, College of Horticulture, Nanjing Agricultural University, Nanjing, China.
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, Nanjing, 210095, China.
| |
Collapse
|
7
|
An SM, Kim BY, Kang H, Lee HR, Lee YB, Park YJ, Cheon KS, Kim KA. The complete chloroplast genome sequences of six Hylotelephium species: Comparative genomic analysis and phylogenetic relationships. PLoS One 2023; 18:e0292056. [PMID: 37815995 PMCID: PMC10564136 DOI: 10.1371/journal.pone.0292056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 09/12/2023] [Indexed: 10/12/2023] Open
Abstract
To evaluate the phylogenetic relationships between Hylotelephium and Orostachys, and to provide important information for further studies, we analyzed the complete chloroplast genomes of six Hylotelephium species and compared the sequences to those of published chloroplast genomes of congeneric species and species of the closely related genus, Orostachys. The total chloroplast genome length of nineteen species, including the six Hylotelephium species analyzed in this study and the thirteen Hylotelephium and Orostachys species analyzed in previous studies, ranged from 150,369 bp (O. minuta) to 151,739 bp (H. spectabile). Their overall GC contents were almost identical (37.7-37.8%). The chloroplast genomes of the nineteen species contained 113 unique genes comprising 79 protein-coding genes (PCGs), 30 transfer RNA genes (tRNAs), and four ribosomal RNA genes (rRNAs). Among the annotated genes, fourteen genes contained one intron, and two genes contained two introns. The chloroplast genomes of the nineteen Hylotelephium and Orostachys species had identical structures. Additionally, the large single copy (LSC), inverted repeat (IR), and small single copy (SSC) junction regions were conserved in the Hylotelephium and Orostachys species. The nucleotide diversity between the Hylotelephium chloroplast genomes was extremely low in all regions, and only one region showed a high Pi value (>0.03). In all nineteen chloroplast genomes, six regions had a high Pi value (>0.03). The phylogenetic analysis showed that the genus delimitation could not be clearly observed even in this study because Hylotelephium formed a paraphyly with subsect. Orostachys of the genus Orostachys. Additionally, the data supported the taxonomic position of Sedum taqeutii, which was treated as a synonym for H. viridescens in previous studies, as an independent taxon.
Collapse
Affiliation(s)
- Sung-Mo An
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Bo-Yun Kim
- Plant Resources Division, National Institute of Biological Resources, Incheon, South Korea
| | - Halam Kang
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Ha-Rim Lee
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Yoo-Bin Lee
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Yoo-Jung Park
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Kyeong-Sik Cheon
- Department of Biological Science, Sangji University, Wonju, South Korea
| | - Kyung-Ah Kim
- Environmental Research Institute, Kangwon National University, Chuncheon, South Korea
| |
Collapse
|
8
|
Ferraz ME, Ribeiro T, Sader M, Nascimento T, Pedrosa-Harand A. Comparative analysis of repetitive DNA in dysploid and non-dysploid Phaseolus beans. Chromosome Res 2023; 31:30. [PMID: 37812264 DOI: 10.1007/s10577-023-09739-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 08/31/2023] [Accepted: 09/15/2023] [Indexed: 10/10/2023]
Abstract
Structural karyotype changes result from ectopic recombination events frequently associated with repetitive DNA. Although most Phaseolus species present relatively stable karyotypes with 2n = 22 chromosomes, the karyotypes of species of the Leptostachyus group show high rates of structural rearrangements, including a nested chromosome fusion that led to the dysploid chromosome number of the group (2n = 20). We examined the roles of repetitive landscapes in the rearrangements of species of the Leptostachyus group using genome-skimming data to characterize the repeatome in a range of Phaseolus species and compared them to species of that group (P. leptostachyus and P. macvaughii). LTR retrotransposons, especially the Ty3/gypsy lineage Chromovirus, were the most abundant elements in the genomes. Differences in the abundance of Tekay, Retand, and SIRE elements between P. macvaughii and P. leptostachyus were reflected in their total amounts of Ty3/gypsy and Ty1/copia. The satellite DNA fraction was the most divergent among the species, varying both in abundance and distribution, even between P. leptostachyus and P. macvaughii. The rapid turnover of repeats in the Leptostachyus group may be associated with the several rearrangements observed.
Collapse
Affiliation(s)
- Maria Eduarda Ferraz
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil
| | - Tiago Ribeiro
- Integrative Plant Research Lab, Department of Botany and Ecology, Institute of Biosciences, Federal University of Mato Grosso, Cuiabá, MT, Brazil
| | - Mariela Sader
- Multidisciplinary Institute of Plant Biology, National Council for Scientific and Technical Research, National University of Córdoba, Córdoba, Argentina
| | - Thiago Nascimento
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil.
| |
Collapse
|
9
|
Sproul JS, Hotaling S, Heckenhauer J, Powell A, Marshall D, Larracuente AM, Kelley JL, Pauls SU, Frandsen PB. Analyses of 600+ insect genomes reveal repetitive element dynamics and highlight biodiversity-scale repeat annotation challenges. Genome Res 2023; 33:1708-1717. [PMID: 37739812 PMCID: PMC10691545 DOI: 10.1101/gr.277387.122] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 09/20/2023] [Indexed: 09/24/2023]
Abstract
Repetitive elements (REs) are integral to the composition, structure, and function of eukaryotic genomes, yet remain understudied in most taxonomic groups. We investigated REs across 601 insect species and report wide variation in RE dynamics across groups. Analysis of associations between REs and protein-coding genes revealed dynamic evolution at the interface between REs and coding regions across insects, including notably elevated RE-gene associations in lineages with abundant long interspersed nuclear elements (LINEs). We leveraged this large, empirical data set to quantify impacts of long-read technology on RE detection and investigate fundamental challenges to RE annotation in diverse groups. In long-read assemblies, we detected ∼36% more REs than short-read assemblies, with long terminal repeats (LTRs) showing 162% increased detection, whereas DNA transposons and LINEs showed less respective technology-related bias. In most insect lineages, 25%-85% of repetitive sequences were "unclassified" following automated annotation, compared with only ∼13% in Drosophila species. Although the diversity of available insect genomes has rapidly expanded, we show the rate of community contributions to RE databases has not kept pace, preventing efficient annotation and high-resolution study of REs in most groups. We highlight the tremendous opportunity and need for the biodiversity genomics field to embrace REs and suggest collective steps for making progress toward this goal.
Collapse
Affiliation(s)
- John S Sproul
- Department of Biology, Brigham Young University, Provo, Utah 84602, USA;
- Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| | - Scott Hotaling
- School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
- Department of Watershed Sciences, Utah State University, Logan, Utah 84322, USA
| | - Jacqueline Heckenhauer
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
| | - Ashlyn Powell
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
| | - Dez Marshall
- Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
| | | | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Steffen U Pauls
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
- Department of Insect Biotechnology, Justus-Liebig-University Gießen, 35392 Gießen, Germany
| | - Paul B Frandsen
- LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
- Data Science Lab, Smithsonian Institution, Washington, District of Columbia 20560, USA
| |
Collapse
|
10
|
Rutz C, Bonassin L, Kress A, Francesconi C, Boštjančić LL, Merlat D, Theissinger K, Lecompte O. Abundance and Diversification of Repetitive Elements in Decapoda Genomes. Genes (Basel) 2023; 14:1627. [PMID: 37628678 PMCID: PMC10454600 DOI: 10.3390/genes14081627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/05/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
Repetitive elements are a major component of DNA sequences due to their ability to propagate through the genome. Characterization of Metazoan repetitive profiles is improving; however, current pipelines fail to identify a significant proportion of divergent repeats in non-model organisms. The Decapoda order, for which repeat content analyses are largely lacking, is characterized by extremely variable genome sizes that suggest an important presence of repetitive elements. Here, we developed a new standardized pipeline to annotate repetitive elements in non-model organisms, which we applied to twenty Decapoda and six other Crustacea genomes. Using this new tool, we identified 10% more repetitive elements than standard pipelines. Repetitive elements were more abundant in Decapoda species than in other Crustacea, with a very large number of highly repeated satellite DNA families. Moreover, we demonstrated a high correlation between assembly size and transposable elements and different repeat dynamics between Dendrobranchiata and Reptantia. The patterns of repetitive elements largely reflect the phylogenetic relationships of Decapoda and the distinct evolutionary trajectories within Crustacea. In summary, our results highlight the impact of repetitive elements on genome evolution in Decapoda and the value of our novel annotation pipeline, which will provide a baseline for future comparative analyses.
Collapse
Affiliation(s)
- Christelle Rutz
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Lena Bonassin
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Arnaud Kress
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Caterina Francesconi
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Ljudevit Luka Boštjančić
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Dorine Merlat
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Kathrin Theissinger
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
| | - Odile Lecompte
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| |
Collapse
|
11
|
Cornet C, Mora P, Augustijnen H, Nguyen P, Escudero M, Lucek K. Holocentric repeat landscapes: From micro-evolutionary patterns to macro-evolutionary associations with karyotype evolution. Mol Ecol 2023. [PMID: 37577951 DOI: 10.1111/mec.17100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 07/13/2023] [Accepted: 07/28/2023] [Indexed: 08/15/2023]
Abstract
Repetitive elements can cause large-scale chromosomal rearrangements, for example through ectopic recombination, potentially promoting reproductive isolation and speciation. Species with holocentric chromosomes, that lack a localized centromere, might be more likely to retain chromosomal rearrangements that lead to karyotype changes such as fusions and fissions. This is because chromosome segregation during cell division should be less affected than in organisms with a localized centromere. The relationships between repetitive elements and chromosomal rearrangements and how they may translate to patterns of speciation in holocentric organisms are though poorly understood. Here, we use a reference-free approach based on low-coverage short-read sequencing data to characterize the repeat landscape of two independently evolved holocentric groups: Erebia butterflies and Carex sedges. We consider both micro- and macro-evolutionary scales to investigate the repeat landscape differentiation between Erebia populations and the association between repeats and karyotype changes in a phylogenetic framework for both Erebia and Carex. At a micro-evolutionary scale, we found population differentiation in repeat landscape that increases with overall intraspecific genetic differentiation among four Erebia species. At a macro-evolutionary scale, we found indications for an association between repetitive elements and karyotype changes along both Erebia and Carex phylogenies. Altogether, our results suggest that repetitive elements are associated with the level of population differentiation and chromosomal rearrangements in holocentric clades and therefore likely play a role in adaptation and potentially species diversification.
Collapse
Affiliation(s)
- Camille Cornet
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Pablo Mora
- Department of Experimental Biology, Genetics Area, University of Jaén, Jaén, Spain
- University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic
| | - Hannah Augustijnen
- Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Petr Nguyen
- University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic
| | - Marcial Escudero
- Department of Plant Biology and Ecology, University of Seville, Seville, Spain
| | - Kay Lucek
- Biodiversity Genomics Laboratory, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
12
|
Pezzini FF, Ferrari G, Forrest LL, Hart ML, Nishii K, Kidner CA. Target capture and genome skimming for plant diversity studies. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11537. [PMID: 37601316 PMCID: PMC10439825 DOI: 10.1002/aps3.11537] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 06/16/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023]
Abstract
Recent technological advances in long-read high-throughput sequencing and assembly methods have facilitated the generation of annotated chromosome-scale whole-genome sequence data for evolutionary studies; however, generating such data can still be difficult for many plant species. For example, obtaining high-molecular-weight DNA is typically impossible for samples in historical herbarium collections, which often have degraded DNA. The need to fast-freeze newly collected living samples to conserve high-quality DNA can be complicated when plants are only found in remote areas. Therefore, short-read reduced-genome representations, such as target capture and genome skimming, remain important for evolutionary studies. Here, we review the pros and cons of each technique for non-model plant taxa. We provide guidance related to logistics, budget, the genomic resources previously available for the target clade, and the nature of the study. Furthermore, we assess the available bioinformatic analyses, detailing best practices and pitfalls, and suggest pathways to combine newly generated data with legacy data. Finally, we explore the possible downstream analyses allowed by the type of data generated using each technique. We provide a practical guide to help researchers make the best-informed choice regarding reduced genome representation for evolutionary studies of non-model plants in cases where whole-genome sequencing remains impractical.
Collapse
Affiliation(s)
| | - Giada Ferrari
- Royal Botanic Garden EdinburghEdinburghUnited Kingdom
| | | | | | - Kanae Nishii
- Royal Botanic Garden EdinburghEdinburghUnited Kingdom
| | - Catherine A. Kidner
- Royal Botanic Garden EdinburghEdinburghUnited Kingdom
- School of Biological SciencesUniversity of EdinburghEdinburghUnited Kingdom
| |
Collapse
|
13
|
Yuxiang W, Peretolchina TE, Romanova EV, Sherbakov DY. Comparison of the evolutionary patterns of DNA repeats in ancient and young invertebrate species flocks of Lake Baikal. Vavilovskii Zhurnal Genet Selektsii 2023; 27:349-356. [PMID: 37465187 PMCID: PMC10350863 DOI: 10.18699/vjgb-23-42] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 02/20/2023] [Accepted: 02/20/2023] [Indexed: 07/20/2023] Open
Abstract
DNA repeat composition of low coverage (0.1-0.5) genomic libraries of four amphipods species endemic to Lake Baikal (East Siberia) and four endemic gastropod species of the fam. Baicaliidae have been compared to each other. In order to do so, a neighbor joining tree was inferred for each quartet of species (amphipods and mollusks) based on the ratio of repeat classes shared in each pair of species. The topology of this tree was compared to the phylogenies inferred for the same species from the concatenated protein-coding mitochondrial nucleotide sequences. In all species analyzed, the fraction of DNA repeats involved circa half of the genome. In relatively more ancient amphipods (most recent common ancestor, MRCA, existed approximately sixty millions years ago), the most abundant were species-specific repeats, while in much younger Baicaliidae (MRCA equal to ca. three millions years) most of the DNA repeats were shared among all four species. If the presence/absence of a repeat is regarded as a separate independent trait, and the ratio of shared to total numbers of repeats in a species pair is used as the measure of distance, the topology of the NJ tree is the same as the quartet phylogeny inferred for the mitogenomes protein coding nucleotide sequences. Meanwhile, in each group of species, a substantial number of repeats were detected pointing to the possibility of non-neutral evolution or a horizontal transfer between species occupying the same biotope. These repeats were shared by non-sister groups while being absent in the sister genomes. On the other hand, in such cases some traits of ecological significance were also shared.
Collapse
Affiliation(s)
- Wang Yuxiang
- Limnological institute of the Siberian Branch of the Russian Academy of Sciences, Irkutsk, Russia
| | - T E Peretolchina
- Limnological institute of the Siberian Branch of the Russian Academy of Sciences, Irkutsk, Russia
| | - E V Romanova
- Limnological institute of the Siberian Branch of the Russian Academy of Sciences, Irkutsk, Russia
| | - D Y Sherbakov
- Limnological institute of the Siberian Branch of the Russian Academy of Sciences, Irkutsk, Russia Novosibirsk State University, Novosibirsk, RussiaIrkutsk State University, Irkutsk, Russia
| |
Collapse
|
14
|
de la Fuente R, Díaz-Villanueva W, Arnau V, Moya A. Genomic Signature in Evolutionary Biology: A Review. BIOLOGY 2023; 12:biology12020322. [PMID: 36829597 PMCID: PMC9953303 DOI: 10.3390/biology12020322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/19/2023]
Abstract
Organisms are unique physical entities in which information is stored and continuously processed. The digital nature of DNA sequences enables the construction of a dynamic information reservoir. However, the distinction between the hardware and software components in the information flow is crucial to identify the mechanisms generating specific genomic signatures. In this work, we perform a bibliometric analysis to identify the different purposes of looking for particular patterns in DNA sequences associated with a given phenotype. This study has enabled us to make a conceptual breakdown of the genomic signature and differentiate the leading applications. On the one hand, it refers to gene expression profiling associated with a biological function, which may be shared across taxa. This signature is the focus of study in precision medicine. On the other hand, it also refers to characteristic patterns in species-specific DNA sequences. This interpretation plays a key role in comparative genomics, identifying evolutionary relationships. Looking at the relevant studies in our bibliographic database, we highlight the main factors causing heterogeneities in genome composition and how they can be quantified. All these findings lead us to reformulate some questions relevant to evolutionary biology.
Collapse
Affiliation(s)
- Rebeca de la Fuente
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Correspondence:
| | - Wladimiro Díaz-Villanueva
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
| | - Vicente Arnau
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
| | - Andrés Moya
- Institute of Integrative Systems Biology (I2Sysbio), University of Valencia and Spanish Research Council (CSIC), 46980 Valencia, Spain
- Foundation for the Promotion of Sanitary and Biomedical Research of the Valencian Community (FISABIO), 46020 Valencia, Spain
- CIBER in Epidemiology and Public Health (CIBEResp), 28029 Madrid, Spain
| |
Collapse
|
15
|
Becher H, Nichols RA. Assembly-free quantification of vagrant DNA inserts. Mol Ecol Resour 2023. [PMID: 36740932 DOI: 10.1111/1755-0998.13764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 01/27/2023] [Accepted: 01/30/2023] [Indexed: 02/07/2023]
Abstract
Inserts of DNA from extranuclear sources, such as organelles and microbes, are common in eukaryote nuclear genomes. However, sequence similarity between the nuclear and extranuclear DNA, and a history of multiple insertions, make the assembly of these regions challenging. Consequently, the number, sequence and location of these vagrant DNAs cannot be reliably inferred from the genome assemblies of most organisms. We introduce two statistical methods to estimate the abundance of nuclear inserts even in the absence of a nuclear genome assembly. The first (intercept method) only requires low-coverage (<1×) sequencing data, as commonly generated for population studies of organellar and ribosomal DNAs. The second method additionally requires that a subset of the individuals carry extranuclear DNA with diverged genotypes. We validated our intercept method using simulations and by re-estimating the frequency of human NUMTs (nuclear mitochondrial inserts). We then applied it to the grasshopper Podisma pedestris, exceptional for both its large genome size and reports of numerous NUMT inserts, estimating that NUMTs make up 0.056% of the nuclear genome, equivalent to >500 times the mitochondrial genome size. We also re-analysed a museomics data set of the parrot Psephotellus varius, obtaining an estimate of only 0.0043%, in line with reports from other species of bird. Our study demonstrates the utility of low-coverage high-throughput sequencing data for the quantification of nuclear vagrant DNAs. Beyond quantifying organellar inserts, these methods could also be used on endosymbiont-derived sequences. We provide an R implementation of our methods called "vagrantDNA" and code to simulate test data sets.
Collapse
Affiliation(s)
- Hannes Becher
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
| | - Richard A Nichols
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| |
Collapse
|
16
|
Phylogenomics and genome size evolution in Amomum s. s. (Zingiberaceae): Comparison of traditional and modern sequencing methods. Mol Phylogenet Evol 2023; 178:107666. [PMID: 36384185 DOI: 10.1016/j.ympev.2022.107666] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 11/05/2022] [Accepted: 11/08/2022] [Indexed: 11/15/2022]
Abstract
BACKGROUND AND AIMS A targeted enrichment NGS approach was used to construct the phylogeny of Amomum Roxb. (Zingiberaceae). Phylogenies based on hundreds of nuclear genes, the whole plastome and the rDNA cistron were compared with an ITS-based phylogeny. Trends in genome size (GS) evolution were examined, chromosomes were counted and the geographical distribution of phylogenetic lineages was evaluated. METHODS In total, 92 accessions of 54 species were analysed. ITS was obtained for 79 accessions, 37 accessions were processed with Hyb-Seq and sequences from 449 nuclear genes, the whole cpDNA, and the rDNA cistron were analysed using concatenation, coalescence and supertree approaches. The evolution of absolute GS was analysed in a phylogenetic and geographical context. The chromosome numbers of 12 accessions were counted. KEY RESULTS Four groups were recognised in all datasets though their mutual relationships differ among datasets. While group A (A. subulatum and A. petaloideum) is basal to the remaining groups in the nuclear gene phylogeny, in the cpDNA topology it is sister to group B (A. repoeense and related species) and, in the ITS topology, it is sister to group D (the Elettariopsis lineage). The former Elettariopsis makes a monophyletic group. There is an increasing trend in GS during evolution. The largest GS values were found in group D in two tetraploid taxa, A. cinnamomeum and A. aff. biphyllum (both 2n = 96 chromosomes). The rest varied in GS (2C = 3.54-8.78 pg) with a constant chromosome number 2n = 48. There is a weak connection between phylogeny, GS and geography in Amomum. CONCLUSIONS Amomum consists of four groups, and the former Elettariopsis is monophyletic. Species in this group have the largest GS. Two polyploids were found and GS greatly varied in the rest of Amomum.
Collapse
|
17
|
Reginato M. A pipeline for assembling low copy nuclear markers from plant genome skimming data for phylogenetic use. PeerJ 2022; 10:e14525. [PMID: 36523475 PMCID: PMC9745922 DOI: 10.7717/peerj.14525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 11/15/2022] [Indexed: 12/12/2022] Open
Abstract
Background Genome skimming is a popular method in plant phylogenomics that do not include a biased enrichment step, relying on random shallow sequencing of total genomic DNA. From these data the plastome is usually readily assembled and constitutes the bulk of phylogenetic information generated in these studies. Despite a few attempts to use genome skims to recover low copy nuclear loci for direct phylogenetic use, such endeavor remains neglected. Causes might include the trade-off between libraries with few reads and species with large genomes (i.e., missing data caused by low coverage), but also might relate to the lack of pipelines for data assembling. Methods A pipeline and its companion R package designed to automate the recovery of low copy nuclear markers from genome skimming libraries are presented. Additionally, a series of analyses aiming to evaluate the impact of key assembling parameters, reference selection and missing data are presented. Results A substantial amount of putative low copy nuclear loci was assembled and proved useful to base phylogenetic inference across the libraries tested (4 to 11 times more data than previously assembled plastomes from the same libraries). Discussion Critical aspects of assembling low copy nuclear markers from genome skims include the minimum coverage and depth of a sequence to be used. More stringent values of these parameters reduces the amount of assembled data and increases the relative amount of missing data, which can compromise phylogenetic inference, in turn relaxing the same parameters might increase sequence error. These issues are discussed in the text, and parameter tuning through multiple comparisons tracking their effects on support and congruence is highly recommended when using this pipeline. The skimmingLoci pipeline (https://github.com/mreginato/skimmingLoci) might stimulate the use of genome skims to recover nuclear loci for direct phylogenetic use, increasing the power of genome skimming data to resolve phylogenetic relationships, while reducing the amount of sequenced DNA that is commonly wasted.
Collapse
|
18
|
Torres-Arciga K, Flores-León M, Ruiz-Pérez S, Trujillo-Pineda M, González-Barrios R, Herrera LA. Histones and their chaperones: Adaptive remodelers of an ever-changing chromatinic landscape. Front Genet 2022; 13:1057846. [PMID: 36468032 PMCID: PMC9709290 DOI: 10.3389/fgene.2022.1057846] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 11/02/2022] [Indexed: 07/29/2023] Open
Abstract
Chromatin maintenance and remodeling are processes that take place alongside DNA repair, replication, or transcription to ensure the survival and adaptability of a cell. The environment and the needs of the cell dictate how chromatin is remodeled; particularly where and which histones are deposited, thus changing the canonical histone array to regulate chromatin structure and gene expression. Chromatin is highly dynamic, and histone variants and their chaperones play a crucial role in maintaining the epigenetic regulation at different genomic regions. Despite the large number of histone variants reported to date, studies on their roles in physiological processes and pathologies are emerging but continue to be scarce. Here, we present recent advances in the research on histone variants and their chaperones, with a focus on their importance in molecular mechanisms such as replication, transcription, and DNA damage repair. Additionally, we discuss the emerging role they have in transposable element regulation, aging, and chromatin remodeling syndromes. Finally, we describe currently used methods and their limitations in the study of these proteins and highlight the importance of improving the experimental approaches to further understand this epigenetic machinery.
Collapse
Affiliation(s)
- Karla Torres-Arciga
- Doctorado en Ciencias Biológicas, Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología (INCan)-Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Manuel Flores-León
- Departamento de Medicina Genómica y Toxicología Ambiental, Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Samuel Ruiz-Pérez
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología (INCan)-Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Magalli Trujillo-Pineda
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología (INCan)-Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Rodrigo González-Barrios
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología (INCan)-Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Luis A. Herrera
- Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología (INCan)-Instituto de Investigaciones Biomédicas (IIBO), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
- Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
| |
Collapse
|
19
|
Kaya Y, Aydın ZU, Cai X, Wang X, Dönmez AA. Genome-wide characterization of two Aubrieta taxa: Aubrieta canescens subsp. canescens and Au. macrostyla (Brassicaceae). AOB PLANTS 2022; 14:plac035. [PMID: 36196394 PMCID: PMC9521481 DOI: 10.1093/aobpla/plac035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 09/09/2022] [Indexed: 06/16/2023]
Abstract
Aubrieta canescens complex is divided into two subspecies, Au. canescens subsp. canescens, Au. canescens subsp. cilicica and a distinct species, Au. macrostyla, based on molecular phylogeny. We generated a draft assembly of Au. canescens subsp. canescens and Au. macrostyla using paired-end shotgun sequencing. This is the first attempt at genome characterization for the genus. In the presented study, ~165 and ~157 Mbp of the genomes of Au. canescens subsp. canescens and Au. macrostyla were assembled, respectively, and a total of 32 425 and 31 372 gene models were predicted in the genomes of the target taxa, respectively. We corroborated the phylogenomic affinity of taxa with some core Brassicaceae species (Clades A and B) including Arabis alpina. The orthology-based tree suggested that Aubrieta species differentiated from A. alpina 1.3-2.0 mya (million years ago). The genome-wide syntenic comparison of two Aubrieta taxa revealed that Au. canescens subsp. canescens (46 %) and Au. macrostyla (45 %) have an almost identical syntenic gene pair ratio. These novel genome assemblies are the first steps towards the chromosome-level assembly of Au. canescens and understanding the genome diversity within the genus.
Collapse
Affiliation(s)
| | - Zübeyde Uğurlu Aydın
- Molecular Plant Systematic Laboratory (MOBIS), Department of Biology, Faculty of Science, Hacettepe University, Ankara 06800, Turkey
| | - Xu Cai
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Xiaowu Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Ali A Dönmez
- Molecular Plant Systematic Laboratory (MOBIS), Department of Biology, Faculty of Science, Hacettepe University, Ankara 06800, Turkey
| |
Collapse
|
20
|
Telomeres and Their Neighbors. Genes (Basel) 2022; 13:genes13091663. [PMID: 36140830 PMCID: PMC9498494 DOI: 10.3390/genes13091663] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 09/08/2022] [Accepted: 09/09/2022] [Indexed: 11/21/2022] Open
Abstract
Telomeres are essential structures formed from satellite DNA repeats at the ends of chromosomes in most eukaryotes. Satellite DNA repeat sequences are useful markers for karyotyping, but have a more enigmatic role in the eukaryotic cell. Much work has been done to investigate the structure and arrangement of repetitive DNA elements in classical models with implications for species evolution. Still more is needed until there is a complete picture of the biological function of DNA satellite sequences, particularly when considering non-model organisms. Celebrating Gregor Mendel’s anniversary by going to the roots, this review is designed to inspire and aid new research into telomeres and satellites with a particular focus on non-model organisms and accessible experimental and in silico methods that do not require specialized equipment or expensive materials. We describe how to identify telomere (and satellite) repeats giving many examples of published (and some unpublished) data from these techniques to illustrate the principles behind the experiments. We also present advice on how to perform and analyse such experiments, including details of common pitfalls. Our examples are a selection of recent developments and underexplored areas of research from the past. As a nod to Mendel’s early work, we use many examples from plants and insects, especially as much recent work has expanded beyond the human and yeast models traditional in telomere research. We give a general introduction to the accepted knowledge of telomere and satellite systems and include references to specialized reviews for the interested reader.
Collapse
|
21
|
Silva GS, Souza MM, Carvalho Cayres Pamponét V. Identification of 45S rDNA in Passiflora using low coverage sequencing: analysis of GC content and chromosomal localization. Mol Biol Rep 2022; 49:8555-8566. [PMID: 35997851 DOI: 10.1007/s11033-022-07686-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 06/08/2022] [Indexed: 11/28/2022]
Abstract
BACKGROUND The 45S rDNA is considered the most useful chromosomal marker for cytogenetic analysis of Passiflora. Amplification of 45S rDNA sequence via PCR are more advantageous than sequence maintenance in vectors for chromosomal hybridization via FISH. We aimed both to identify 45S rDNA by sequencing data for chromosomal localization and to verify the relationship between GC content and CMA3/DAPI banding. METHODS AND RESULTS Low-coverage sequencing of Passiflora alata, P. cincinnata, and P. edulis was performed, and 45S rDNA units were identified using RepeatExplorer. The 45S rDNA units were used to construct a neighbor-joining tree to verify the similarities between the three species' 18S and 26S rDNA sequences. Clusters (CL)116 (P. alata), CL71 (P. cincinnata), and CL116 (P. edulis) were remarkably similar among the three species, and the 26S rDNA sequences of the clusters were similar to those of Populus tremuloides, Salix interior, and Averrhoa carambola (98% identity). The 26S rDNA was cytologically localized in the chromosomes of P. edulis, P. bahiensis, and the backcrossed hybrid (P. sublanceolata vs. HD13). The hybridization transfer capacity was evaluated in Citrus sunki and Cucumis melo. Finally, a chromosomal pair with a heteromorphic 26S rDNA site was observed in P. edulis, which was the same to that observed for CMA3. CONCLUSION The amplification of the 26S rDNA in Passiflora via PCR and the chromosomal localization in Passiflora and other plant species was successfully achieved. The CMA3 bands were found to be related not only to the amount of GC but also to its structure and the number of repetitions.
Collapse
Affiliation(s)
- Gonçalo Santos Silva
- Laboratório de Melhoramento de Plantas (LAMEP), Departamento de Ciências Biológicas (DCB), Universidade Estadual de Santa Cruz (UESC), Rod. Jorge Amado, Km 16, Salobrinho, Ilhéus, BA, 45662-900, Brasil
| | - Margarete Magalhães Souza
- Laboratório de Melhoramento de Plantas (LAMEP), Departamento de Ciências Biológicas (DCB), Universidade Estadual de Santa Cruz (UESC), Rod. Jorge Amado, Km 16, Salobrinho, Ilhéus, BA, 45662-900, Brasil.
| | - Vanessa Carvalho Cayres Pamponét
- Laboratório de Melhoramento de Plantas (LAMEP), Departamento de Ciências Biológicas (DCB), Universidade Estadual de Santa Cruz (UESC), Rod. Jorge Amado, Km 16, Salobrinho, Ilhéus, BA, 45662-900, Brasil
| |
Collapse
|
22
|
Becher H, Sampson J, Twyford AD. Measuring the Invisible: The Sequences Causal of Genome Size Differences in Eyebrights ( Euphrasia) Revealed by k-mers. FRONTIERS IN PLANT SCIENCE 2022; 13:818410. [PMID: 35968114 PMCID: PMC9372453 DOI: 10.3389/fpls.2022.818410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 06/20/2022] [Indexed: 06/15/2023]
Abstract
Genome size variation within plant taxa is due to presence/absence variation, which may affect low-copy sequences or genomic repeats of various frequency classes. However, identifying the sequences underpinning genome size variation is challenging because genome assemblies commonly contain collapsed representations of repetitive sequences and because genome skimming studies by design miss low-copy number sequences. Here, we take a novel approach based on k-mers, short sub-sequences of equal length k, generated from whole-genome sequencing data of diploid eyebrights (Euphrasia), a group of plants that have considerable genome size variation within a ploidy level. We compare k-mer inventories within and between closely related species, and quantify the contribution of different copy number classes to genome size differences. We further match high-copy number k-mers to specific repeat types as retrieved from the RepeatExplorer2 pipeline. We find genome size differences of up to 230Mbp, equivalent to more than 20% genome size variation. The largest contributions to these differences come from rDNA sequences, a 145-nt genomic satellite and a repeat associated with an Angela transposable element. We also find size differences in the low-copy number class (copy number ≤ 10×) of up to 27 Mbp, possibly indicating differences in gene space between our samples. We demonstrate that it is possible to pinpoint the sequences causing genome size variation within species without the use of a reference genome. Such sequences can serve as targets for future cytogenetic studies. We also show that studies of genome size variation should go beyond repeats if they aim to characterise the full range of genomic variants. To allow future work with other taxonomic groups, we share our k-mer analysis pipeline, which is straightforward to run, relying largely on standard GNU command line tools.
Collapse
Affiliation(s)
- Hannes Becher
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| | - Jacob Sampson
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| | - Alex D. Twyford
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
- Royal Botanic Garden Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
23
|
Samoluk SS, Vaio M, Ortíz AM, Chalup LMI, Robledo G, Bertioli DJ, Seijo G. Comparative repeatome analysis reveals new evidence on genome evolution in wild diploid Arachis (Fabaceae) species. PLANTA 2022; 256:50. [PMID: 35895167 DOI: 10.1007/s00425-022-03961-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 07/12/2022] [Indexed: 06/15/2023]
Abstract
Opposing changes in the abundance of satellite DNA and long terminal repeat (LTR) retroelements are the main contributors to the variation in genome size and heterochromatin amount in Arachis diploids. The South American genus Arachis (Fabaceae) comprises 83 species organized in nine taxonomic sections. Among them, section Arachis is characterized by species with a wide genome and karyotype diversity. Such diversity is determined mainly by the amount and composition of repetitive DNA. Here we performed computational analysis on low coverage genome sequencing to infer the dynamics of changes in major repeat families that led to the differentiation of genomes in diploid species (x = 10) of genus Arachis, focusing on section Arachis. Estimated repeat content ranged from 62.50 to 71.68% of the genomes. Species with different genome composition tended to have different landscapes of repeated sequences. Athila family retrotransposons were the most abundant and variable lineage among Arachis repeatomes, with peaks of transpositional activity inferred at different times in the evolution of the species. Satellite DNAs (satDNAs) were less abundant, but differentially represented among species. High rates of evolution of an AT-rich superfamily of satDNAs led to the differential accumulation of heterochromatin in Arachis genomes. The relationship between genome size variation and the repetitive content is complex. However, largest genomes presented a higher accumulation of LTR elements and lower contents of satDNAs. In contrast, species with lowest genome sizes tended to accumulate satDNAs in detriment of LTR elements. Phylogenetic analysis based on repetitive DNA supported the genome arrangement of section Arachis. Altogether, our results provide the most comprehensive picture on the repeatome dynamics that led to the genome differentiation of Arachis species.
Collapse
Affiliation(s)
- Sergio S Samoluk
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias, Corrientes, Argentina.
| | - Magdalena Vaio
- Laboratory of Plant Genome Evolution and Domestication, Department of Plant Biology, Faculty of Agronomy, University of the Republic, Montevideo, Uruguay
| | - Alejandra M Ortíz
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias, Corrientes, Argentina
| | - Laura M I Chalup
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias, Corrientes, Argentina
| | - Germán Robledo
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias, Corrientes, Argentina
- Facultad de Ciencias Exactas y Naturales y Agrimensura, Universidad Nacional del Nordeste, Corrientes, Argentina
| | - David J Bertioli
- Center for Applied Genetic Technologies, University of Georgia, Athens, GA, USA
| | - Guillermo Seijo
- Instituto de Botánica del Nordeste (UNNE-CONICET), Facultad de Ciencias Agrarias, Corrientes, Argentina
- Facultad de Ciencias Exactas y Naturales y Agrimensura, Universidad Nacional del Nordeste, Corrientes, Argentina
| |
Collapse
|
24
|
Cao DL, Zhang XJ, Xie SQ, Fan SJ, Qu XJ. Application of chloroplast genome in the identification of Traditional Chinese Medicine Viola philippica. BMC Genomics 2022; 23:540. [PMID: 35896957 PMCID: PMC9327190 DOI: 10.1186/s12864-022-08727-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 06/29/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Viola philippica Cav. is the only source plant of "Zi Hua Di Ding", which is a Traditional Chinese Medicine (TCM) that is utilized as an antifebrile and detoxicant agent for the treatment of acute pyogenic infections. Historically, many Viola species with violet flowers have been misused in "Zi Hua Di Ding". Viola have been recognized as a taxonomically difficult genera due to their highly similar morphological characteristics. Here, all common V. philippica adulterants were sampled. A total of 24 complete chloroplast (cp) genomes were analyzed, among these 5 cp genome sequences were downloaded from GenBank and 19 cp genomes, including 2 "Zi Hua Di Ding" purchased from a local TCM pharmacy, were newly sequenced. RESULTS The Viola cp genomes ranged from 156,483 bp to 158,940 bp in length. A total of 110 unique genes were annotated, including 76 protein-coding genes, 30 tRNAs, and four rRNAs. Sequence divergence analysis screening identified 16 highly diverged sequences; these could be used as markers for the identification of Viola species. The morphological, maximum likelihood and Bayesian inference trees of whole cp genome sequences and highly diverged sequences were divided into five monophyletic clades. The species in each of the five clades were identical in their positions within the morphological and cp genome tree. The shared morphological characters belonging to each clade was summarized. Interestingly, unique variable sites were found in ndhF, rpl22, and ycf1 of V. philippica, and these sites can be selected to distinguish V. philippica from samples all other Viola species, including its most closely related species. In addition, important morphological characteristics were proposed to assist the identification of V. philippica. We applied these methods to examine 2 "Zi Hua Di Ding" randomly purchased from the local TCM pharmacy, and this analysis revealed that the morphological and molecular characteristics were valid for the identification of V. philippica. CONCLUSIONS This study provides invaluable data for the improvement of species identification and germplasm of V. philippica that may facilitate the application of a super-barcode in TCM identification and enable future studies on phylogenetic evolution and safe medical applications.
Collapse
Affiliation(s)
- Dong-Ling Cao
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, 250014, China
| | - Xue-Jie Zhang
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, 250014, China
| | - Shao-Qiu Xie
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, 250014, China
| | - Shou-Jin Fan
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, 250014, China.
| | - Xiao-Jian Qu
- Shandong Provincial Key Laboratory of Plant Stress Research, College of Life Sciences, Shandong Normal University, Ji'nan, 250014, China.
| |
Collapse
|
25
|
Moreno-Aguilar MF, Inda LA, Sánchez-Rodríguez A, Arnelas I, Catalán P. Evolutionary Dynamics of the Repeatome Explains Contrasting Differences in Genome Sizes and Hybrid and Polyploid Origins of Grass Loliinae Lineages. FRONTIERS IN PLANT SCIENCE 2022; 13:901733. [PMID: 35845705 PMCID: PMC9284676 DOI: 10.3389/fpls.2022.901733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/25/2022] [Indexed: 06/15/2023]
Abstract
The repeatome is composed of diverse families of repetitive DNA that keep signatures on the historical events that shaped the evolution of their hosting species. The cold seasonal Loliinae subtribe includes worldwide distributed taxa, some of which are the most important forage and lawn species (fescues and ray-grasses). The Loliinae are prone to hybridization and polyploidization. It has been observed a striking two-fold difference in genome size between the broad-leaved (BL) and fine-leaved (FL) Loliinae diploids and a general trend of genome reduction of some high polyploids. We have used genome skimming data to uncover the composition, abundance, and potential phylogenetic signal of repetitive elements across 47 representatives of the main Loliinae lineages. Independent and comparative analyses of repetitive sequences and of 5S rDNA loci were performed for all taxa under study and for four evolutionary Loliinae groups [Loliinae, Broad-leaved (BL), Fine-leaved (FL), and Schedonorus lineages]. Our data showed that the proportion of the genome covered by the repeatome in the Loliinae species was relatively high (average ∼ 51.8%), ranging from high percentages in some diploids (68.7%) to low percentages in some high-polyploids (30.7%), and that changes in their genome sizes were likely caused by gains or losses in their repeat elements. Ty3-gypsy Retand and Ty1-copia Angela retrotransposons were the most frequent repeat families in the Loliinae although the relatively more conservative Angela repeats presented the highest correlation of repeat content with genome size variation and the highest phylogenetic signal of the whole repeatome. By contrast, Athila retrotransposons presented evidence of recent proliferations almost exclusively in the Lolium clade. The repeatome evolutionary networks showed an overall topological congruence with the nuclear 35S rDNA phylogeny and a geographic-based structure for some lineages. The evolution of the Loliinae repeatome suggests a plausible scenario of recurrent allopolyploidizations followed by diploidizations that generated the large genome sizes of BL diploids as well as large genomic rearrangements in highly hybridogenous lineages that caused massive repeatome and genome contractions in the Schedonorus and Aulaxyper polyploids. Our study has contributed to disentangling the impact of the repeatome dynamics on the genome diversification and evolution of the Loliinae grasses.
Collapse
Affiliation(s)
| | - Luis A. Inda
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Instituto Agroalimentario de Aragón, Universidad de Zaragoza, Centro de Investigación y Tecnología Agroalimentaria, Zaragoza, Spain
| | - Aminael Sánchez-Rodríguez
- Departamento de Ciencias Biológicas y Agropecuarias, Universidad Técnica Particular de Loja, Loja, Ecuador
| | - Itziar Arnelas
- Departamento de Ciencias Biológicas y Agropecuarias, Universidad Técnica Particular de Loja, Loja, Ecuador
| | - Pilar Catalán
- Escuela Politécnica Superior de Huesca, Universidad de Zaragoza, Huesca, Spain
- Grupo de Bioquímica, Biofísica y Biología Computacional, Instituto de Biocomputación y Física de Sistemas Complejos, Universidad de Zaragoza, Unidad Asociada al CSIC, Zaragoza, Spain
| |
Collapse
|
26
|
Fernández P, Hidalgo O, Juan A, Leitch IJ, Leitch AR, Palazzesi L, Pegoraro L, Viruel J, Pellicer J. Genome Insights into Autopolyploid Evolution: A Case Study in Senecio doronicum (Asteraceae) from the Southern Alps. PLANTS 2022; 11:plants11091235. [PMID: 35567236 PMCID: PMC9099586 DOI: 10.3390/plants11091235] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Revised: 04/27/2022] [Accepted: 04/29/2022] [Indexed: 11/16/2022]
Abstract
Polyploidy is a widespread phenomenon across angiosperms, and one of the main drivers of diversification. Whilst it frequently involves hybridisation, autopolyploidy is also an important feature of plant evolution. Minority cytotypes are frequently overlooked due to their lower frequency in populations, but the development of techniques such as flow cytometry, which enable the rapid screening of cytotype diversity across large numbers of individuals, is now providing a more comprehensive understanding of cytotype diversity within species. Senecio doronicum is a relatively common daisy found throughout European mountain grasslands from subalpine to almost nival elevations. We have carried out a population-level cytotype screening of 500 individuals from Tête Grosse (Alpes-de-Haute-Provence, France), confirming the coexistence of tetraploid (28.2%) and octoploid cytotypes (71.2%), but also uncovering a small number of hexaploid individuals (0.6%). The analysis of repetitive elements from short-read genome-skimming data combined with nuclear (ITS) and whole plastid DNA sequences support an autopolyploid origin of the polyploid S. doronicum individuals and provide molecular evidence regarding the sole contribution of tetraploids in the formation of hexaploid individuals. The evolutionary impact and resilience of the new cytotype have yet to be determined, although the coexistence of different cytotypes may indicate nascent speciation.
Collapse
Affiliation(s)
- Pol Fernández
- Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Passeig del Migdia s.n., Parc de Montjuïc, 08038 Barcelona, Spain;
- Correspondence: (P.F.); (J.P.); Tel.: +34-932890611 (P.F. & J.P.)
| | - Oriane Hidalgo
- Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Passeig del Migdia s.n., Parc de Montjuïc, 08038 Barcelona, Spain;
- Royal Botanic Gardens, Kew, Kew Green, Richmond TW9 3AE, UK; (I.J.L.); (J.V.)
| | - Ana Juan
- Departamento de Ciencias Ambientales y Recursos Naturales, Universidad de Alicante, 03080 Alicante, Spain;
| | - Ilia J. Leitch
- Royal Botanic Gardens, Kew, Kew Green, Richmond TW9 3AE, UK; (I.J.L.); (J.V.)
| | - Andrew R. Leitch
- School of Biological and Chemical Sciences, Queen Mary University of London, London E1 4NS, UK;
| | - Luis Palazzesi
- Museo Argentino de Ciencias Naturales, CONICET, División Paleobotánica, Buenos Aires C1405DJR, Argentina;
| | - Luca Pegoraro
- Biodiversity and Conservation Biology Research Unit, Swiss Federal Institute for Forest, Snow and Landscape Research WSL, 8903 Bimensdorf, Switzerland;
| | - Juan Viruel
- Royal Botanic Gardens, Kew, Kew Green, Richmond TW9 3AE, UK; (I.J.L.); (J.V.)
| | - Jaume Pellicer
- Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Passeig del Migdia s.n., Parc de Montjuïc, 08038 Barcelona, Spain;
- Royal Botanic Gardens, Kew, Kew Green, Richmond TW9 3AE, UK; (I.J.L.); (J.V.)
- Correspondence: (P.F.); (J.P.); Tel.: +34-932890611 (P.F. & J.P.)
| |
Collapse
|
27
|
Mascagni F, Barghini E, Ceccarelli M, Baldoni L, Trapero C, Díez CM, Natali L, Cavallini A, Giordani T. The Singular Evolution of Olea Genome Structure. FRONTIERS IN PLANT SCIENCE 2022; 13:869048. [PMID: 35432417 PMCID: PMC9009077 DOI: 10.3389/fpls.2022.869048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 03/07/2022] [Indexed: 06/14/2023]
Abstract
The current view of plant genome evolution proposes that genome size has mainly been determined by polyploidisation and amplification/loss of transposons, with a minor role played by other repeated sequences, such as tandem repeats. In cultivated olive (Olea europaea subsp. europaea var. europaea), available data suggest a singular model of genome evolution, in which a massive expansion of tandem-repeated sequences accompanied changes in nuclear architecture. This peculiar scenario highlights the importance of focusing on Olea genus evolution, to shed light on mechanisms that led to its present genomic structure. Next-generation sequencing technologies, bioinformatics and in situ hybridisation were applied to study the genomic structure of five related Olea taxa, which originated at different times from their last common ancestor. On average, repetitive DNA in the Olea taxa ranged from ~59% to ~73% of the total genome, showing remarkable differences in terms of composition. Among repeats, we identified 11 major families of tandem repeats, with different abundances in the analysed taxa, five of which were novel discoveries. Interestingly, overall tandem repeat abundance was inversely correlated to that of retrotransposons. This trend might imply a competition in the proliferation of these repeat classes. Indeed, O. paniculata, the species closest to the Olea common ancestor, showed very few tandem-repeated sequences, while it was rich in long terminal repeat retrotransposons, suggesting that the amplification of tandem repeats occurred after its divergence from the Olea ancestor. Furthermore, some tandem repeats were physically localised in closely related O. europaea subspecies (i.e., cultivated olive and O. europaea subsp. cuspidata), which showed a significant difference in tandem repeats abundance. For 4 tandem repeats families, a similar number of hybridisation signals were observed in both subspecies, apparently indicating that, after their dissemination throughout the olive genome, these tandem repeats families differentially amplified maintaining the same positions in each genome. Overall, our research identified the temporal dynamics shaping genome structure during Olea speciation, which represented a singular model of genome evolution in higher plants.
Collapse
Affiliation(s)
- Flavia Mascagni
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| | - Elena Barghini
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| | - Marilena Ceccarelli
- Department of Chemistry, Biology and Biotechnology, University of Perugia, Perugia, Italy
| | - Luciana Baldoni
- CNR, Institute of Biosciences and BioResources, Perugia, Italy
| | - Carlos Trapero
- CSIRO Agriculture & Food, Narrabri, NSW, Australia
- Agronomy Department, University of Cordoba, Cordoba, Spain
| | | | - Lucia Natali
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| | - Andrea Cavallini
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| | - Tommaso Giordani
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| |
Collapse
|
28
|
de Lima Ferreira P, Batista R, Andermann T, Groppo M, Bacon CD, Antonelli A. Target sequence capture of Barnadesioideae (Compositae) demonstrates the utility of low coverage loci in phylogenomic analyses. Mol Phylogenet Evol 2022; 169:107432. [DOI: 10.1016/j.ympev.2022.107432] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 12/21/2021] [Accepted: 01/14/2022] [Indexed: 11/26/2022]
|
29
|
Guo XX, Qu XJ, Zhang XJ, Fan SJ. Comparative and Phylogenetic Analysis of Complete Plastomes among Aristidoideae Species (Poaceae). BIOLOGY 2022; 11:biology11010063. [PMID: 35053061 PMCID: PMC8773369 DOI: 10.3390/biology11010063] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 12/26/2021] [Accepted: 12/29/2021] [Indexed: 11/16/2022]
Abstract
Aristidoideae is a subfamily in the PACMAD clade of family Poaceae, including three genera, Aristida, Stipagrostis, and Sartidia. In this study, the plastomes of Aristida adscensionis and Stipagrostis pennata were newly sequenced, and a total of 16 Aristidoideae plastomes were compared. All plastomes were conservative in genome size, gene number, structure, and IR boundary. Repeat sequence analysis showed that forward and palindrome repeats were the most common repeat types. The number of SSRs ranged from 30 (Sartidia isaloensis) to 54 (Aristida purpurea). Codon usage analysis showed that plastome genes preferred to use codons ending with A/T. A total of 12 highly variable regions were screened, including four protein coding sequences (matK, ndhF, infA, and rpl32) and eight non-coding sequences (rpl16-1-rpl16-2, ccsA-ndhD, trnY-GUA-trnD-GUC, ndhF-rpl32, petN-trnC-GCA, trnT-GGU-trnE-UUC, trnG-GCC-trnfM-CAU, and rpl32-trnL-UAG). Furthermore, the phylogenetic position of this subfamily and their intergeneric relationships need to be illuminated. All Maximum Likelihood and Bayesian Inference trees strongly support the monophyly of Aristidoideae and each of three genera, and the clade of Aristidoideae and Panicoideae was a sister to other subfamilies in the PACMAD clade. Within Aristidoideae, Aristida is a sister to the clade composed of Stipagrostis and Sartidia. The divergence between C4 Stipagrostis and C3 Sartidia was estimated at 11.04 Ma, which may be associated with the drought event in the Miocene period. Finally, the differences in carbon fixation patterns, geographical distributions, and ploidy may be related to the difference of species numbers among these three genera. This study provides insights into the phylogeny and evolution of the subfamily Aristidoideae.
Collapse
Affiliation(s)
| | | | - Xue-Jie Zhang
- Correspondence: (X.-J.Z.); (S.-J.F.); Tel.: +86-531-86180718 (S.-J.F.)
| | - Shou-Jin Fan
- Correspondence: (X.-J.Z.); (S.-J.F.); Tel.: +86-531-86180718 (S.-J.F.)
| |
Collapse
|
30
|
Affiliation(s)
| | - Francisco J. Ruiz-Ruano
- Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- School of Biological Sciences, Norwich Research Park University of East Anglia, Norwich, UK
| |
Collapse
|
31
|
Herklotz V, Kovařík A, Wissemann V, Lunerová J, Vozárová R, Buschmann S, Olbricht K, Groth M, Ritz CM. Power and Weakness of Repetition - Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization ( Rosa and Fragaria). FRONTIERS IN PLANT SCIENCE 2021; 12:738119. [PMID: 34950159 PMCID: PMC8688825 DOI: 10.3389/fpls.2021.738119] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 11/08/2021] [Indexed: 06/14/2023]
Abstract
Plant genomes consist, to a considerable extent, of non-coding repetitive DNA. Several studies showed that phylogenetic signals can be extracted from such repeatome data by using among-species dissimilarities from the RepeatExplorer2 pipeline as distance measures. Here, we advanced this approach by adjusting the read input for comparative clustering indirectly proportional to genome size and by summarizing all clusters into a main distance matrix subjected to Neighbor Joining algorithms and Principal Coordinate Analyses. Thus, our multivariate statistical method works as a "repeatomic fingerprint," and we proved its power and limitations by exemplarily applying it to the family Rosaceae at intrafamilial and, in the genera Fragaria and Rosa, at the intrageneric level. Since both taxa are prone to hybridization events, we wanted to show whether repeatome data are suitable to unravel the origin of natural and synthetic hybrids. In addition, we compared the results based on complete repeatomes with those from ribosomal DNA clusters only, because they represent one of the most widely used barcoding markers. Our results demonstrated that repeatome data contained a clear phylogenetic signal supporting the current subfamilial classification within Rosaceae. Accordingly, the well-accepted major evolutionary lineages within Fragaria were distinguished, and hybrids showed intermediate positions between parental species in data sets retrieved from both complete repeatomes and rDNA clusters. Within the taxonomically more complicated and particularly frequently hybridizing genus Rosa, we detected rather weak phylogenetic signals but surprisingly found a geographic pattern at a population scale. In sum, our method revealed promising results at larger taxonomic scales as well as within taxa with manageable levels of reticulation, but success remained rather taxon specific. Since repeatomes can be technically easy and comparably inexpensively retrieved even from samples of rather poor DNA quality, our phylogenomic method serves as a valuable alternative when high-quality genomes are unavailable, for example, in the case of old museum specimens.
Collapse
Affiliation(s)
- Veit Herklotz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
| | - Aleš Kovařík
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
| | - Volker Wissemann
- Institute of Botany, Systematic Botany Group, Justus-Liebig-University, Gießen, Germany
| | - Jana Lunerová
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
| | - Radka Vozárová
- Department of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Sebastian Buschmann
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | | | - Marco Groth
- CF DNA Sequencing, Leibniz Institute on Aging – Fritz Lipmann Institute, Jena, Germany
| | - Christiane M. Ritz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
- Chair of Biodiversity of Higher Plants, Technische Universität Dresden, International Institute (IHI) Zittau, Zittau, Germany
| |
Collapse
|
32
|
Breman FC, Chen G, Snijder RC, Schranz ME, Bakker FT. Repeatome-Based Phylogenetics in Pelargonium Section Ciconium (Sweet) Harvey. Genome Biol Evol 2021; 13:6454096. [PMID: 34893846 PMCID: PMC8684485 DOI: 10.1093/gbe/evab269] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 12/23/2022] Open
Abstract
The repetitive part of the genome (the repeatome) contains a wealth of often overlooked information that can be used to resolve phylogenetic relationships and test evolutionary hypotheses for clades of related plant species such as Pelargonium. We have generated genome skimming data for 18 accessions of Pelargonium section Ciconium and one outgroup. We analyzed repeat abundancy and repeat similarity in order to construct repeat profiles and then used these for phylogenetic analyses. We found that phylogenetic trees based on read similarity were largely congruent with previous work based on morphological and chloroplast sequence data. For example, results agreed in identifying a “Core Ciconium” group which evolved after the split with P. elongatum. We found that this group was characterized by a unique set of repeats, which confirmed currently accepted phylogenetic hypotheses. We also found four species groups within P. sect. Ciconium that reinforce previous plastome-based reconstructions. A second repeat expansion was identified in a subclade which contained species that are considered to have dispersed from Southern Africa into Eastern Africa and the Arabian Peninsula. We speculate that the Core Ciconium repeat set correlates with a possible WGD event leading to this branch.
Collapse
Affiliation(s)
- Floris C Breman
- Biosystematics Group, Wageningen University & Research, Netherlands
| | - Guangnan Chen
- Biosystematics Group, Wageningen University & Research, Netherlands
| | | | - M Eric Schranz
- Biosystematics Group, Wageningen University & Research, Netherlands
| | - Freek T Bakker
- Biosystematics Group, Wageningen University & Research, Netherlands
| |
Collapse
|
33
|
Comparative Genomics Analysis of Repetitive Elements in Ten Gymnosperm Species: "Dark Repeatome" and Its Abundance in Conifer and Gnetum Species. Life (Basel) 2021; 11:life11111234. [PMID: 34833110 PMCID: PMC8620675 DOI: 10.3390/life11111234] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Revised: 11/09/2021] [Accepted: 11/09/2021] [Indexed: 11/16/2022] Open
Abstract
Repetitive elements (RE) and transposons (TE) can comprise up to 80% of some plant genomes and may be essential for regulating their evolution and adaptation. The “repeatome” information is often unavailable in assembled genomes because genomic areas of repeats are challenging to assemble and are often missing from final assembly. However, raw genomic sequencing data contain rich information about RE/TEs. Here, raw genomic NGS reads of 10 gymnosperm species were studied for the content and abundance patterns of their “repeatome”. We utilized a combination of alignment on databases of repetitive elements and de novo assembly of highly repetitive sequences from genomic sequencing reads to characterize and calculate the abundance of known and putative repetitive elements in the genomes of 10 conifer plants: Pinus taeda, Pinus sylvestris, Pinus sibirica, Picea glauca, Picea abies, Abies sibirica, Larix sibirica, Juniperus communis, Taxus baccata, and Gnetum gnemon. We found that genome abundances of known and newly discovered putative repeats are specific to phylogenetically close groups of species and match biological taxa. The grouping of species based on abundances of known repeats closely matches the grouping based on abundances of newly discovered putative repeats (kChains) and matches the known taxonomic relations.
Collapse
|
34
|
Costa L, Marques A, Buddenhagen C, Thomas WW, Huettel B, Schubert V, Dodsworth S, Houben A, Souza G, Pedrosa-Harand A. Aiming off the target: recycling target capture sequencing reads for investigating repetitive DNA. ANNALS OF BOTANY 2021; 128:835-848. [PMID: 34050647 PMCID: PMC8577205 DOI: 10.1093/aob/mcab063] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/26/2021] [Indexed: 05/28/2023]
Abstract
BACKGROUND AND AIMS With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data. METHODS Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree. KEY RESULTS All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. CONCLUSIONS Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
Collapse
Affiliation(s)
- Lucas Costa
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| | - André Marques
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | | | - Bruno Huettel
- Max Planck Genome Centre Cologne, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | | | - Andreas Houben
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil
| |
Collapse
|
35
|
Oliveira MAS, Nunes T, Dos Santos MA, Ferreira Gomes D, Costa I, Van-Lume B, Marques Da Silva SS, Oliveira RS, Simon MF, Lima GSA, Gissi DS, Almeida CCDS, Souza G, Marques A. High-Throughput Genomic Data Reveal Complex Phylogenetic Relationships in Stylosanthes Sw (Leguminosae). Front Genet 2021; 12:727314. [PMID: 34630521 PMCID: PMC8495327 DOI: 10.3389/fgene.2021.727314] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 09/08/2021] [Indexed: 11/22/2022] Open
Abstract
Allopolyploidy is widely present across plant lineages. Though estimating the correct phylogenetic relationships and origin of allopolyploids may sometimes become a hard task. In the genus Stylosanthes Sw. (Leguminosae), an important legume crop, allopolyploidy is a key speciation force. This makes difficult adequate species recognition and breeding efforts on the genus. Based on comparative analysis of nine high-throughput sequencing (HTS) samples, including three allopolyploids (S. capitata Vogel cv. “Campo Grande,” S. capitata “RS024” and S. scabra Vogel) and six diploids (S. hamata Taub, S. viscosa (L.) Sw., S. macrocephala M. B. Ferreira and Sousa Costa, S. guianensis (Aubl.) Sw., S. pilosa M. B. Ferreira and Sousa Costa and S. seabrana B. L. Maass & 't Mannetje) we provide a working pipeline to identify organelle and nuclear genome signatures that allowed us to trace the origin and parental genome recognition of allopolyploids. First, organelle genomes were de novo assembled and used to identify maternal genome donors by alignment-based phylogenies and synteny analysis. Second, nuclear-derived reads were subjected to repetitive DNA identification with RepeatExplorer2. Identified repeats were compared based on abundance and presence on diploids in relation to allopolyploids by comparative repeat analysis. Third, reads were extracted and grouped based on the following groups: chloroplast, mitochondrial, satellite DNA, ribosomal DNA, repeat clustered- and total genomic reads. These sets of reads were then subjected to alignment and assembly free phylogenetic analyses and were compared to classical alignment-based phylogenetic methods. Comparative analysis of shared and unique satellite repeats also allowed the tracing of allopolyploid origin in Stylosanthes, especially those with high abundance such as the StyloSat1 in the Scabra complex. This satellite was in situ mapped in the proximal region of the chromosomes and made it possible to identify its previously proposed parents. Hence, with simple genome skimming data we were able to provide evidence for the recognition of parental genomes and understand genome evolution of two Stylosanthes allopolyploids.
Collapse
Affiliation(s)
| | - Tomáz Nunes
- Laboratory of Genetic Resources, Federal University of Alagoas, Arapiraca, Brazil
| | | | | | - Iara Costa
- Laboratory of Genetic Resources, Federal University of Alagoas, Arapiraca, Brazil
| | - Brena Van-Lume
- Laboratory of Plant Cytogenetics and Evolution, Federal University of Pernambuco, Recife, Brazil
| | | | - Ronaldo Simão Oliveira
- Campus Xique Xique, Federal Institute of Education, Science and Technology of Bahia, Xique-Xique, Brazil
| | | | - Gaus S A Lima
- Center of Agronomic Sciences, Federal University of Alagoas, Rio Largo, Brazil
| | - Danilo Soares Gissi
- Department of Biostatistics, Institute of Biosciences-IBB, Plant Biology, Parasitology and Zoology, São Paulo State University-UNESP, Botucatu, Brazil
| | | | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Federal University of Pernambuco, Recife, Brazil
| | - André Marques
- Laboratory of Genetic Resources, Federal University of Alagoas, Arapiraca, Brazil.,Department of Chromosome Biology, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| |
Collapse
|
36
|
Chalopin D, Clark LG, Wysocki WP, Park M, Duvall MR, Bennetzen JL. Integrated Genomic Analyses From Low-Depth Sequencing Help Resolve Phylogenetic Incongruence in the Bamboos (Poaceae: Bambusoideae). FRONTIERS IN PLANT SCIENCE 2021; 12:725728. [PMID: 34567039 PMCID: PMC8456298 DOI: 10.3389/fpls.2021.725728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 08/13/2021] [Indexed: 06/13/2023]
Abstract
The bamboos (Bambusoideae, Poaceae) comprise a major grass lineage with a complex evolutionary history involving ancient hybridization and allopolyploidy. About 1700 described species are classified into three tribes, Olyreae (herbaceous bamboos), Bambuseae (tropical woody bamboos), and Arundinarieae (temperate woody bamboos). Nuclear analyses strongly support monophyly of the woody tribes, whereas plastome analyses strongly support paraphyly, with Bambuseae sister to Olyreae. Our objectives were to clarify the origin(s) of the woody bamboo tribes and resolve the nuclear vs. plastid conflict using genomic tools. For the first time, plastid and nuclear genomic information from the same bamboo species were combined in a single study. We sampled 51 species of bamboos representing the three tribes, estimated their genome sizes and generated low-depth sample sequence data, from which plastomes were assembled and nuclear repeats were analyzed. The distribution of repeat families was found to agree with nuclear gene phylogenies, but also provides novel insights into nuclear evolutionary history. We infer two early, independent hybridization events, one between an Olyreae ancestor and a woody ancestor giving rise to the two Bambuseae lineages, and another between two woody ancestors giving rise to the Arundinarieae. Retention of the Olyreae plastome associated with differential dominance of nuclear genomes and subsequent diploidization in some lineages explains the paraphyly observed in plastome phylogenetic estimations. We confirm ancient hybridization and allopolyploidy in the origins of the extant woody bamboo lineages and propose biased fractionation and diploidization as important factors in their evolution.
Collapse
Affiliation(s)
- Domitille Chalopin
- Department of Genetics, University of Georgia, Athens, GA, United States
| | - Lynn G. Clark
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, United States
| | - William P. Wysocki
- Center for Translational Data Science, University of Chicago, Chicago, IL, United States
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, United States
| | - Melvin R. Duvall
- Department of Biology and Institute for the Study of the Environment, Sustainability, and Energy, Northern Illinois University, DeKalb, IL, United States
| | | |
Collapse
|
37
|
Ebrahimzadegan R, Orooji F, Ma P, Mirzaghaderi G. Differentially Amplified Repetitive Sequences Among Aegilops tauschii Subspecies and Genotypes. FRONTIERS IN PLANT SCIENCE 2021; 12:716750. [PMID: 34490015 PMCID: PMC8417419 DOI: 10.3389/fpls.2021.716750] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Accepted: 07/27/2021] [Indexed: 06/13/2023]
Abstract
Genomic repetitive sequences commonly show species-specific sequence type, abundance, and distribution patterns, however, their intraspecific characteristics have been poorly described. We quantified the genomic repetitive sequences and performed single nucleotide polymorphism (SNP) analysis between 29 Ae. tauschii genotypes and subspecies using publicly available raw genomic Illumina sequence reads and used fluorescence in situ hybridization (FISH) to experimentally analyze some repeats. The majority of the identified repetitive sequences had similar contents and proportions between anathera, meyeri, and strangulata subspecies. However, two Ty3/gypsy retrotransposons (CL62 and CL87) showed significantly higher abundances, and CL1, CL119, CL213, CL217 tandem repeats, and CL142 retrotransposon (Ty1/copia type) showed significantly lower abundances in subspecies strangulata compared with the subspecies anathera and meyeri. One tandem repeat and 45S ribosomal DNA (45S rDNA) abundances showed a high variation between genotypes but their abundances were not subspecies specific. Phylogenetic analysis using the repeat abundances of the aforementioned clusters placed the strangulata subsp. in a distinct clade but could not discriminate anathera and meyeri. A near complete differentiation of anathera and strangulata subspecies was observed using SNP analysis; however, var. meyeri showed higher genetic diversity. FISH using major tandem repeats couldn't detect differences between subspecies, although (GAA)10 signal patterns generated two different karyotype groups. Taken together, the different classes of repetitive DNA sequences have differentially accumulated between strangulata and the other two subspecies of Ae. tauschii that is generally in agreement with spike morphology, implying that factors affecting repeatome evolution are variable even among highly closely related lineages.
Collapse
Affiliation(s)
- Rahman Ebrahimzadegan
- Department of Agronomy and Plant Breeding, Faculty of Agriculture, University of Kurdistan, Sanandaj, Iran
| | - Fatemeh Orooji
- Department of Agronomy and Plant Breeding, Faculty of Agriculture, University of Kurdistan, Sanandaj, Iran
| | - Pengtao Ma
- College of Life Sciences, Yantai University, Yantai, China
| | - Ghader Mirzaghaderi
- Department of Agronomy and Plant Breeding, Faculty of Agriculture, University of Kurdistan, Sanandaj, Iran
| |
Collapse
|
38
|
Patil AB, Vijay N. Repetitive genomic regions and the inference of demographic history. Heredity (Edinb) 2021; 127:151-166. [PMID: 34002046 PMCID: PMC8322061 DOI: 10.1038/s41437-021-00443-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Revised: 04/16/2021] [Accepted: 04/17/2021] [Indexed: 02/03/2023] Open
Abstract
Inference of demographic histories using whole-genome datasets has provided insights into diversification, adaptation, hybridization, and plant-pathogen interactions, and stimulated debate on the impact of anthropogenic interventions and past climate on species demography. However, the impact of repetitive genomic regions on these inferences has mostly been ignored by masking of repeats. We use the Populus trichocarpa genome (Pop_tri_v3) to show that masking of repeat regions leads to lower estimates of effective population size (Ne) in the distant past in contrast to an increase in Ne estimates in recent times. However, in human datasets, masking of repeats resulted in lower estimates of Ne at all time points. We demonstrate that repeats affect demographic inferences using diverse methods like PSMC, MSMC, SMC++, and the Stairway plot. Our genomic analysis revealed that the biases in Ne estimates were dependent on the repeat class type and its abundance in each atomic interval. Notably, we observed a weak, yet consistently significant negative correlation between the repeat abundance of an atomic interval and the Ne estimates for that interval, which potentially reflects the recombination rate variation within the genome. The rationale for the masking of repeats has been that variants identified within these regions are erroneous. We find that polymorphisms in some repeat classes occur in callable regions and reflect reliable coalescence histories (e.g., LTR Gypsy, LTR Copia). The current demography inference methods do not handle repeats explicitly, and hence the effect of individual repeat classes needs careful consideration in comparative analysis. Deciphering the repeat demographic histories might provide a clear understanding of the processes involved in repeat accumulation.
Collapse
Affiliation(s)
- Ajinkya Bharatraj Patil
- Computational Evolutionary Genomics Lab, Department of Biological Sciences, IISER Bhopal, Bhauri, Madhya Pradesh, India
| | - Nagarjun Vijay
- Computational Evolutionary Genomics Lab, Department of Biological Sciences, IISER Bhopal, Bhauri, Madhya Pradesh, India.
| |
Collapse
|
39
|
Beric A, Mabry ME, Harkess AE, Brose J, Schranz ME, Conant GC, Edger PP, Meyers BC, Pires JC. Comparative phylogenetics of repetitive elements in a diverse order of flowering plants (Brassicales). G3 (BETHESDA, MD.) 2021; 11:jkab140. [PMID: 33993297 PMCID: PMC8495927 DOI: 10.1093/g3journal/jkab140] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 04/10/2021] [Indexed: 11/14/2022]
Abstract
Genome sizes of plants have long piqued the interest of researchers due to the vast differences among organisms. However, the mechanisms that drive size differences have yet to be fully understood. Two important contributing factors to genome size are expansions of repetitive elements, such as transposable elements (TEs), and whole-genome duplications (WGD). Although studies have found correlations between genome size and both TE abundance and polyploidy, these studies typically test for these patterns within a genus or species. The plant order Brassicales provides an excellent system to further test if genome size evolution patterns are consistent across larger time scales, as there are numerous WGDs. This order is also home to one of the smallest plant genomes, Arabidopsis thaliana-chosen as the model plant system for this reason-as well as to species with very large genomes. With new methods that allow for TE characterization from low-coverage genome shotgun data and 71 taxa across the Brassicales, we confirm the correlation between genome size and TE content, however, we are unable to reconstruct phylogenetic relationships and do not detect any shift in TE abundance associated with WGD.
Collapse
Affiliation(s)
- Aleksandra Beric
- Donald Danforth Plant Science Center, St. Louis, MO 63132, USA
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Makenzie E Mabry
- Division of Biological Sciences and Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Alex E Harkess
- Department of Crop, Soil, and Environmental Sciences, Auburn University, Auburn, AL 36849, USA
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Julia Brose
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - M Eric Schranz
- Biosystematics Group, Wageningen University, Wageningen 6700 AA, The Netherlands
| | - Gavin C Conant
- Bioinformatics Research Center, Program in Genetics and Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA
- Department of Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Blake C Meyers
- Donald Danforth Plant Science Center, St. Louis, MO 63132, USA
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
| | - J Chris Pires
- Division of Biological Sciences and Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
40
|
Heitkam T, Schulte L, Weber B, Liedtke S, Breitenbach S, Kögler A, Morgenstern K, Brückner M, Tröber U, Wolf H, Krabel D, Schmidt T. Comparative Repeat Profiling of Two Closely Related Conifers ( Larix decidua and Larix kaempferi) Reveals High Genome Similarity With Only Few Fast-Evolving Satellite DNAs. Front Genet 2021; 12:683668. [PMID: 34322154 PMCID: PMC8312256 DOI: 10.3389/fgene.2021.683668] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 05/25/2021] [Indexed: 12/26/2022] Open
Abstract
In eukaryotic genomes, cycles of repeat expansion and removal lead to large-scale genomic changes and propel organisms forward in evolution. However, in conifers, active repeat removal is thought to be limited, leading to expansions of their genomes, mostly exceeding 10 giga base pairs. As a result, conifer genomes are largely littered with fragmented and decayed repeats. Here, we aim to investigate how the repeat landscapes of two related conifers have diverged, given the conifers' accumulative genome evolution mode. For this, we applied low-coverage sequencing and read clustering to the genomes of European and Japanese larch, Larix decidua (Lamb.) Carrière and Larix kaempferi (Mill.), that arose from a common ancestor, but are now geographically isolated. We found that both Larix species harbored largely similar repeat landscapes, especially regarding the transposable element content. To pin down possible genomic changes, we focused on the repeat class with the fastest sequence turnover: satellite DNAs (satDNAs). Using comparative bioinformatics, Southern, and fluorescent in situ hybridization, we reveal the satDNAs' organizational patterns, their abundances, and chromosomal locations. Four out of the five identified satDNAs are widespread in the Larix genus, with two even present in the more distantly related Pseudotsuga and Abies genera. Unexpectedly, the EulaSat3 family was restricted to L. decidua and absent from L. kaempferi, indicating its evolutionarily young age. Taken together, our results exemplify how the accumulative genome evolution of conifers may limit the overall divergence of repeats after speciation, producing only few repeat-induced genomic novelties.
Collapse
Affiliation(s)
- Tony Heitkam
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Luise Schulte
- Institute of Botany, Technische Universität Dresden, Dresden, Germany.,Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Beatrice Weber
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Susan Liedtke
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Sarah Breitenbach
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Kristin Morgenstern
- Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, Tharandt, Germany
| | | | - Ute Tröber
- Staatsbetrieb Sachsenforst, Pirna, Germany
| | - Heino Wolf
- Staatsbetrieb Sachsenforst, Pirna, Germany
| | - Doris Krabel
- Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, Tharandt, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
41
|
Chumová Z, Záveská E, Hloušková P, Ponert J, Schmidt PA, Čertner M, Mandáková T, Trávníček P. Repeat proliferation and partial endoreplication jointly shape the patterns of genome size evolution in orchids. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 107:511-524. [PMID: 33960537 DOI: 10.1111/tpj.15306] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Revised: 04/27/2021] [Accepted: 05/03/2021] [Indexed: 05/21/2023]
Abstract
Although the evolutionary drivers of genome size change are known, the general patterns and mechanisms of plant genome size evolution are yet to be established. Here we aim to assess the relative importance of proliferation of repetitive DNA, chromosomal variation (including polyploidy), and the type of endoreplication for genome size evolution of the Pleurothallidinae, the most species-rich orchid lineage. Phylogenetic relationships between 341 Pleurothallidinae representatives were refined using a target enrichment hybrid capture combined with high-throughput sequencing approach. Genome size and the type of endoreplication were assessed using flow cytometry supplemented with karyological analysis and low-coverage Illumina sequencing for repeatome analysis on a subset of samples. Data were analyzed using phylogeny-based models. Genome size diversity (0.2-5.1 Gbp) was mostly independent of profound chromosome count variation (2n = 12-90) but tightly linked with the overall content of repetitive DNA elements. Species with partial endoreplication (PE) had significantly greater genome sizes, and genomic repeat content was tightly correlated with the size of the non-endoreplicated part of the genome. In PE species, repetitive DNA is preferentially accumulated in the non-endoreplicated parts of their genomes. Our results demonstrate that proliferation of repetitive DNA elements and PE together shape the patterns of genome size diversity in orchids.
Collapse
Affiliation(s)
- Zuzana Chumová
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
- Department of Botany, Faculty of Science, Charles University, Benátská 2, Prague, CZ-12800, Czech Republic
| | - Eliška Záveská
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
- Department of Botany, University of Innsbruck, Sternwartestraße 15, Innsbruck, 6020, Austria
| | | | - Jan Ponert
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
- Prague Botanical Garden, Trojská 800/196, Prague, CZ-17100, Czech Republic
- Department of Experimental Plant Biology, Faculty of Science, Charles University, Viničná 5, Prague, CZ-12844, Czech Republic
| | - Philipp-André Schmidt
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
| | - Martin Čertner
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
- Department of Botany, Faculty of Science, Charles University, Benátská 2, Prague, CZ-12800, Czech Republic
| | - Terezie Mandáková
- CEITEC, Masaryk University, Brno, CZ-62500, Czech Republic
- Faculty of Science, Masaryk University, Brno, CZ-62500, Czech Republic
| | - Pavel Trávníček
- Institute of Botany of the Czech Academy of Sciences, Zámek 1, Průhonice, CZ-25243, Czech Republic
| |
Collapse
|
42
|
Baker WJ, Dodsworth S, Forest F, Graham SW, Johnson MG, McDonnell A, Pokorny L, Tate JA, Wicke S, Wickett NJ. Exploring Angiosperms353: An open, community toolkit for collaborative phylogenomic research on flowering plants. AMERICAN JOURNAL OF BOTANY 2021; 108:1059-1065. [PMID: 34293179 DOI: 10.1002/ajb2.1703] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 05/14/2021] [Indexed: 06/13/2023]
Affiliation(s)
| | - Steven Dodsworth
- School of Life Sciences, University of Bedfordshire, University Square, Luton, LU1 3JU, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Sean W Graham
- Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
| | - Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, 79409, USA
| | - Angela McDonnell
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL, 60022, USA
| | - Lisa Pokorny
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Jennifer A Tate
- School of Fundamental Sciences, Massey University, Palmerston North, 4442, New Zealand
| | - Susann Wicke
- Plant Evolutionary Biology, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Plant Systematics and Biodiversity, Institute for Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL, 60022, USA
| |
Collapse
|
43
|
Cacho NI, McIntyre PJ, Kliebenstein DJ, Strauss SY. Genome size evolution is associated with climate seasonality and glucosinolates, but not life history, soil nutrients or range size, across a clade of mustards. ANNALS OF BOTANY 2021; 127:887-902. [PMID: 33675229 PMCID: PMC8225284 DOI: 10.1093/aob/mcab028] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Accepted: 02/21/2021] [Indexed: 05/02/2023]
Abstract
BACKGROUND AND AIMS We investigate patterns of evolution of genome size across a morphologically and ecologically diverse clade of Brassicaceae, in relation to ecological and life history traits. While numerous hypotheses have been put forward regarding autecological and environmental factors that could favour small vs. large genomes, a challenge in understanding genome size evolution in plants is that many hypothesized selective agents are intercorrelated. METHODS We contribute genome size estimates for 47 species of Streptanthus Nutt. and close relatives, and take advantage of many data collections for this group to assemble data on climate, life history, soil affinity and composition, geographic range and plant secondary chemistry to identify simultaneous correlates of variation in genome size in an evolutionary framework. We assess models of evolution across clades and use phylogenetically informed analyses as well as model selection and information criteria approaches to identify variables that can best explain genome size variation in this clade. KEY RESULTS We find differences in genome size and heterogeneity in its rate of evolution across subclades of Streptanthus and close relatives. We show that clade-wide genome size is positively associated with climate seasonality and glucosinolate compounds. Model selection and information criteria approaches identify a best model that includes temperature seasonality and fraction of aliphatic glucosinolates, suggesting a possible role for genome size in climatic adaptation or a role for biotic interactions in shaping the evolution of genome size. We find no evidence supporting hypotheses of life history, range size or soil nutrients as forces shaping genome size in this system. CONCLUSIONS Our findings suggest climate seasonality and biotic interactions as potential forces shaping the evolution of genome size and highlight the importance of evaluating multiple factors in the context of phylogeny to understand the effect of possible selective agents on genome size.
Collapse
Affiliation(s)
- N Ivalú Cacho
- Instituto de Biología, Universidad Nacional Autónoma de México. Circuito Exterior, Ciudad Universitaria, Mexico City, Mexico
- Center for Population Biology, University of California, One Shields Avenue, Davis, CA, USA
- Department of Evolution of Ecology, University of California, One Shields Avenue, Davis, CA, USA
| | - Patrick J McIntyre
- Center for Population Biology, University of California, One Shields Avenue, Davis, CA, USA
- NatureServe, Boulder, CO, USA
| | - Daniel J Kliebenstein
- Department of Plant Sciences, University of California, One Shields Avenue, Davis, CA, USA
- DynaMo Centre of Excellence, University of Copenhagen, Thorvaldsensvej 40, Frederiksberg C, Denmark
| | - Sharon Y Strauss
- Center for Population Biology, University of California, One Shields Avenue, Davis, CA, USA
| |
Collapse
|
44
|
Negm S, Greenberg A, Larracuente A, Sproul J. RepeatProfiler: A pipeline for visualization and comparative analysis of repetitive DNA profiles. Mol Ecol Resour 2021; 21:969-981. [PMID: 33277787 PMCID: PMC7954937 DOI: 10.1111/1755-0998.13305] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 11/11/2020] [Accepted: 11/30/2020] [Indexed: 12/20/2022]
Abstract
Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single-copy DNA, repetitive sequences can reveal signals of evolutionary history over short time scales that may not be evident in sequences from slower-evolving genomic regions. Many tools for studying REs are directed toward organisms with existing genomic resources, including genome assemblies and repeat libraries. However, signals in repeat variation may prove especially valuable in disentangling evolutionary histories in diverse non-model groups, for which genomic resources are limited. Here, we introduce RepeatProfiler, a tool for generating, visualizing, and comparing repetitive element DNA profiles from low-coverage, short-read sequence data. RepeatProfiler automates the generation and visualization of RE coverage depth profiles (RE profiles) and allows for statistical comparison of profile shape across samples. In addition, RepeatProfiler facilitates comparison of profiles by extracting signal from sequence variants across profiles which can then be analysed as molecular morphological characters using phylogenetic analysis. We validate RepeatProfiler with data sets from ground beetles (Bembidion), flies (Drosophila), and tomatoes (Solanum). We highlight the potential of RE profiles as a high-resolution data source for studies in species delimitation, comparative genomics, and repeat biology.
Collapse
Affiliation(s)
- S. Negm
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - A. Greenberg
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - A.M. Larracuente
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| | - J.S. Sproul
- University of Rochester, Department of Biology, 337 Hutchison Hall, Rochester, NY, 14627
| |
Collapse
|
45
|
Sproul JS, Barton LM, Maddison DR. Repetitive DNA Profiles Reveal Evidence of Rapid Genome Evolution and Reflect Species Boundaries in Ground Beetles. Syst Biol 2021; 69:1137-1148. [PMID: 32267949 DOI: 10.1093/sysbio/syaa030] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 03/16/2020] [Accepted: 03/30/2020] [Indexed: 12/11/2022] Open
Abstract
Genome architecture is a complex, multidimensional property of an organism defined by the content and spatial organization of the genome's component parts. Comparative study of entire genome architecture in model organisms is shedding light on mechanisms underlying genome regulation, evolution, and diversification, but such studies require costly analytical approaches which make extensive comparative study impractical for most groups. However, lower-cost methods that measure a single architectural component (e.g., distribution of one class of repeats) have potential as a new data source for evolutionary studies insofar as that measure correlates with more complex biological phenomena, and for which it could serve as part of an explanatory framework. We investigated copy number variation (CNV) profiles in ribosomal DNA (rDNA) as a simple measure reflecting the distribution of rDNA subcomponents across the genome. We find that signatures present in rDNA CNV profiles strongly correlate with species boundaries in the breve species group of Bembidion, and vary across broader taxonomic sampling in Bembidion subgenus Plataphus. Profiles of several species show evidence of re-patterning of rDNA-like sequences throughout the genome, revealing evidence of rapid genome evolution (including among sister pairs) not evident from analysis of traditional data sources such as multigene data sets. Major re-patterning of rDNA-like sequences has occurred frequently within the evolutionary history of Plataphus. We confirm that CNV profiles represent an aspect of genomic architecture (i.e., the linear distribution of rDNA components across the genome) via fluorescence in-situ hybridization. In at least one species, novel rDNA-like elements are spread throughout all chromosomes. We discuss the potential of copy number profiles of rDNA, or other repeats, as a low-cost tool for incorporating signal of genomic architecture variation in studies of species delimitation and genome evolution. [Bembidion; Carabidae; copy number variation profiles; rapid genome evolution; ribosomal DNA; species delimitation.].
Collapse
Affiliation(s)
- John S Sproul
- Department of Integrative Biology, Oregon State University, 3029 Cordley Hall, Corvallis, OR 97331, USA.,Department of Biology, University of Rochester, 402 Hutchison Hall, PO Box 270211, Rochester, NY 14627, USA
| | - Lindsey M Barton
- Department of Integrative Biology, Oregon State University, 3029 Cordley Hall, Corvallis, OR 97331, USA
| | - David R Maddison
- Department of Integrative Biology, Oregon State University, 3029 Cordley Hall, Corvallis, OR 97331, USA
| |
Collapse
|
46
|
Bakker FT, Bieker VC, Martin MD. Editorial: Herbarium Collection-Based Plant Evolutionary Genetics and Genomics. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.603948] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
47
|
Bohmann K, Mirarab S, Bafna V, Gilbert MTP. Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification. Mol Ecol 2020; 29:2521-2534. [PMID: 32542933 PMCID: PMC7496323 DOI: 10.1111/mec.15507] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 06/03/2020] [Accepted: 06/05/2020] [Indexed: 02/06/2023]
Abstract
Genetic tools are increasingly used to identify and discriminate between species. One key transition in this process was the recognition of the potential of the ca 658bp fragment of the organelle cytochrome c oxidase I (COI) as a barcode region, which revolutionized animal bioidentification and lead, among others, to the instigation of the Barcode of Life Database (BOLD), containing currently barcodes from >7.9 million specimens. Following this discovery, suggestions for other organellar regions and markers, and the primers with which to amplify them, have been continuously proposed. Most recently, the field has taken the leap from PCR-based generation of DNA references into shotgun sequencing-based "genome skimming" alternatives, with the ultimate goal of assembling organellar reference genomes. Unfortunately, in genome skimming approaches, much of the nuclear genome (as much as 99% of the sequence data) is discarded, which is not only wasteful, but can also limit the power of discrimination at, or below, the species level. Here, we advocate that the full shotgun sequence data can be used to assign an identity (that we term for convenience its "DNA-mark") for both voucher and query samples, without requiring any computationally intensive pretreatment (e.g. assembly) of reads. We argue that if reference databases are populated with such "DNA-marks," it will enable future DNA-based taxonomic identification to complement, or even replace PCR of barcodes with genome skimming, and we discuss how such methodology ultimately could enable identification to population, or even individual, level.
Collapse
Affiliation(s)
- Kristine Bohmann
- Section for Evolutionary GenomicsThe GLOBE InstituteUniversity of CopenhagenCopenhagenDenmark
| | - Siavash Mirarab
- Department of Electrical and Computer EngineeringUniversity of CaliforniaSan DiegoCAUSA
| | - Vineet Bafna
- Department of Computer Science and EngineeringUniversity of CaliforniaSan DiegoCAUSA
| | - M. Thomas P. Gilbert
- Section for Evolutionary GenomicsThe GLOBE InstituteUniversity of CopenhagenCopenhagenDenmark
- Center for Evolutionary HologenomicsThe GLOBE InstituteUniversity of CopenhagenCopenhagenDenmark
- NTNU University MuseumTrondheimNorway
| |
Collapse
|
48
|
Zwyrtková J, Němečková A, Čížková J, Holušová K, Kapustová V, Svačina R, Kopecký D, Till BJ, Doležel J, Hřibová E. Comparative analyses of DNA repeats and identification of a novel Fesreba centromeric element in fescues and ryegrasses. BMC PLANT BIOLOGY 2020; 20:280. [PMID: 32552738 PMCID: PMC7302162 DOI: 10.1186/s12870-020-02495-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 06/15/2020] [Indexed: 05/10/2023]
Abstract
BACKGROUND Cultivated grasses are an important source of food for domestic animals worldwide. Increased knowledge of their genomes can speed up the development of new cultivars with better quality and greater resistance to biotic and abiotic stresses. The most widely grown grasses are tetraploid ryegrass species (Lolium) and diploid and hexaploid fescue species (Festuca). In this work, we characterized repetitive DNA sequences and their contribution to genome size in five fescue and two ryegrass species as well as one fescue and two ryegrass cultivars. RESULTS Partial genome sequences produced by Illumina sequencing technology were used for genome-wide comparative analyses with the RepeatExplorer pipeline. Retrotransposons were the most abundant repeat type in all seven grass species. The Athila element of the Ty3/gypsy family showed the most striking differences in copy number between fescues and ryegrasses. The sequence data enabled the assembly of the long terminal repeat (LTR) element Fesreba, which is highly enriched in centromeric and (peri)centromeric regions in all species. A combination of fluorescence in situ hybridization (FISH) with a probe specific to the Fesreba element and immunostaining with centromeric histone H3 (CENH3) antibody showed their co-localization and indicated a possible role of Fesreba in centromere function. CONCLUSIONS Comparative repeatome analyses in a set of fescues and ryegrasses provided new insights into their genome organization and divergence, including the assembly of the LTR element Fesreba. A new LTR element Fesreba was identified and found in abundance in centromeric regions of the fescues and ryegrasses. It may play a role in the function of their centromeres.
Collapse
Affiliation(s)
- Jana Zwyrtková
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Alžběta Němečková
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Jana Čížková
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Kateřina Holušová
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Veronika Kapustová
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Radim Svačina
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - David Kopecký
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Bradley John Till
- Centro de Genómica Nutricional Agroacuícola, Las Heras 350, Temuco, Chile
| | - Jaroslav Doležel
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| | - Eva Hřibová
- Institute of Experimental Botany, Czech Academy of Sciences, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-77900 Olomouc, Czech Republic
| |
Collapse
|
49
|
Easterling KA, Pitra NJ, Morcol TB, Aquino JR, Lopes LG, Bussey KC, Matthews PD, Bass HW. Identification of tandem repeat families from long-read sequences of Humulus lupulus. PLoS One 2020; 15:e0233971. [PMID: 32502183 PMCID: PMC7274563 DOI: 10.1371/journal.pone.0233971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Accepted: 05/16/2020] [Indexed: 11/28/2022] Open
Abstract
Hop (Humulus lupulus L.) is known for its use as a bittering agent in beer and has a rich history of cultivation, beginning in Europe and now spanning the globe. There are five wild varieties worldwide, which may have been introgressed with cultivated varieties. As a dioecious species, its obligate outcrossing, non-Mendelian inheritance, and genomic structural variability have confounded directed breeding efforts. Consequently, understanding the hop genome represents a considerable challenge, requiring additional resources. In order to facilitate investigations into the transmission genetics of hop, we report here a tandem repeat discovery pipeline developed using k-mer filtering and dot plot analysis of PacBio long-read sequences from the hop cultivar Apollo. From this we identified 17 new and distinct tandem repeat sequence families, which represent candidates for FISH probe development. For two of these candidates, HuluTR120 and HuluTR225, we produced oligonucleotide FISH probes from conserved regions of and demonstrated their utility by staining meiotic chromosomes from wild hop, var. neomexicanus to address, for example, questions about hop transmission genetics. Collectively, these tandem repeat sequence families represent new resources suitable for development of additional cytogenomic tools for hop research.
Collapse
Affiliation(s)
- Katherine A. Easterling
- Department of Biological Science, Florida State University, Tallahassee, FL, United States America
- Hopsteiner, S.S. Steiner, Inc., New York, New York, United States America
| | - Nicholi J. Pitra
- Hopsteiner, S.S. Steiner, Inc., New York, New York, United States America
| | - Taylan B. Morcol
- Hopsteiner, S.S. Steiner, Inc., New York, New York, United States America
- Department of Biological Sciences, Lehman College, City University of New York, Bronx, New York, United States America
- The Graduate Center, City University of New York, New York, New York, United States America
| | - Jenna R. Aquino
- Department of Biological Science, Florida State University, Tallahassee, FL, United States America
| | - Lauren G. Lopes
- Department of Biological Science, Florida State University, Tallahassee, FL, United States America
| | - Kristin C. Bussey
- Department of Biological Science, Florida State University, Tallahassee, FL, United States America
| | - Paul D. Matthews
- Hopsteiner, S.S. Steiner, Inc., New York, New York, United States America
| | - Hank W. Bass
- Department of Biological Science, Florida State University, Tallahassee, FL, United States America
| |
Collapse
|
50
|
Lunerová J, Herklotz V, Laudien M, Vozárová R, Groth M, Kovařík A, Ritz CM. Asymmetrical canina meiosis is accompanied by the expansion of a pericentromeric satellite in non-recombining univalent chromosomes in the genus Rosa. ANNALS OF BOTANY 2020; 125:1025-1038. [PMID: 32095807 PMCID: PMC7262465 DOI: 10.1093/aob/mcaa028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 02/24/2020] [Indexed: 05/02/2023]
Abstract
BACKGROUND AND AIMS Despite their abundant odd-ploidy (2n = 5x = 35), dogroses (Rosa sect. Caninae) are capable of sexual reproduction due to their unique meiosis. During canina meiosis, two sets of chromosomes form bivalents and are transmitted by male and female gametes, whereas the remaining chromosomes form univalents and are exclusively transmitted by the egg cells. Thus, the evolution of chromosomes is expected to be driven by their behaviour during meiosis. METHODS To gain insight into differential chromosome evolution, fluorescence in situ hybridization was conducted for mitotic and meiotic chromosomes in four dogroses (two subsections) using satellite and ribosomal DNA probes. By exploiting high-throughput sequencing data, we determined the abundance and diversity of the satellite repeats in the genus Rosa by analysing 20 pentaploid, tetraploid and diploid species in total. KEY RESULTS A pericentromeric satellite repeat, CANR4, was found in all members of the genus Rosa, including the basal subgenera Hulthemia and Hesperhodos. The satellite was distributed across multiple chromosomes (5-20 sites per mitotic cell), and its genomic abundance was higher in pentaploid dogroses (2.3 %) than in non-dogrose species (1.3 %). In dogrose meiosis, univalent chromosomes were markedly enriched in CANR4 repeats based on both the number and the intensity of the signals compared to bivalent-forming chromosomes. Single-nucleotide polymorphisms and cluster analysis revealed high intragenomic homogeneity of the satellite in dogrose genomes. CONCLUSIONS The CANR4 satellite arose early in the evolution of the genus Rosa. Its high content and extraordinary homogeneity in dogrose genomes is explained by its recent amplification in non-recombining chromosomes. We hypothesize that satellite DNA expansion may contribute to the divergence of univalent chromosomes in Rosa species with non-symmetrical meiosis.
Collapse
Affiliation(s)
- Jana Lunerová
- Department of Molecular Epigenetics, Institute of Biophysics, Czech Academy of Sciences, Brno, Czech Republic
| | - Veit Herklotz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
| | - Melanie Laudien
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
- Technical University Dresden, International Institute Zittau (IHI), Chair of Biodiversity of Higher Plants, Zittau, Germany
| | - Radka Vozárová
- Department of Molecular Epigenetics, Institute of Biophysics, Czech Academy of Sciences, Brno, Czech Republic
- Masaryk University, Faculty of Science, Brno, Czech Republic
| | - Marco Groth
- Leibniz Institute on Ageing – Fritz Lipmann Institute, Jena, Germany
| | - Aleš Kovařík
- Department of Molecular Epigenetics, Institute of Biophysics, Czech Academy of Sciences, Brno, Czech Republic
| | - Christiane M Ritz
- Department of Botany, Senckenberg Museum of Natural History Görlitz, Görlitz, Germany
| |
Collapse
|